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(57) Abstract: The invention relates generally to genes that encode proteins that inhibit axonal growth. The invention relates specif- 
ically 10 genes encodign NgR protein homologs in humans and mice. The invention also includes compositions and methods for 
^ modulating the expression and activity of Nogo and the NgR proteins. Specifically, the invention includes peptides, proteins and 
^ antibodies that block Nogo-mediated inhibition of axonal extension. The compositions and methods of the invention are useful in 
<^ the treatment of cranial o cerebral trauma, sprial cord injury, stroke or a demyelinating disease. fflP 
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NOGO RECEPTOR HOMOLOGS 

FIELD OF THE INVENTION 

The invention relates to neurology and molecular biology. More particularly, 
the invention relates to CNS neurons and axonal growth 

BACKGROUND 

Among the mechanisms through which the cells of an organism communicate 
with each other and obtain information and stimuli from their environment is through 
cell membrane receptor molecules expressed on the cell surface. Many such receptors 
have been identified, characterized, and sometimes classified into major receptor 
superfamilies based on structural motifs and signal transduction features. The receptors 
are a first essential link for translating an extracellular signal into a cellular 
physiological response. 

Receptors on neurons are particularly important in the development of the 
nervous system during embryogenesis. The neurons form connections with target cells 
during development through axonal extension of the neurons toward the target cells in 
a receptor-mediated process. Axons and dendrites have a specialized region of their 
distal tips known as the growth cone. Growth cones enable the neuron to sense the 
local environment through a receptor-mediated process and direct the movement of the 
axon or dendrite of the neuron toward the neuron's target cell. This process is known 
as elongation. Growth cones can be sensitive to several guidance cues, for example, 
surface adhesiveness, growth factors, neurotransmitters and electric fields. The 
guidance of growth at the cone depends on various classes of adhesion molecules, 
intercellular signals, as well as factors that stimulate and inhibit growth cones. 

Interestingly, damaged neurons do not elongate in the central nervous system 
(CNS) following injury due to trauma or disease, whereas axons in the peripheral 
nervous system (PNS) regenerate readily. The fact that damaged CNS neurons fail to 
elongate is not due to an intrinsic property of CNS axons, but rather due to the CNS 
environment that is not permissive for axonal elongation. Classical grafting 
experiments by Aguayo and colleagues (e.g., Richardson et al y (1980) Nature 284, 
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264-265) demonstrated that CNS axons can in fact elongate over substantial distances 
within peripheral nerve grafts, and that CNS myelin inhibits CNS axon elongation. 
Therefore, given the appropriate environment, CNS axons can regenerate, implying 
that CNS axonal injury can potentially be addressed by appropriate manipulation of the 
5 CNS environment. 

The absence of axon regeneration following injury can be attributed to the 
presence of axon growth inhibitors. These inhibitors are predominantly associated 
with myelin and constitute an important barrier to regeneration. Axon growth 
inhibitors are present in CNS-derived myelin and the plasma membrane of 

10 oligodendrocytes that synthesize myelin in the CNS (Schwab et aL, (1993) Anna. Rev. 
Neurosci. 16, 565-595).* Myelin-associated inhibitors appear to be a primary 
contributor to the failure of CNS axon regeneration in vivo after an interruption of 
axonal continuity, whereas other non-myelin associated axon growth inhibitors in the 
CNS may play a lesser role. These inhibitors block axonal regeneration following 

15 neuronal injury due to trauma, stroke or viral infection. 

Numerous myelin-derived axon growth inhibitors have been characterized (see, 
for review, David et aL, (1999) W0995394547; Bandman et aL, (1999) U.S. Patent 
No, 5,858,708; Schwab, (1996) Neurochem. Res. 21, 755-761). Several components 
of CNS white matter, NI35, NI250 (Nogo) and Myelin-associated glycoprotein 

20 (MAG), which have inhibitory activity for axonal extension, have been described as 
well (Schwab et aL, (1990) WO9005191; Schwab et aL, (1997) U.S. Patent No. 
5,684,133). In particular, Nogo is a 250 kDa myelin-associated axon growth inhibitor 
that was originally characterized based on the effects of the purified protein in vitro 
and monoclonal antibodies that neutralize the protein's activity (Schwab (1990) Exp. 

25 Neurol 109,2-5). The Nogo cDNA was first identified through random analysis of 
brain cDNA and had no suggested function (Nagase etaL, (1998) DNA Res. 5, 
355-364). The identification of this Nogo cDNA as the cDNA encoding the 250 kDa 
myelin-associated axon growth inhibitor was discovered only recently (GrandPre et aL, 
(2000) Nature 403, 439-444; Chen et al. y (2000) Nature 403, 434-439; Prinjha at aL, 

30 (2000) Nature 403, 383-384). 

Importantly, Nogo has been shown to be the primary component of CNS 
myelin responsible for inhibiting axonal elongation and regeneration. Nogo's selective 
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expression by oligodendrocytes and not by Schwann cells (the cells that myelinate P.S. 
axons) is consistent with the inhibitory effects of CNS myelin, in contrast to P.S. 
myelin (GrandPre et aL, (2000) Nature 403, 434-439). In culture, Nogo inhibits 
axonal elongation and causes growth cone collapse (Spillmann et aL, (1998) J. Biol. 
5 Chem. 272, 19283-19293). Antibodies {e.g., IN-1) against Nogo have been shown to 
block most of the inhibitory action of CNS myelin on neurite growth in vitro 
(Spillmann et aL, (1998) J. Biol. Chem. 272:19283-19293). These experiments 
indicate that Nogo is the main component of CNS myelin responsible for inhibition of 
axonal elongation in culture. Furthermore, in vivo, the IN-1 antibody has been shown 

10 to enhance axonal regeneration after spinal cord injury, resulting in recovery of 

behaviors such as contact placing and stride length (Schnell and Schwab (1990) Nature 
343, 269-272; Bregman et aL, (1995) Nature 378, 498-501). Thus, there is 
substantial evidence that Nogo is a disease-relevant molecular target. Agents that 
interfere with the binding of Nogo to its receptor would be expected to improve axonal 

15 regeneration in clinical states in which axons have been damaged, and improve patient 
outcome. 

Modulation of Nogo has been described as a means for treatment of 
regeneration for neurons damaged by trauma, infarction and degenerative disorders of 
the CNS (Schwab et aL, (1994) W09417831; Tatagiba et aL, (1997) Neurosurgery 
20 40, 541-546) as well as malignant tumors in the CNS such as glioblastoma (Schwab et 
aL, (1993) U.S. Patent No. 5,250,414); Schwab etaL, (2000) U.S. Patent No. 
6,025,333). 

Antibodies which recognize Nogo have been suggested to be useful in the 
diagnosis and treatment of nerve damage resulting from trauma, infarction and 

25 degenerative disorders of the CNS (Schnell & Schwab, (1990) Nature 343, 269-272; 
Schwab et aL, (1997) U.S. Patent No. 5,684,133). For CNS axons, there is a 
correlation between the presence of myelin and the inhibition of axon regeneration 
over long distances (Savio and Schwab (1990) Proc. Natl. Acad Set 87, 4130-4133; 
Keirstead et al. 9 (1992) Proc. Natl. Acad. Sci. 89, 1 1664-1 1668). After Nogo is 

30 blocked by antibodies, neurons can again extend across lesions caused by nerve 
damage (Schnell and Schwab (1990) Nature 343, 269-272). 
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SUMMARY OF THE INVENTION 

Genes encoding homologs (NgR2 and NgR3) of a Nogo receptor (NgRl) in 
mice and humans have been discovered. Various domains in the polypeptides encoded 
by the NgR2 and NgR3 genes have been identified and compared to domains in mouse 
5 and human NgRl polypeptides. This comparison has led to identification of a 

consensus sequence (NgR consensus sequence) that characterizes a family of proteins 
(NgR family). Based on these and other discoveries, the invention features molecules 
and methods for modulating axonal growth in CNS neurons. 

The invention provides a polypeptide that contains a polypeptide containing a 
10 tryptophan rich LRRCT domain consisting of the amino acid sequence: 

NX 1 WX 2 CX 3 CRARX 4 LWX 5 WX 6 X 7 X 8 X 9 RX 10 SSSX 11 V 

X| 2 C X 13 X 14 P X 15 X 16 X 17 X 18 X 19 X 2 o D L X 21 X 22 L X^ X 24 X 25 D 

15 

X 26 X 27 X 28 C [SEQ ID NO: 19] 

wherein X is any protein amino acid or a gap, and the polypeptide does not 
include amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 
20 (human NgRl) or SEQ ID NO: 17 (mouse NgRl). 

Preferably, XI 7 and X23 are (independently) arginine or lysine. In some 
embodiments, the amino acid sequence of the LRRCT domain is residues 261-3 10 of 
SEQ ID NO:2, or residues 261-3 10 of SEQ ID NO: 2 with up to 10 conservative 
25 amino acid substitutions. In some embodiments, the polypeptide contains the 
following NTLRRCT amino acid sequence: 

CPXj X 2 C X 3 C Y X 4 X 5 P Xg X 7 T X 8 S C X9 X 10 X^ X 12 X 13 X 14 X^ X^ P 
X 17 X 18 X 19 P X 20 X 21 X 22 X^ R X 24 F L X 25 X^ N X 27 I X 28 X^ X 30 X 31 X 32 X 33 
X 34 F X 35 X 36 X 37 X 38 X 39 X 40 X 41 X 42 L W X^ X^ S N X 45 X^ X 47 X 48 I X 49 
30 X^ X 51 X 52 F X^ X 54 X^ X 56 X5 7 L E X^ L D L X 59 D N Xgo X^ L X 62 X^ X^ 

P T F Xg-7 G L Xgg Xg 9 L X 70 X 71 L X 72 L X 73 X 74 C X 75 L X 76 X^ L X 78 
X 79 X 80 X 81 F X g2 G L X 83 X 84 L Q Y L Y L Q X 85 N X 86 Xg, Xs 8 L Xso D 
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X.^F^DL^NL^HLFLHGN^ X^X^X^X^ X 101 X 102 
^103 ^io4 F R G L X 105 X 106 L D R L L L H X 107 N X 108 X 109 X u0 X m V H X m 

A F X 114 X 115 L X 116 R L X 117 X U8 L X 119 L F X 120 N X 121 L X 122 X 123 L 
X124 X 125 X 126 X 127 L X 128 X 129 L X 130 X 131 L X 132 X 133 LRLN X 134 N X 135 W 
X 136 C X 137 C R X 138 R X 139 L W X 140 W X 141 X 142 X 143 X 144 R X 145 S S S X 146 
V X 147 C X 148 X 149 P X 150 X 151 X 152 X 153 X 154 X 155 D L X 156 X 157 L X 158 X 159 X 160 
D X 161 X 162 X 163 C [SEQ ID NO: 18] 



wherein X is any amino acid residue or a gap and wherein the polypeptide is not the 

10 polypeptide of SEQ ID NO: 5 (human NgRl) or SEQ ID NO: 17 (mouse NgRl). For 
example, X 6 , X 37 and X 38 may represent a gap. Specific examples of polypeptides of 
the invention are SEQ ID NO: 2 (human NgR2), SEQ ID NO: 4 (mouse NgR3), and 
SEQ ID NO: 14 (human NgR3). In some embodiments, the polypeptide contains: (a) 
a NTLRRCT domain, and (b) less than a complete CTS domain, provided that a partial 

15 CTS domain, if present, consists of no more than the first 39 amino acids of the CTS 
domain. While the polypeptide may contain a functional GPI domain, a functional GPI 
domain may be absent, e.g., when a soluble polypeptide is desired. A polypeptide of 
the invention optionally includes an amino acid sequence of a heterologous 
polypeptide, e.g., an Fc portion of an antibody. 

20 The invention also provides a nucleic acid encoding an above-described 

polypeptide; a vector containing the nucleic acid, which nucleic acid may be operably 
linked to an expression control sequence; and a transformed host cell containing the 
vector. A method of producing a polypeptide of the invention is also provided. The 
method includes introducing a nucleic acid encoding the above-described polypeptide 

25 into a host cell, culturing the cell under conditions suitable for expression of the 
polypeptide, and recovering the polypeptide. 

The invention also provides an antisense molecule whose nucleotide sequence 
is complementary to a nucleotide sequence encoding a polypeptide selected from the 
group consisting of: a polypeptide consisting of residues 311-395 of SEQ ID NO: 2, a 

30 polypeptide consisting of residues 256-396 of SEQ ID NO: 14 and a polypeptide 

consisting of residues 321-438 of SEQ ID NO: 4, wherein the nucleic acid is from 8 to 
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100 nucleotides in length, e.g., about 20, 30, 40, 50, 60, 70, 80 or 90 nucleotides. The 
invention also provides a nucleic acid encoding such an antisense molecule. 

The invention also provides an antibody that binds to an above-described 
polypeptide. Polypeptides or antibodies of the invention can be formulated into 
5 pharmaceutical compositions containing the polypeptide or antibody and a 
pharmaceutically acceptable carrier. 

The invention also provides a method for decreasing inhibition of axonal 
growth of a CNS neuron. The method includes the step of contacting the neuron with 
an effective amount of a polypeptide or antibody of the invention. 

10 The invention also provides a method for treating a central nervous system disease, 

disorder or injury. The method includes administering to a mammal, e.g., a human, an 
effective amount of a polypeptide or antibody of the invention. Exemplary diseases, 
disorders and injuries that may be treated using molecules and methods of the 
invention include, but are not limited to, cerebral injury, spinal cord injury, stroke, 

15 demyelinating diseases, e.g., multiple sclerosis, monophasic demyelination, 
encephalomyelitis, multifocal leukoencephalopathy, panencephalitis, 
Marchiafava-Bignami disease, Spongy degeneration, Alexanders disease, Canavan's 
disease, metachromatic leukodystrophy and Krabbe's disease. 

The invention also provides a method for identifying a molecule that binds a 

20 ' polypeptide of the invention. The method includes the steps of: (a) providing a 
polypeptide of the invention; (b) contacting the polypeptide with the candidate 
molecule; and (c) detecting binding of the candidate molecule to the polypeptide. 

Unless otherwise defined, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which the 

25 invention belongs. In case of conflict, the present application, including definitions, 
will control. All publications, patent and other references mentioned herein are 
incorporated by reference. 

The materials, methods and examples presented below are illustrative only, and 
not intended to be limiting. Other features and advantages of the invention will be 

30 apparent from the detail description and from the claims. 

BRIEF DESCRIPTION OF THE FIGURES 
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Fig. 1 A-1B shows an alignment of NgR2 (SEQ ID NO:2) and NgR3 (SEQ ID 
NO:4) with the known NgR, NgRl (SEQ ID NO:5) and the Consensus Sequence 
(SEQIDNO:6). 

5 

Fig. 2. mNgR3 does not bind hNogoA(1055-l 120). COS-7 cells were 
transfected with vectors encoding myc-NgRl or myc-NgR3, fixed, and stained with 
anti-myc antibodies or AP-hNogoA(1055-l 120). 

10 Fig.3. An alignment of the amino acid sequences of human NgRl, murine 

NgRl, murine NgR3, human NgR3 and human NgR2. Numbering begins with amino 
acid #1 of murine NgR3. The consensus sequence is listed below. The LRR NT 
domain is indicated by a shaded box; domains LLR 1, LLR 3, LLR 5, and LLR 7 are 
indicated by open boxes; LLR 2, LLR 4, LLR 6 and LLR 8 are indicated by shaded 

15 boxes; and the LLR CT domain is indicated by a shaded box. Amino acids in bold in 
LLR 8 indicate a conserved glycosylation sites. A dot indicates conserved cystine 
residue in LRR4. Box at C terminus indicates putative GPI signals. 

20 DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides purified and isolated polynucleotides (e.g., 
DNA sequences and RNA transcripts, both sense and complementary antisense 
strands, both single- and double-stranded, including splice variants thereof) encoding 
NgR homologs, referred to herein as NgR. Unless indicated otherwise, as used herein, 

25 the abbreviation in lower case (NgR) refers to a gene, cDNA, RNA or nucleic acid 
sequence, whereas the upper case version (NgR) refers to a protein, polypeptide, 
peptide, oligopeptide, or amino acid sequence. Specific proteins are designated by 
number, e.g., "NgR2 M is a human NgR homolog, "NgR3 " is a murine-derived NgR 
homolog, and "NgRl" is the known NgR identified by Dr. Stephen Strittmatter. 

30 Known NgRs are herein referred to as "NgRs." DNA polynucleotides of the invention 
include genomic DNA, cDNA and DNA that has been chemically synthesized in whole 
or in part. 
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Standard reference works setting forth the general principles of recombinant 
DNA technology known to those of skill in the art include Ausubel et aL, CURRENT 
Protocols in Molecular Biology, John Wiley & Sons, New York (1998); 
Sambrook et al, Molecular Cloning: A Laboratory Manual, 2d Ed., Cold 
5 Spring Harbor Laboratory Press, Plainview, New York (1989); Kaufman et al , Eds., 
Handbook of Molecular and Cellular Methods in Biology and Medicine, 
CRC Press, Boca Raton (1995); McPherson, Ed., DIRECTED MUTAGENESIS: A 
Practical Approach, IRL Press, Oxford (1991). 

As used herein, the term "axon" refers to a long cellular protrusion from a 
10 neuron, whereby action potentials are conducted, either to or from the cell body. 

As used herein, the term "axonal growth 11 refers to an extension of the long 
process or axon, originating at the cell body and proceeded by the growth cone. 

As used herein, the term "central nervous system disorder" refers to any 
pathological state associated with abnormal function of the central nervous system 
15 (CNS). The term includes, but is not limited to, altered CNS function resulting from 

physical trauma to cerebral tissue, viral infection, autoimmune machanisms and genetic 
mutation. 

As used herein, the term "demyelinating disease" refers to a pathological 
disorder characterized by the degradation of the myelin sheath of the oligodendrocyte 
20 cell membrane. 

As used herein, the term "growth cone" refers to a specialized region at the tip 
of a growing neurite that is responsible for sensing the local environment and moving 
the axon toward its appropriate synaptic target cell. 

As used herein, the term "growth cone movement" refers to the extension or 
25 collapse of the growth cone toward a neuron's target cell. 

As used herein, the term "neurite" refers to a process growing out of a neuron. 
As it is sometimes difficult to distinguish a dendrite from in axon in culture, the term 
"neurite" is used for both. 

As used herein, the term "oligodendrocyte" refers to a neuroglial cell of the 
30 CNS whose function is to myelinate CNS axons. 

"Synthesized" as used herein and understood in the art, refers to 
polynucleotides produced by purely chemical, as opposed to enzymatic, methods. 
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"Wholly" synthesized DNA sequences are therefore produced entirely by chemical 
means, and "partially" synthesized DNAs embrace those wherein only portions of the 
resulting DNA were produced by chemical means. By the term "region" is meant a 
physically contiguous portion of the primary structure of a bicmolecule. In the case of 
5 proteins, a region is defined by a contiguous portion of the amino acid sequence of that 
protein. The term "domain" is herein defined as referring to a structural part of a 
biomolecule that contributes to a known or suspected function of the biomolecule. 
Domains may be co-extensive with regions or portions thereof; domains may also 
incorporate a portion of a biomolecule that is distinct from a particular region, in 

10 addition to all or part of that region. Examples of NgR protein domains include, but 
are not limited to, the signal peptide, extracellular (i.e., N-terminal) domain, and 
leucine-rich repeat domains. 

As used herein, the term "activity" refers to a variety of measurable indicia 
suggesting or revealing binding, either direct or indirect; affecting a response, i.e., 

15 having a measurable affect in response to some exposure or stimulus, including, for 
example, the affinity of a compound for directly binding a polypeptide or 
polynucleotide of the invention, or, for example, measurement of amounts of upstream 
or downstream proteins or other similar functions after some stimulus or event. Such 
activities may be measured by assays such as competitive inhibition of NgRl binding to 

20 Nogo assays wherein, for example, unlabeled, soluble NgR2 is added to an assay 

system in increasing concentrations to inhibit the binding of Nogo to NgRl expressed 
on the surface of CHO cells. As another example, one may assess the ability of 
neurons to extend across lesions caused by nerve damage (as in Schnell and Schwab 
(1990) Nature 343, 269-272) following inhibition of Nogo by various forms of NgR2 

25 and/or NgR3 as a biological indicator of NgR function. 

As used herein, the term "antibody" is meant to refer to complete, intact 
antibodies, and Fab, Fab 1 , F(ab)2, and other fragments thereof. Complete, intact 
antibodies include monoclonal antibodies such as murine monoclonal antibodies, 
chimeric antibodies, anti-idiotypic antibodies, anti-anti-idiotypic antibodies, and 

30 humanized antibodies. 

As used herein, the term "binding" means the physical or chemical interaction 
between two proteins or compounds or associated proteins or compounds or 
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combinations thereof. Binding includes ionic, non-ionic, hydrogen bonds, Van der 
Waals, hydrophobic interactions, etc. The physical interaction, the binding, can be 
either direct or indirect, indirect being through or due to the effects of another protein 
or compound. Direct binding refers to interactions that do not take place through or 
5 due to the effect of another protein or compound but instead are without other 
substantial chemical intermediates. 

As used herein, the term "compound" means any identifiable chemical or 
molecule, including, but not limited to, small molecules, peptides, proteins, sugars, 
nucleotides or nucleic acids, and such compound can be natural or synthetic. 
10 As used herein, the term "complementary" refers to Watson-Crick basepairing 

between nucleotide units of a nucleic acid molecule. 

As used herein, the term "contacting" means bringing together, either directly 
or indirectly, a compound into physical proximity to a polypeptide or polynucleotide of 
the invention. The polypeptide or polynucleotide can be in any number of buffers, 
15 salts, solutions etc. Contacting includes, for example, placing the compound into a 

beaker, microtiter plate, cell culture flask, or a microarray, such as a gene chip, or the 
like, which contains the nucleic acid molecule, or polypeptide encoding the NgR or 
fragment thereof. 

As used herein, the phrase "homologous nucleotide sequence," or "homologous 
20 amino acid sequence," or variations thereof, refers to sequences characterized by an 
identity at the nucleotide level, or a homology at the amino acid level, of at least the 
specified percentage. Homologous nucleotide sequences include those sequences 
coding for isoforms of proteins. Such isoforms can be expressed in different tissues of 
the same organism as a result of, for example, alternative splicing of RNA. 
25 Alternatively, isoforms can be encoded by different genes. Homologous nucleotide 

sequences include nucleotide sequences encoding for a protein of a species other than 
humans, including, but not limited to, mammals. Homologous nucleotide sequences 
also include, but are not limited to, naturally occurring allelic variations and mutations 
of the nucleotide sequences set forth herein. A homologous nucleotide sequence does 
30 not, however, include the nucleotide sequence encoding NgRl. Homologous amino 
acid sequences include those amino acid sequences which contain conservative amino 
acid substitutions and which polypeptides have the same binding and/or activity. A 
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homologous amino acid sequence does not, however, include the amino acid sequence 
encoding other known NgRs. Percent homology can be determined by, for example, 
the Gap program (Wisconsin Sequence Analysis Package, Version 8 for Unix, 
Genetics Computer Group, University Research Park, Madison WI), using the default 

5 settings, which uses the algorithm of Smith and Waterman (Adv. Appl. Math., 1981, 2, 
482-489, which is incorporated herein by reference in its entirety). 

As used herein, the term "isolated" nucleic acid molecule refers to a nucleic 
acid molecule (DNA or RNA) that is substantially free of nucleic acids encoding other 
proteins with which it is associated in nature, i.e., a nucleic acid that has been removed 

0 from its native environment. Examples of isolated nucleic acid molecules include, but 
are not limited to, recombinant DNA molecules contained in a vector, recombinant 
DNA molecules maintained in a heterologous host cell, partially or substantially 
purified nucleic acid molecules, and synthetic DNA or RNA molecules. Preferably, an 
"isolated" nucleic acid is free of sequences which naturally flank the nucleic acid (i.e., 

5 sequences located at the 5' and 3' ends of the nucleic acid) in the genomic DNA of the 
organism from which the nucleic acid is derived. For example, in various 
embodiments, the isolated NgR nucleic acid molecule can contain less than about 50 
kb, 25 kb, 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which 
naturally flank the nucleic acid molecule in genomic DNA of the cell from which the 

0 nucleic acid is derived. Moreover, an "isolated" nucleic acid molecule, such as a 

cDNA molecule, can be substantially free of other cellular material or culture medium 
when produced by recombinant techniques, or of chemical precursors or other 
chemicals when chemically synthesized. 

As used herein, the term "heterologous" refers to a nucleotide or amino acid 

5 sequence that is a different, or non-corresponding sequence, or a sequence derived 

from a different species. For example, a mouse NgR nucleotide or amino acid sequence 
is heterologous to a human NgR nucleotide or amino acid sequence, and a human NgR 
nucleic or amino acid sequence is heterologous to a human immunoglobulin nucleotide 
or amino acid sequence. 

0 As used herein, a "soluble NgR polypeptide" is a NgR polypeptide that does 

not anchor itself in a membrane. Such soluble polypeptides include, for example, 
NgR2 and NgR3 polypeptides that lack a sufficient portion of their GPI anchor signal 
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to anchor the polypeptide or are modified such that the GPI anchor signal is not 
adequate to result in replacement of the peptide with a GPI anchor. In preferred 
embodiments, up to 5, 10, 20 or 25 amino acids are removed from the C-terminus of 
NgR2 or NgR3 to make the respective proteins soluble. As used herein soluble NgR 

5 polypeptides include full-length or truncated (e.g., with internal deletions) NgR. 

Soluble NgR polypeptides may include the entire NgR protein up to the 
putative GPI signal sequence (e.g., amino acid 1 to about amino acid 395 of NgR2, 
and from amino acid 1 to about amino acid 438 of NgR3). In other embodiments, the 
signal peptide of the proteins may be removed or truncated (e.g. , all or part of the 

10 signal sequence of NgR2, which spans amino acid 1 to about amino acid 30 of SEQ ID 
NO:2, may be removed; all or part of the signal sequence of NgR3, which spans amino 
acid 1 to about amino acid 40 of SEQ ID NO: 4, may be removed). In some 
embodiments, the mature NgR2 (SEQ ID NO:8) and the mature NgR3 (SEQ ID 
NO:9) are used. 

15 Soluble NgR polypeptides include at least one of the putative ligand-binding 

portions of NgR, including the first cysteine-rich region (SEQ ID NO: 10, the leucine 
repeat region (SEQ ID NO: 12) and the second cysteine-rich region (SEQ ID NO:l 1). 
In some embodiments, soluble NgR polypeptides consist of amino acid 1 through 
about amino acid 395 of SEQ ID NO: 2, or amino acid 1 through about amino acid 

20 438 ofSEQIDNO:4. 

In other embodiments, the soluble NgR polypeptides are fusion proteins that 
contain amino acids 30 through about amino acid 395 of mature NgR2 or amino acid 
40 through about amino acid 438 of NgR3, the C-terminal 10 amino acids of a human 
IgG 1 hinge region containing the two cysteine residues thought to participate in 

25 interchain disulfide bonding, and the CH2 and CH3 regions of a human IgGI heavy 
chain constant domain. This type of recombinant protein is designed to modulate 
inhibition of axonal elongation through inhibition of theNogo ligand binding to NgRl, 
or by inhibiting the ligand of the NgR from interacting with cell surface NgR. The 
NgR portion of the fusion binds to the Nogo ligand and the IgGI portion binds to the 

30 FcyRI (macrophage) and FcyIII (NK cells and neutrophils) receptors. 

The production of the soluble polypeptides useful in this invention may be 
achieved by a variety of methods known in the art. For example, the polypeptides may 
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be derived from intact transmembrane NgR molecules by proteolysis using specific 
endopeptidases in combination with exopeptidases, Edman degradation, or both. The 
intact NgR molecule, in turn, may be purified from its natural source using 
conventional methods. Alternatively, the intact NgR may be produced by known 
5 recombinant DNA techniques using cDNAs, expression vectors and well-known 
techniques for recombinant gene expression. 

Preferably, the soluble polypeptides useful in the present invention are 
produced directly, thus eliminating the need for an entire NgR as a starting material. 
This may be achieved by conventional chemical synthesis techniques or by well-known 

10 recombinant DNA techniques wherein only those DNA sequences which encode the 
desired peptides are expressed in transformed hosts. For example, a gene which 
encodes the desired soluble NgR polypeptide may be synthesized by chemical means 
using an oligonucleotide synthesizer. Such oligonucleotides are designed based on the 
amino acid sequence of the desired soluble NgR polypeptide. Specific DNA sequences 

15 coding for the desired peptide also can be derived from the full- length DNA sequence 
by isolation of specific restriction endonuclease fragments or by PCR synthesis of the 
specified region from cDNA. 

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NOs: 1, 3 or a complement of either of 

20 these nucleotide sequences, can be isolated using standard molecular biology 

techniques and the sequence information provided herein. Using all or a portion of the 
nucleic acid sequences of SEQ ID NOs:l or 3 as a hybridization probe, NgR nucleic 
acid sequences can be isolated using standard hybridization and cloning techniques 
(e.g., as described in Sambrook et al 9 eds., MOLECULAR CLONING: ALABORATORY 

25 MANUAL 2 nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 

1989; and Ausubel, et al y eds., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John 
Wiley & Sons, New York, NY, 1993). 

A nucleic acid of the invention can be amplified using cDNA, mRNA or 
alternatively, genomic DNA, as a template and appropriate oligonucleotide primers 

30 according to standard PCR amplification techniques. The nucleic acid so amplified can 
be cloned into an appropriate vector and characterized by DNA sequence analysis. 
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Furthermore, oligonucleotides corresponding to NgR nucleotide sequences can be 
prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer. 

As used herein, the terms "modulates" or "modifies" means an increase or 
decrease in the amount, quality, or effect of a particular activity or protein. 
5 As used herein, the term "oligonucleotide" refers to a series of linked 

nucleotide residues which has a sufficient number of bases to be used in a polymerase 
chain reaction (PCR). This short sequence is based on (or designed from) a genomic 
or cDNA sequence and is used to amplify, confirm or reveal the presence of an 
identical, similar or complementary DNA or feNA in a particular cell or tissue. 

10 Oligonucleotides comprise portions of a DNA sequence having at least about 10 
nucleotides and as many as about 50 nucleotides, preferably about 15 to 30 
nucleotides. They are chemically synthesized and may be used as probes. 

As used herein, the term "probe" refers to nucleic acid sequences of variable 
length, preferably between at least about 10 and as many as about 6,000 nucleotides, 

15 depending on use. They are used in the detection of identical, similar or 

complementary nucleic acid sequences. Longer length probes are usually obtained from 
a natural or recombinant source, are highly specific and much slower to hybridize than 
oligomers. They may be single- or double-stranded and carefiilly designed to have 
specificity in PCR, hybridization membrane-based, or ELIS A-like technologies. 

20 The term "preventing" refers to decreasing the probability that an organism 

contracts or develops an abnormal condition. 

The term "treating" refers to having a therapeutic effect and at least partially 
alleviating or abrogating an abnormal condition in the organism. 

The term "therapeutic effect" refers to the inhibition or activation factors 

25 causing or contributing to the abnormal condition. A therapeutic effect relieves to 

some extent one or more of the symptoms of the abnormal condition. In reference to 
the treatment of abnormal conditions, a therapeutic effect can refer to one or more of 
the following: (a) an increase in the proliferation, growth, and/or differentiation of 
cells; (b) inhibition (z.e., slowing or stopping) of cell death; (c) inhibition of 

30 degeneration; (d) relieving to some extent one or more of the symptoms associated 

with the abnormal condition; and (e) enhancing the function of the affected population 
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of cells. Compounds demonstrating efficacy against abnormal conditions can be 
identified as described herein. 

The term "abnormal condition" refers to a function in the cells or tissues of an 
organism that deviates from their normal functions in that organism. An abnormal 
5 condition can relate to cell proliferation, cell differentiation, cell signaling, or cell 

survival. An abnormal condition may also include obesity, diabetic complications such 
as retinal degeneration, and irregularities in glucose uptake and metabolism, and fatty 
acid uptake and metabolism. 

Abnormal cell proliferative conditions, for example, include cancers such as 
10 fibrotic and mesangial disorders, abnormal angiogenesis and vasculogenesis, wound 
healing, psoriasis, diabetes mellitus and inflammation. 

Abnormal differentiation conditions include, for example, neurodegenerative 
disorders, slow wound healing rates and slow tissue grafting healing rates. 

Abnormal cell signaling conditions include, for example, psychiatric disorders 
15 involving excess neurotransmitter activity. 

Abnormal cell survival conditions may also relate to conditions in which 
programmed cell death (apoptosis) pathways are activated or abrogated. A number of 
protein kinases are associated with the apoptosis pathways. Aberrations in the 
function of any one of the protein kinases could lead to cell immortality or premature 
20 cell death. 

The term "administering" relates to a method of incorporating a compound into 
cells or tissues of an organism. The abnormal condition can be prevented or treated 
when the cells or tissues of the organism exist within the organism or outside of the 
organism. Cells existing outside the organism can be maintained or grown in cell 

25 culture dishes. For cells harbored within the organism, many techniques exist in the art 
to administer compounds, including (but not limited to) oral, parenteral, dermal, 
injection, and aerosol applications. For cells outside of the organism, multiple 
techniques exist in the art to administer the compounds, including (but not limited to) 
cell microinjection techniques, transformation techniques and carrier techniques. 

30 The abnormal condition can also be prevented or treated by administering a 

compound to a group of cells having an aberration in a signal transduction pathway to 
an organism. The effect of administering a compound on organism function can then 
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be monitored. The organism is preferably a mouse, rat, rabbit, guinea pig or goat, 
more preferably a monkey or ape, and most preferably a human. 

By "amplification' 1 it is meant increased numbers of DNA or RNA in a cell 
compared with normal cells. "Amplification" as it refers to RNA can be the detectable 
5 presence of RNA in cells, since in some normal cells there is no basal expression of 
RNA. In other normal cells, a basal level of expression exists, therefore in these cases 
amplification is the detection of at least 1-2-fold, and preferably more, compared to 
the basal level. 

The amino acid sequences are presented in the amino to carboxy direction, 
10 from left to right. The amino and carboxy groups are not presented in the sequence. 
The nucleotide sequences are presented by single strand only, in the 5' to 3' direction, 
from left to right. Nucleotides and amino acids are represented in the manner 
recommended by the IUPAC-IUB Biochemical Nomenclature Commission or (for 
amino acids) by three letters code. 

15 

Nucleic Acids 

Genomic DNA of the invention comprises the protein-coding region for a 
polypeptide of the invention and is also intended to include allelic variants thereof. It is 
widely understood that, for many genes, genomic DNA is transcribed into RNA 

20 transcripts that undergo one or more splicing events wherein intron (i.e. 9 non-coding 
regions) of the transcripts are removed, or "spliced out." RNA transcripts that can be 
spliced by alternative mechanisms, and therefore be subject to removal of different 
RNA sequences but still encode a NgR polypeptide, are referred to in the art as splice 
variants which are embraced by the invention. Splice variants comprehended by the 

25 invention therefore are encoded by the same original genomic DNA sequences but 
arise from distinct mRNA transcripts. Allelic variants are modified forms of a 
wild-type gene sequence, the modification resulting from recombination during 
chromosomal segregation or exposure to conditions which give rise to genetic 
mutation. Allelic variants, like wild-type genes, are naturally occurring sequences (as 

30 opposed to non-naturally occurring variants arising from in vitro manipulation). 

The invention also comprehends cDNA that is obtained through reverse 
transcription of an RNA polynucleotide encoding NgR (conventionally followed by 
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second-strand synthesis of a complementary strand to provide a double-stranded 
DNA). 

Preferred DNA sequences encoding a human NgR polypeptide is set out in 
SEQ ID NOs: 1 and 13. A preferred DNA of the invention comprises a double 
5 stranded molecule comprising the coding molecule (i.e., the "coding strand") along 

with the complementary molecule (the "non-coding strand" or "complement") having a 
sequence unambiguously deducible from the coding strand according to Watson-Crick 
base-pairing rules for DNA Also preferred are other polynucleotides encoding NgR 
polypeptides, as shown in SEQ ID NO:3, which comprises murine NgR homolog, 
10 NgR3. 

Also preferred are nucleotide sequences that encode at least a portion of a NgR 
polypeptide that has at least one biological function of a NgR. More preferred are 
nucleotide sequences that encode a portion of NgR that encodes at least the mature 
NgR without the hydrophobic C-terminal GPI signal. Also preferred are nucleotide 
15 sequences that encode the portion of NgR that encodes at least the ligand-binding 
region of NgR. 

The invention further embraces other species, preferably mammalian, homologs 
of the human NgR DNA. Species homologs, sometimes referred to as "orthologs," in 
general, share at least 35%, at least 40%, at least 45%, at least 50%, at least 60%, at 

20 least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 
95%, at least 98%, or at least 99% homology with human DNA of the invention. 
Generally, percent sequence "homology" with respect to polynucleotides of the 
invention may be calculated as the percentage of nucleotide bases in the candidate 
sequence that are identical to nucleotides in the NgR sequences set forth in SEQ ID 

25 NOs:l, 3 or 13, after aligning the sequences and introducing gaps, if necessary, to 
achieve the maximum percent sequence identity. 

The polynucleotide sequence information provided by the invention makes 
possible large-scale expression of the encoded polypeptide by techniques well known 
and routinely practiced in the art. Polynucleotides of the invention also permit 

30 identification and isolation of polynucleotides encoding related NgR polypeptides, such 
as human allelic variants and species homologs, by well-known techniques including 
Southern and/or Northern hybridization, and polymerase chain reaction (PCR). 
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Exaraples of related polynucleotides include human and non-human genomic 
sequences, including allelic variants, as well as polynucleotides encoding polypeptides 
homologous to NgR and structurally related polypeptides sharing one or more 
biological, immunological, and/or physical properties of NgR. Non-human species 
5 genes encoding proteins homologous to NgR can also be identified by Southern and/or 
PCR analysis and are useful in animal models for NgR disorders. Knowledge of the 
sequence of a human NgR DNA also makes possible through use of Southern 
hybridization or polymerase chain reaction (PCR) the identification of genomic DNA 
sequences encoding NgR expression control regulatory sequences such as promoters, 

10 operators, enhancers, repressors, and the like. Polynucleotides of the invention are 
also useful in hybridization assays to detect the capacity of cells to express NgR. 
Polynucleotides of the invention may also provide a basis for diagnostic methods useful 
for identifying a genetic alterations) in a NgR locus that underlies a disease state or 
states, which information is useful both for diagnosis and for selection of therapeutic 

15 strategies. 

The disclosure herein of a full-length polynucleotide encoding a NgR 
polypeptide makes readily available to the worker of ordinary skill in the art every 
possible fragment of the full-length polynucleotide. The invention, therefore, provides 
fragments of NgR-encoding polynucleotides comprising at least 6, and preferably at 

20 least 14, 16, 18, 20, 25, 50, or 75 consecutive nucleotides of a polynucleotide 

encoding NgR. Preferably, fragments of polynucleotides of the invention comprise 
sequences unique to the NgR-encoding polynucleotide sequence, and therefore 
hybridize under highly stringent or moderately stringent conditions only (/.e., 
"specifically") to polynucleotides encoding NgR (or fragments thereof). 

25 Polynucleotide fragments of genomic sequences of the invention comprise not only 
sequences unique to the coding region, but also include fragments of the full-length 
sequence derived from introns, regulatory regions, and/or other non-translated 
sequences. Sequences unique to polynucleotides of the invention are recognizable 
through sequence comparison to other known polynucleotides, and can be identified 

30 through use of alignment programs routinely utilized in the art, e.g., those made 

available in public sequence databases. Such sequences also are recognizable from 
Southern hybridization analyses to determine the number of fragments of genomic 
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DNA to which a polynucleotide will hybridize. Polynucleotides of the invention can be 
labeled in a manner that permits their detection, including radioactive, fluorescent and 
enzymatic labeling. 

Fragments of polynucleotides are particularly useful as probes for detection of. 
5 full-length or fragment of NgR polynucleotides. One or more polynucleotides can be; 
included in kits that are used to detect the presence of a polynucleotide encoding NgR, 
or used to detect variations in a polynucleotide sequence encoding NgR. 

The invention also embraces DNAs encoding NgR polypeptides that hybridize 
under moderately stringent or high stringency conditions to the noncoding strand, or 

10 complement, of the polynucleotide in any of SEQ ED NOs: 1 or 3. 

Stringent conditions are known to those skilled in the art and can be found in 
Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3.176.3.6. Preferably, the conditions are such that sequences at least about 65%, 
70%, 75%, 85%, 90%, 95%, 98% or 99% homologous to each other typically remain 

15 hybridized to each other. A non-limiting example of stringent hybridization conditions 
is hybridization in a high salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 
mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA and 500 mg/ml denatured salmon 
sperm DNA at 65 °C. This hybridization is followed by one or more washes in 0.2X 
SSC, 0.01% BSA at 50°C. An isolated nucleic acid molecule of the invention that 

20 hybridizes under stringent conditions to the sequence of SEQ ID NOs: 1 or 3 
corresponds to a naturally occurring nucleic acid molecule. As used herein, a 
"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having 
a nucleotide sequence that occurs in nature {e.g., encodes a natural protein). 
As used herein, "stringent hybridization conditions" means: 42°C in a hybridization 

25 solution comprising 50% formamide, 1% SDS, 1 MNaCl, 10% (wt/vol) dextran 

sulfate, and washing twice for 30 minutes at 60°C in a wash solution comprising 0. 1 X 
SSC and 1% SDS. 

Vectors 

30 Another aspect of the present invention is directed to vectors, or recombinant 

expression vectors, comprising any of the nucleic acid molecules described above. 
Vectors are used herein either to amplify DNA or RNA encoding NgR and/or to 
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express DNA which encodes NgR. As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another nucleic acid to which it has been 
linked. One type of vector is a "plasmid", which refers to a circular double stranded 
DNA loop into which additional DNA segments can be ligated. Another type of 
5 vector is a viral vector, wherein additional DNA segments can be ligated into the viral 
genome. Certain vectors are capable of autonomous replication in a host cell into 
which they are introduced (e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors). Other vectors (e.g., non-episomal 
mammalian vectors) are integrated into the genome of a host cell upon introduction 

10 into the host cell, and thereby are replicated along with the host genome. Moreover, 
certain vectors are capable of directing the expression of genes to which they are 
operatively linked. Such vectors are referred to herein as "expression vectors". In 
general, expression vectors of utility in recombinant DNA techniques are often in the 
form of plasmids. In the present specification, "plasmid" and "vector" can be used 

1 5 interchangeably as the plasmid is the most commonly used form of vector. However, 
the invention is intended to include such other forms of expression vectors, such as 
viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno- 
associated viruses), that serve equivalent functions. 

Expression of proteins in prokaryotes is most often carried out inE. colt with 

20 vectors containing constitutive or inducible promoters directing the expression of 

either fusion or non-fixsion proteins. Fusion vectors add a number of amino acids to a 
protein encoded therein, usually to the amino terminus of the recombinant protein. 
Such fusion vectors typically serve three purposes: (1) to increase expression of 
recombinant protein; (2) to increase the solubility of the recombinant protein; and (3) 

25 to aid in the purification of the recombinant protein by acting as a ligand in affinity 
purification. Often, in fusion expression vectors, a proteolytic cleavage site is 
introduced at the junction of the fusion moiety and the recombinant protein to enable 
separation of the recombinant protein from the fusion moiety subsequent to 
purification of the fusion protein. Such enzymes, and their cognate recognition 

30 sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression 
vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson (1988) Gene 67, 
31-40), pMAL (New England Biolabs, Beverly, Mass.) and pRTT5 (Pharmacia, 
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Piscataway, N.J.) that fuse glutathione-S-transferase (GST), maltose E binding protein, 
or protein A, respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors include 
pTrc (Amrann et al. y (1988) Gene 69, 301-315) and pET 1 Id (Studier et al 9 GENE 
5 Expression Technology: Methods in Enzymology 1 85, Academic Press, San 
Diego, CA. (1990) 60-89). 

One strategy to maximize recombinant protein expression in E. coli is to 
express the protein in host bacteria with an impaired capacity to proteolytically cleave 
the recombinant protein. See, Gottesman, Gene Expression Technology: 

10 Methods in Enzymology 185, Academic Press, SanDiego, CA. (1990) 119-128. 

Another strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted 
into an expression vector so that the individual codons for each amino acid are those 
preferentially utilized in E. coli (Wada et aL 9 (1992) Nucleic Acids Res. 20, 21 1 1- 
21 18). Such alteration of nucleic acid sequences of the invention can be carried out by 

15 standard DNA synthesis techniques. 

In another embodiment, the NgR expression vector is a yeast expression 
vector. Examples of vectors for expression in yeast S. cerevisiae include pYepSecl 
(Baldari, et aL 9 (1987) EMBO J. 6, 229-234), pMFa (Kurjan and Herskowitz (1982) 
Cell 30, 933-943), pJRY88 (Schultz etal y (1987) Gene 54, 113-123), pYES2 

20 (Invitrogen Corporation, San Diego, CA), and picZ (InVitrogen Corp, San Diego, 
CA). 

Alternatively, NgR can be expressed in insect cells using baculovirus expression 
vectors. Baculovirus vectors available for expression of proteins in cultured insect 
cells {e.g., SF9 cells) include the pAc series (Smith etal t (1983) Mol. Cell. Biol. 3, 

25 2156-2165) and the pVL series (Lucklow and Summers (1989) Virology 170, 3 1-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 329, 840) and pMT2PC 
(Kaufinan et al (1987) EMBO J. 6, 187-195). When used in mammalian cells, the 

30 expression vector's control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
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prokaryotic and eukaryotic cells. See, e.g., Chapters 16 and 17 of Sambrook et al, 
(Eds.) Molecular Cloning: A Laboratory Manual. 2 nd Ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. 
In another embodiment, the recombinant mammalian expression vector is 
5 capable of directing expression of the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al. 
(1987) Genes Dev. 1, 268-277), lymphoid-specific promoters (Calame and Eaton 

10 (1988) Adv. Immunol. 43, 235-275), in particular promoters of T cell receptors 

(Winoto and Baltimore (1989) EMBO J. 8, 729-733) and immunoglobulins (Banerji et 
al (1983) Cell 33, 729-740; Queen and Baltimore (1983) Cell 33, 741-748), neuron- 
specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. 
Natl Acad. Sci. USA 86, 5473-5477), pancreas-specific promoters (Edlund et al. 

15 (1985) Science 230, 912-916), and mammary gland-specific promoters (e.g., milk 
whey promoter; U.S. Pat. No. 4,873,316 and European Application Publication No. 
264,166). Developmentally-regulated promoters are also encompassed, e.g., the 
murine hox promoters (Kessel and Gruss (1990) Science 249, 374-379) and the 
a-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev. 3, 537-546). 

20 The invention further provides a recombinant expression vector comprising a 

DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence 
in a manner that allows for expression (by transcription of the DNA molecule) of an 
RNA molecule that is antisense NgR mRNA. Regulatory sequences operatively linked 

25 to a nucleic acid cloned in the antisense orientation can be chosen that direct the 

continuous expression of the antisense RNA molecule in a variety of cell types, for 
instance viral promoters and/or enhancers, or regulatory sequences can be chosen that 
direct constitutive, tissue-specific or cell-type-specific expression of antisense RNA, 
The antisense expression vector can be in the form of a recombinant plasmid, 

30 phagemid or attenuated virus in which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the activity of which can be determined 
by the cell type into which the vector is introduced. For a discussion of the regulation 
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of gene expression using antisense genes see Weintraub et aL 9 Antisense RNA as a 
molecular tool for genetic analysis, REVIEWS-TRENDS IN GENETICS, Vol. 1(1) 1986. 

Preferred vectors include, but are not limited to, plasmids, phages, cosmids, 
episomes, viral particles or viruses and integratable DNA fragments (i.e., fragments 
5 integratable into the host genome by homologous recombination). Preferred viral 
particles include, but are not limited to, adenoviruses, baculoviruses, parvoviruses, 
herpesviruses, poxviruses, adeno-associated viruses, Semliki Forest viruses, vaccinia 
viruses and retroviruses. Preferred expression vectors include, but are not limited to, 
pcDNA3 (Invitrogen) and pSVL (Pharmacia Biotech). Other expression vectors 

10 include, but are not limited to, pSPORT™ vectors, pGEM™ vectors (Promega), 

pPROEXvectors™ (LTI, Bethesda, MD), Bluescript™ vectors (Stratagene), pQE™ 
vectors (Qiagen), pSE420™ (Invitrogen) and pYES2™(Invitrogen). 

Preferred expression vectors are replicable DNA constructs in which a DNA 
sequence encoding NgR is operably linked or connected to suitable control sequences 

15 capable of effecting the expression of the NgR in a suitable host. DNA regions are 
operably linked or connected when they are functionally related to each other. For 
example, a promoter is operably linked or connected to a coding sequence if it controls 
the transcription of the sequence. Amplification vectors do not require expression 
control domains, but rather need only the ability to replicate in a host, usually 

20 conferred by an origin of replication, and a selection gene to facilitate recognition of 
transformants. The need for control sequences in the expression vector will vary 
depending upon the host selected and the transformation method chosen. Generally, 
control sequences include, but are not limited to a transcriptional promoter, enhancers, 
an optional operator sequence to control transcription, polyadenylation signals, a 

25 sequence encoding suitable mRNA ribosomal binding and sequences which control the 
termination of transcription and translation. Such regulatory sequences are described, 
for example, in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN 
ENZYMOLOGY 185, Academic Press, San Diego, CA (1990). Regulatory sequences 
include those that direct constitutive expression of a nucleotide sequence in many types 

30 of host cell and those that direct expression of the nucleotide sequence only in certain 
host cells (e.g., tissue-specific regulatory sequences). It will be appreciated by those 
skilled in the art that the design of the expression vector can depend on such factors as 
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the choice of the host cell to be transformed, the level of expression of protein desired, 
etc. The expression vectors of the invention can be introduced into host cells to 
thereby produce proteins or peptides, including fusion proteins or peptides, encoded by 
nucleic acids as described herein (e.g., NgR proteins, mutant forms of NgR, fusion 

5 proteins, etc.). 

Preferred vectors preferably contain a promoter that is recognized by the host 
organism. The promoter sequences of the present invention may be prokaryotic, 
eukaryotic or viral. Examples of suitable prokaryotic sequences include the PR and PL 
promoters of bacteriophage lambda (The BACTERIOPHAGE LAMBDA, Hershey, A.D. 

10 (Ed.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1973), which is 
incorporated herein by reference in its entirety; LAMBDA II, Hendrix, R.W. (Ed.), Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1980), which is 
incorporated herein by reference in its entirety); the trp, recA, heat shock, and lacZ 
promoters of E. coli and the SV40 early promoter (Benoist et al, (1981) Nature 290, 

15 304-3 10, which is incorporated herein by reference in its entirety). Additional 

promoters include, but are not limited to, mouse mammary tumor virus, long terminal 
repeat of human immunodeficiency virus, maloney virus, cytomegalovirus immediate 
early promoter, Epstein Barr virus, Rous sarcoma virus, human actin, human myosin, 
human hemoglobin, human muscle creatine and human metallothionein. 

20 Additional regulatory sequences can also be included in preferred vectors. 

Preferred examples of suitable regulatory sequences are represented by the 
Shine-Dalgarno sequence of the replicase gene of the phage MS-2 and of the gene ell 
of bacteriophage lambda. The Shine-Dalgarno sequence may be directly followed by 
DNA encoding NgR and result in the expression of the mature NgR protein. 

25 Moreover, suitable expression vectors can include an appropriate marker that 

allows the screening of the transformed host cells. The transformation of the selected 
host is carried out using any one of the various techniques well known to the expert in 
the art and described in Sambrook et al, supra. 

An origin of replication can also be provided either by construction of the 

30 vector to include an exogenous origin or may be provided by the host cell 

chromosomal replication mechanism. If the vector is integrated into the host cell 
chromosome, the latter may be sufficient. Alternatively, rather than using vectors 
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which contain viral origins of replication, one skilled in the art can transform 
mammalian cells by the method of co-transformation with a selectable marker and NgR 
DNA. An example of a suitable marker is dihydrofolate reductase (DHFR) or 
' thymidine kinase (see, U.S. Patent No. 4,399,216). 
5 Nucleotide sequences encoding NgR may be recombined with vector DNA in 

accordance with conventional techniques, including blunt-ended or staggered-ended 
termini for ligation, restriction enzyme digestion to provide appropriate termini, filling 
in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable 
joining and ligation with appropriate ligases. Techniques for such manipulation are 
10 disclosed by Sambrook et al y supra and are well known in the art. Methods for 

construction of mammalian expression vectors are disclosed in, for example, Okayama 
etal, (1983)Mo/. Cell Biol 3:280, Cosmane* a/. (1986) Mol Immunol 23:935, 
Cosman et al, (1984) Nature 3 12:768, EP-A-0367566, and WO 91/18982, each of 
which is incorporated herein by reference in its entirety. 

15 

Host Cells and Transformed Host Cells 

According to another aspect of the invention, host cells are provided, including 
prokaryotic and eukaryotic cells, comprising a polynucleotide of the invention (or 
vector of the invention) in a manner that permits expression of the encoded NgR 

20 polypeptide. Preferably, the cell produces little or no endogenous NgR polypeptide. 
Polynucleotides of the invention may be introduced into the host cell as part of a 
circular plasmid, or as linear DNA comprising an isolated protein coding region or a 
viral vector. Methods for introducing DNA into the host cell that are well known and 
routinely practiced in the art include transformation, transfection, electroporation, 

25 nuclear injection, or fusion with carriers such as liposomes, micelles, ghost cells and 

protoplasts. Expression systems of the invention include bacterial, yeast, fungal, plant, 
insect, invertebrate, vertebrate and mammalian cells systems. 

Host cells of the invention are a valuable source of immunogen for 
development of antibodies specifically immunoreactive with NgR Host cells of the 

30 invention are also useful in methods for the large-scale production of NgR 

polypeptides wherein the cells are grown in a suitable culture medium and the desired 
polypeptide products are isolated from the cells, or from the medium in which the cells 
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are grown, by purification methods known in the art, e.g., conventional 
chromatographic methods including immunoaffinity chromatography, receptor affinity 
chromatography, hydrophobic interaction chromatography, lectin affinity 
chromatography, size exclusion filtration, cation or anion exchange chromatography, 
5 high pressure liquid chromatography (HPLC), reverse phase HPLC, and the like. Still 
other methods of purification include those methods wherein the desired protein is 
expressed and purified as a fusion protein having a specific tag, label or chelating 
moiety that is recognized by a specific binding partner or agent. The purified protein 
can be cleaved to yield the desired protein, or can be left as an intact fusion protein. 

10 Cleavage of the fusion component may produce a form of the desired protein having 
additional amino acid residues as a result of the cleavage process. 

Knowledge of NgRDNA sequences allows for modification of cells to permit, 
or increase, expression of endogenous NgR. Cells can be modified {e.g., by 
homologous recombination) to provide increased expression by replacing, in whole or 

15 in part, the naturally occurring NgR promoter with all or part of a heterologous 

promoter so that the cells express NgR at higher levels. The heterologous promoter is 
inserted in such a manner that it is operatively linked to endogenous NgR encoding 
sequences. (See, for example, PCT International Publication No. WO 94/12650, PCT 
International Publication No. WO 92/20808, and PCT International Publication No. 

20 WO 91/09955.) It is also contemplated that, in addition to heterologous promoter 
DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene 
which encodes carbamoyl phosphate synthase, aspartate transcarbamylase, and 
dihydroorotase) and/or intron DNA may be inserted along with the heterologous 
promoter DNA. If linked to the NgR coding sequence, amplification of the marker 

25 DNA by standard selection methods results in co-amplification of the NgR coding 
sequences in the cells. 

The DNA sequence information provided by the present invention also makes 
possible the development (e.g., by homologous recombination or "knock-out" 
strategies; see Capecchi, Science 244:1288-1292 (1989)) of animals that fail to express 

30 functional NgR or that express a variant of NgR. Such animals (especially small 

laboratory animals such as rats, rabbits and mice) are useful as models for studying the 
in vivo activities of NgR and modulators of NgR. 
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Suitable host cells for expression of the polypeptides of the invention include, 
but are not limited to, prokaryotes, yeast, and eukaryotes. If a prokaryotic expression 
vector is employed, then the appropriate host cell would be any prokaryotic cell 
capable of expressing the cloned sequences. Suitable prokaryotic cells include, but are 

5 not limited to, bacteria of the genera Escherichia, Bacillus, Salmonella, Pseudomonas, 
Streptomyces and Staphylococcus. 

If a eukaryotic expression vector is employed, then the appropriate host cell 
would be any eukaryotic cell capable of expressing the cloned sequence. Preferably, 
eukaryotic cells are cells of higher eukaryotes. Suitable eukaryotic cells include, but 

0 are not limited to, non-human mammalian tissue culture cells and human tissue culture 
cells. Preferred host cells include, but are not limited to, insect cells, HeLa cells, 
Chinese hamster ovary cells (CHO cells), African green monkey kidney cells (COS 
cells), human 293 cells, and murine 3T3 fibroblasts. Propagation of such cells in cell 
culture has become a routine procedure {see, Tissue Culture, Academic Press, Kruse 

5 and Patterson, Eds. (1973), which is incorporated herein by reference in its entirety). 

In addition, a yeast cell may be employed as a host cell. Preferred yeast cells 
include, but are not limited to, the genera Saccharomyces, Pichia and Kluveromyces. 
Preferred yeast hosts are S. cerevisiae and P. pastoris. Preferred yeast vectors can 
contain an origin of replication sequence from a 2T yeast plasmid, an autonomously 

0 replication sequence (ARS), a promoter region, sequences for polyadenylation, 

sequences for transcription termination and a selectable marker gene. Shuttle vectors 
for replication in both yeast and E. coli are also included herein. 

Alternatively, insect cells may be used as host cells. In a preferred 
embodiment, the polypeptides of the invention are expressed using a baculovirus 

5 expression system (see, Luckow et aL, Bio/Technology, 1988, 6, 47; Baculovirus 
Expression Vectors: A Laboratory Manual, OTtielly et aL (Eds.), W.H. 
Freeman and Company, New York, 1992; and U.S. Patent No. 4,879,236, each of 
which is incorporated herein by reference in its entirety). In addition, the MAXBAC™ 
complete baculovirus expression system (Invitrogen) can, for example, be used for 

0 production in insect cells. 

Suitable host cells are discussed further in Goeddel, Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, CA 
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(1990). Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 
polymerase. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
5 conventional transformation or transfection techniques. As used herein, the terms 

"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid {e.g., DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 
transfection, lipofection, or electroporation. Suitable methods for transforming or 
1 0 transfecting host cells can be found in Sambrook, et ah (MOLECULAR CLONING: A 

Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon 
the expression vector and transfection technique used, only a small fraction of cells 
15 may integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker {e.g., resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Various 
selectable markers include those that confer resistance to drugs, such as G418, 
hygromycin, dihydrofolate reductase (DHFR) and methotrexate. Nucleic acid 
20 encoding a selectable marker can be introduced into a host cell on the same vector as 
that encoding NgR or can be introduced on a separate vector. Cells stably transfected 
with the introduced nucleic acid can be identified by drug selection (e.g., cells that 
have incorporated the selectable marker gene will survive, while the other cells die). 
In a preferred embodiment, the polypeptides of the invention, including forms 
25 of NgR2 and NgR3, soluble forms of NgR, chimeric NgR polypeptides, NgR/Ig 

fusions and fragments and variations of each of the above are expressed in Chinese 
Hamster Ovary (CHO) cells. 

In order to introduce the DNA fragment coding for the NgR protein or 
polypeptide into the CHO cell to express the recombinant NgR protein or polypeptide, 
30 it is necessary to construct the expression vector. 

The vectors for CHO expression include, but are not limited to, pAl-1 1, pXTl, 
pRc/CMV, pRc/RSV and pcDNADSfeo. The promoter is not specifically limited 
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provided it effectively promotes expression in CHO cells. Examples of suitable 
promoters are: SRoc, SV40, LTR, CMV, and HSV-TK. Of these, CMV and Sra 
promoters are preferred. 

- In addition to the above-mentioned promoters, the expression vectors may 
5 contain enhancers, splicing signals, polyadenylation signals, selectable markers and an 
S V40 replication origin. Suitable selectable markers include, but are not limited to the 
dihydrofolate reductase (DHFR) gene which provides resistance to methotrexate 
(MTX), the ampicillin resistance gene, and the neomycin resistance gene. 

Examples of the expression vectors each containing the DNA coding for NgR, 

10 portions, fragments and soluble constructs thereof, include the vector (such as one 

described above), into which the promoter is operably linked (preferably upstream) to 
the nucleotide sequence encoding the desired NgR construct; a polyadenylation signal 
downstream from the nucleotide sequence encoding the NgR construct; and, 
preferably, the vector includes an operable DHFR gene. Preferably, the ampicillin 

15 resistant gene is also operably contained in the vector. 

CHO cell lacking the DHFR gene (Urlaub, G. etal, (1980) Proc. Natl Acad. 
Set USA 77, 4216-4220) and CHO-K1 {Proa Natl Acad. Sci. USA 60, 1275 (1968)) 
are suitable for use. 

The NgR expression vectors prepared as above are introduced into CHO cells 

20 by any known method, including, but not limited to the calcium phosphate method 

(Graham and van der Eb (1973) Virol 52, 456-467) and electroporation (Nuemann et 
al, (1982) EMBO J. 1, 841-845). 

Transformants carrying the expression vectors are selected based on the 
above-mentioned selectable markers. Repeated clonal selection of the transformants 

25 using the selectable markers allows selection of stable cell lines having high expression 
of the NgR constructs. Increased MTX concentrations in the selection medium allows 
gene amplification and greater expression of the desired protein. The CHO cell 
containing the recombinant NgR can be produced by cultivating the CHO cells 
containing the NR expression vectors constitutively expressing the NgR constructs. 

30 Media used in cultivating CHO cells includes DMEM medium supplemented 

with about 0.5 to 20% fetal calf serum, DMEM medium and RPMI1640 medium. The 
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pH of the medium is preferably about 6 to 8. Cultivation is preferably at about 30 to 
40°C for about 15 to 72 hours with aeration. 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (i.e., express) NgR protein. Accordingly, the 
5 invention further provides methods for producing NgR protein using the host cells of 
the invention. In one embodiment, the method comprises culturing the host cell of 
invention (into which a recombinant expression vector encoding NgR has been 
introduced) in a suitable medium such that NgR protein is produced. In another 
embodiment, the method further comprises isolating NgR from the medium or the host 
10 cell. 

In situations where the NgR polypeptide will be found primarily intracellularly, 
intracellular material (including inclusion bodies for Gram-negative bacteria) can be 
extracted from the host cell using any standard technique known to one of ordinary 
skill in the art. Such methods would encompass, by way of example and not by way of 

15 limitation, lysing the host cells to release the contents of the periplasm/cytoplasm by 
French press, homogenization and/or sonication followed by centrifugation. 

If the NgR polypeptide has formed inclusion bodies in the cytosol, such 
inclusion bodies may frequently bind to the inner and/or outer cellular membranes. 
Upon centrifugation, the inclusion bodies will be found primarily in the pellet material. 

20 The pellet material can then be treated at pH extremes or with one or more chaotropic 
agents such as a detergent, guanidine, guanidine derivatives, urea, or urea derivatives 
in the presence of a reducing agent such as dithiothreitol at alkaline pH or 
tris-carboxyethyl phosphine at acid pH to release, break apart and solubilize the 
inclusion bodies. Once solubilized, NgR polypeptide can be analyzed using gel 

25 electrophoresis, immunoprecipitation or the like. Various methods of isolating the 
NgR polypeptide would be apparent to one of ordinary skill in the art, for example, 
isolation may be accomplished using standard methods such as those set forth below 
and in Marston et al (1990) MetK Enzyniol. 182, 264-275 (incorporated by reference 
herein in its entirety). 

30 If isolated NgR polypeptide is not biologically active following the isolation 

procedure employed, various methods for "refolding" or converting the polypeptide to 
its tertiary structure and generating disulfide linkages, can be used to restore biological 
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activity. Methods known to one of ordinary skill in the art include adjusting the pH of 
the solubilized polypeptide to a pH usually above 7 and in the presence of a particular 
concentration of a chaotrope. The selection of chaotrope is very similar to the choices 
used for inclusion body solubilization but usually at a lower concentration and is not 
5 necessarily the same chaotrope as used for the solubilization. It may be required to 

employ a reducing agent or the reducing agent plus its oxidized form in a specific ratio, 
to generate a particular redox potential allowing for disulfide shuffling to occur in the 
formation of the protein f s cysteine bridge(s). Some of the commonly used redox 
couples include cysteine/cystamine, glutathione (GSH)/dithiobis GSH, cupric chloride, 
10 dithiothreitol (DTT)/dithiane DTT, 2-mercaptoethanol (bME)/dithio-b(ME). To 

increase the efficiency of the refolding, it may be necessary to employ a cosolvent, such 
as glycerol, polyethylene glycol of various molecular weights and arginine. 

Transgenic Animals 

15 The host cells of the invention can also be used to produce non-human 

transgenic animals. For example, in one embodiment, a host cell of the invention is a 
fertilized oocyte or an embryonic stem cell into which NgR-coding sequences have 
been introduced. Such host cells can then be used to create non-human transgenic 
animals in which exogenous NgR sequences have been introduced into their genome or 
20 homologous recombinant animals in which endogenous NgR sequences have been 

altered. Such animals are useful for studying the function and/or activity of NgR and 
for identifying and/or evaluating modulators of NgR activity. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a 
rodent such as a rat or mouse, in which one or more of the cells of the animal includes 
25 a transgene. Other examples of transgenic animals include non-human primates, sheep, 
dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA that is 
integrated into the genome of a cell from which a transgenic animal develops and that 
remains in the genome of the mature animal, thereby directing the expression of an 
encoded gene product in one or more cell types or tissues of the transgenic animal. As 
30 used herein, a "homologous recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous NgR gene has been altered 
by homologous recombination between the endogenous gene and an exogenous DNA 
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molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, 
prior to development of the animal. 

A transgenic animal of the invention can be created by introducing NgR- 
encoding nucleic acid into the male pronuclei of a fertilized oocyte, e.g., by 
5 microinjection, retroviral infection, and allowing the oocyte to develop in a 

pseudopregnant female foster animal. The human NgR DNA sequence of SEQ ID 
NOs:l or 3 can be introduced as a transgene into the genome of a non-human animal. 
Alternatively, a nonhuman homolog of the human NgR gene, such as a mouse NgR 
gene, can be isolated based on hybridization to the human NgR cDNA (described 

10 further above) and used as a transgene. Intronic sequences and polyadenylation signals 
can also be included in the transgene to increase the efficiency of expression of the 
transgene. A tissue-specific regulatory sequence(s) can be operably linked to the NgR 
transgene to direct expression of NgR protein to particular cells. Methods for 
generating transgenic animals via embryo manipulation and microinjection, particularly 

1 5 animals such as mice, have become conventional in the art and are described, for 

example, in U.S. Pat. Nos. 4,736,866; 4,870,009; and 4,873,191; and Hogan 1986, in 
Manipulating the MOUSE Embryo, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY. Similar methods are used for production of other transgenic 
animals. A transgenic founder animal can be identified based upon the presence of the 

20 NgR transgene in its genome and/or expression of NgR mRNA in tissues or cells of the 
animals. A transgenic founder animal can then be used to breed additional animals 
carrying the transgene. Moreover, transgenic animals carrying a transgene encoding 
NgR can further be bred to other transgenic animals carrying other transgenes. 
' To create a homologous recombinant animal, a vector is prepared which 

25 contains at least a portion of a NgR gene into which a deletion, addition or substitution 
has been introduced to thereby alter, e.g., functionally disrupt, the NgR gene. The 
NgR gene can be a human gene (e.g., SEQ ID NOs:l or 13), but more preferably, is a 
non-human homolog of a human NgR gene. For example, a mouse homolog of human 
NgR gene of SEQ ID NOs: 1 or 13 can be used to construct a homologous 

30 recombination vector suitable for altering an endogenous NgR gene in the mouse 
genome. In one embodiment, the vector is designed such that, upon homologous 
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recombination, the endogenous NgR gene is functionally disrupted (Le., no longer 
encodes a functional protein; also referred to as a "knock out" vector). 

Alternatively, the vector can be designed such that, upon homologous 
recombination, the endogenous NgR gene is mutated or otherwise altered but still 
5 encodes functional protein (e.g., the upstream regulatory region can be altered to 
thereby alter the expression of the endogenous NgR protein). In the homologous 
recombination vector, the altered portion of the NgR gene is flanked at its 5' and 3* 
ends by additional nucleic acid of the NgR gene to allow for homologous 
recombination to occur between the exogenous NgR gene carried by the vector and an 

10 endogenous NgR gene in an embryonic stem cell. The additional flanking NgR nucleic 
acid is of sufficient length for successful homologous recombination with the 
endogenous gene. Typically, several kilobases of flanking DNA (both at the 5' and 3' 
ends) are included in the vector. See e.g., Thomas etal (1987) Cell 51:503 for a 
description of homologous recombination vectors. The vector is introduced into an 

15 embryonic stem cell line (e.g., by electroporation) and cells in which the introduced 

NgR gene has homologously recombined with the endogenous NgR gene are selected 
(see e.g., Li et al (1992) Cell 69:915). 

The selected cells are then injected into a blastocyst of an animal (e.g., a 
mouse) to form aggregation chimeras. See e.g., Bradley 1987, In: 

20 Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, 

Robertson, ed. IRL, Oxford, pp. 1 13-152. A chimeric embryo can then be implanted 
into a suitable pseudopregnant female foster animal and the embryo brought to term. 
Progeny harboring the homologously recombined DNA in their germ cells can be used 
to breed animals in which all cells of the animal contain the homologously recombined 

25 DNA by germline transmission of the transgene. Methods for constructing 

homologous recombination vectors and homologous recombinant animals are 
described further in Bradley (1991) Ctirr. Opin. Biotechnol. 2:823-829; PCT 
International Publication Nos.: WO 90/11354; WO 91/01140; WO 92/0968; and WO 
93/04169. 

30 In another embodiment, transgenic non-humans animals can be produced that 

contain selected systems that allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PI. For 
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a description of the cre/loxP recombinase system, see, e.g., Lakso et al (1992) Proc. 
Natl Acad. Sci. USA 89:6232-6236. Another example of a recombinase system is the 
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al (1991) 
Science 251:1351-1355. Ifa cre/loxP recombinase system is used to regulate 
5 expression of the transgene, animals containing transgenes encoding both the Cre 
recombinase and a selected protein are required. Such animals can be provided 
through the construction of "double" transgenic animals, e.g., by mating two 
transgenic animals, one containing a transgene encoding a selected protein and the 
other containing a transgene encoding a recombinase. 

10 Clones of the non-human transgenic animals described herein can also be 

produced according to the methods described in Wilmut et al (1997) Nature 385:810- 
813. In brief, a cell, e.g., a somatic cell, from the transgenic animal can be isolated and 
induced to exit the growth cycle and enter G 0 phase. The quiescent cell can then be 
fused, e.g., through the use of electrical pulses, to an enucleated oocyte from an animal 

15 of the same species from which the quiescent cell is isolated. The reconstructed 
oocyte is then cultured such that it develops to morula or blastocyte and then 
transferred to pseudopregnant female foster animal. The offspring borne of this female 
foster animal will be a clone of the animal from which the cell, e.g., the somatic cell, is 
isolated. 

20 

Antisense 

Also provided by the invention are antisense polynucleotides that recognize and 
hybridize to NgR polynucleotides. Full-length and fragment antisense polynucleotides 
are provided. Fragment antisense molecules of the invention include (i) those that 

25 specifically recognize and hybridize to NgR RNA (as determined by sequence 
comparison of DNA encoding NgR to DNA encoding other known molecules). 
Identification of sequences unique to NgR encoding polynucleotides can be deduced 
through use of any publicly available sequence database, and/or through use of 
commercially available sequence comparison programs. After identification of the 

30 desired sequences, isolation through restriction digestion or amplification using any of 
the various polymerase chain reaction techniques well known in the art can be 
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perfonned. Antisense polynucleotides are particularly relevant to regulating 
expression of NgR by those cells expressing NgR mRNA. 

Antisense oligonucleotides, or fragments of a nucleotide sequence set forth in 
SEQ ID NO:l, 3, 13 or sequences complementary or homologous thereto, derived 
5 from the nucleotide sequences of the present invention encoding NgR are useful as 
diagnostic tools for probing gene expression in various tissues. For example, tissue 
can be probed in situ with oligonucleotide probes carrying detectable groups by 
conventional autoradiography techniques to investigate native expression of this 
en2yme or pathological conditions relating thereto. In specific aspects, antisense 

10 nucleic acid molecules are provided that comprise a sequence complementary to at 

least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire NgR coding strand, or 
to only a portion thereof. Nucleic acid molecules encoding fragments, homologs, 
derivatives and analogs of a NgR protein of SEQ ID NO:2, 4 or 14 or antisense 
nucleic acids complementary to a NgR nucleic acid sequence of SEQ ID NOs: 1, 3 or 

15 13 are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence encoding NgR The term 
"coding region" refers to the region of the nucleotide sequence comprising codons 
which are translated into amino acid residues (e.g., the protein coding region of human 

20 NgR corresponds to the coding region SEQ ID NO: 1, 3 or 13). In another 

embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region" 
of the coding strand of a nucleotide sequence encoding NgR The term "noncoding 
region" refers to 5' and 3 1 sequences which flank the coding region that are not 
translated into amino acids (i.e., also referred to as 5* and 3* untranslated regions). 

25 Antisense oligonucleotides are preferably directed to regulatory regions of a 

nucleotide sequence of SEQ ID NO: 1, 3, 13 or mRNA corresponding thereto, 
including, but not limited to, the initiation codon, TATA box, enhancer sequences, and 
the like. Given the coding strand sequences encoding NgR disclosed herein (e.g., SEQ 
ID NO: 1, 3 or 13), antisense nucleic acids of the invention can be designed according 

30 to the rules of Watson and Crick or Hoogsteen base pairing. The antisense nucleic 
acid molecule can be complementary to the entire coding region of NgR mRNA, but 
more preferably is an oligonucleotide that is antisense to only a portion of the coding 
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or noncoding region of NgR mKNA. For example, the antisense oligonucleotide can 
be complementary to the region surrounding the translation start site of NgR mRNA. 
An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 
45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be 

5 constructed using chemical synthesis or enzymatic ligation reactions using procedures 
known in the art. For example, an antisense nucleic acid (e.g., an antisense 
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or 
variously modified nucleotides designed to increase the biological stability of the 
molecules or to increase the physical stability of the duplex formed between the 

10 antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 
substituted nucleotides can be used. 

Examples of modified nucleotides that can be used to generate the antisense 
nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5- 

1 5 carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, I- 
methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2- 
methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 

20 mannosylqueosine, 5 f -methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 
methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl- 
2-thiouracil, 3 -(3 -amino-3 -N-2-carboxypropyl) uracil, (acp3)w, and 2,6- 

25 diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically 
using an expression vector into which a nucleic acid has been subcloned in an antisense 
orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described further in the following 
subsection). 

30 The antisense nucleic acid molecules of the invention (preferably 

oligonucleotides of 10 to 20 nucleotides in length) are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mKNA 
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and/or genomic DNA encoding a NgR protein to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. Suppression of NgR 
expression at either the transcriptional or translational level is useful to generate 
cellular or animal models for diseases/conditions characterized by aberrant NgR 
5 expression. The hybridization can be by conventional nucleotide complementarity to 
form a stable duplex, or, for example, in the case of an antisense nucleic acid molecule 
that binds to DNA duplexes, through specific interactions in the major groove of the 
double helix. 

Phosphorothioate and methylphosphonate antisense oligonucleotides are 

10 specifically contemplated for therapeutic use by the invention. The antisense 
oligonucleotides may be further modified by adding poly-L-lysine, transferrin 
polylysine or cholesterol moieties at their 5 ! end. 

An example of a route of administration of antisense nucleic acid molecules of 
the invention includes direct injection at a tissue site. Alternatively, antisense nucleic 

15 acid molecules can be modified to target selected cells and then administered 

systemically. For example, for systemic administration, antisense molecules can be 
modified such that they specifically bind to receptors or antigens expressed on a 
selected cell surface, e.g., by linking the antisense nucleic acid molecules to peptides or 
antibodies that bind to cell surface receptors or antigens. The antisense nucleic acid 

20 molecules can also be delivered to cells using the vectors described herein. To achieve 
sufficient intracellular concentrations of antisense molecules, vector constructs in 
which the antisense nucleic acid molecule is placed under the control of a strong pol II 
or pol HI promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention 

25 is an a-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual (J-units, the strands run parallel to each other (Gaultier et al. 9 (1987) Nucleic 
Acids Res. 15, 6625-6641). The antisense nucleic acid molecule can also comprise a 
2 f -o-methylribonucleotide(Inouee/a/., (1987) Nucleic Acids Res. 15, 6131-6148) or a 

30 chimeric RNA-DNA analogue (Inouee/a/., (1987) FEBSLett. 215, 327-330). 

The NgR sequences taught in the present invention facilitate the design of 
novel transcription factors for modulating NgR expression in native cells and animals, 
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and cells transformed or transfected with NgR polynucleotides. For example, the 
Cys 2 -His2 zinc finger proteins, which bind DNA via their zinc finger domains, have 
been shown to be amenable to structural changes that lead to the recognition of 
different target sequences. These artificial zinc finger proteins recognize specific target 
5 sites with high affinity and low dissociation constants, and are able to act as gene 
switches to modulate gene expression. Knowledge of the particular NgR target 
sequence of the present invention facilitates the engineering of zinc finger proteins 
specific for the target sequence using known methods such as a combination of 
structure-based modeling and screening of phage display libraries (Segal et al, (1999) 

10 Proc. Natl Acad Sci. USA 96, 2758-2763; Liu et al, (1997) Proc. Natl Acad Set 

USA 94, 5525-5530; Greisman etal (1997) Science 275, 657-661; Choo etal, (1997) 
J. Mol Biol 273, 525-532). Each zinc finger domain usually recognizes three or more 
base pairs. Since a recognition sequence of 18 base pairs is generally sufficient in 
length to render it unique in any known genome, a zinc finger protein consisting of 6 

15 tandem repeats of zinc fingers would be expected to ensure specificity for a particular 
sequence (Segal et al, (1999), above). The artificial zinc finger repeats, designed 
based on the promoter of NgR sequences, are fused to activation or repression 
domains to promote or suppress NgR expression (Liu et al., (1997), above). The 
promoter of NgR may be obtained by standard methods known to one of ordinary skill 

20 in the art with the disclosure contained herein and knowledge of the NgR sequence. 
Alternatively, the zinc finger domains can be fused to the TATA box-binding factor 
(TBP) with varying lengths of linker region between the zinc finger peptide and the 
TBP to create either transcriptional activators or repressors (Kim et al, (1997) Proc, 
Natl Acad. Sci. USA 94, 3616-3620. Such proteins and polynucleotides that encode 

25 them, have utility for modulating NgR expression in vivo in both native cells, animals 
and humans; and/or cells transfected with NgR-encoding sequences. The novel 
transcription factor can be delivered to the target cells by transfecting constructs that 
express the transcription factor (gene therapy), or by introducing the protein. 
Engineered zinc finger proteins can also be designed to bind RNA sequences for use in 

30 therapeutics as alternatives to antisense or catalytic RNA methods (McColl et al, 
(1997) Proc. Natl Acad. Set USA 96, 9521-9526); Wu et al, (1995) Proc. Natl 
Acad Sci. USA 92, 344-348). The present invention contemplates methods of 
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designing such transcription factors based on the gene sequence of the invention, as 
well as customized zinc finger proteins, that are useful to modulate NgR expression in 
cells (native or transformed) whose genetic complement includes these sequences. 

5 Ribozymes and PNA moieties 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity that are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they 
have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes, 

10 described in Haselhoff and Gerlach (1988) Nature 334, 585-591) can be used to 
catalytically cleave NgR mRNA transcripts to thereby inhibit translation of NgR 
mRNA A ribozyme having specificity for a NgR-encoding nucleic acid can be 
designed based upon the nucleotide sequence of a NgR DNA disclosed herein (i.e., 
SEQ ID NOs:l, 3 or 13). For example, a derivative of a Tetrahymena L-19 IVS RNA 

15 can be constructed in which the nucleotide sequence of the active site is 

complementary to the nucleotide sequence to be cleaved in a NgR-encoding mRNA 
See, e.g., Cech et al. U.S. Patent No. 4,987,071; and Cech et al U.S. Patent No. 
5, 1 16,742. Alternatively, NgR mRNA can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel et al, 

20 (1993) Science 261, 1411-1418. 

Alternatively, NgR gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the NgR (e.g., the NgR 
promoter and/or enhancers) to form triple helical structures that prevent transcription 
of the NgR gene in target cells. See generally, Helene (1991) Anticancer DrugDes. 6: 

25 569-584; Helene. et al, (1992) Ann. N.Y. Acad. Sci. 660:27-36; and Maher (1992) 
BioEssays 14, 807-815. 

In various embodiments, the nucleic acids of NgR can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, 
hybridization, or solubility of the molecule. For example, the deoxyribose phosphate 

30 backbone of the nucleic acids can be modified to generate peptide nucleic acids (see 
Hyrup et al., (1996) Bioorg. Med. Chem. Lett. 4, 5-23). As used herein, the terms 
"peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, in 
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which the deoxyribose phosphate backbone is replaced by a pseudopeptide backbone 
and only the four natural nucleobases are retained. The neutral backbone of PNAs has 
been shown to allow for specific hybridization to DNA and RNA under conditions of 
low ionic strength. The synthesis of PNA oligomers can be performed using standard 
5 solid phase peptide synthesis protocols as described in Hyrup et aL, (1996) above; 
Perry-O'Keefe al, (1996) Proc. Natl. Acad Sci. USA 93,14670-14675. 

PNAs of NgR can be used in therapeutic and diagnostic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific 
modulation of gene expression by, e.g., inducing transcription or translation arrest or 

10 inhibiting replication. PNAs of NgR can also be used, e.g., in the analysis of single 
base pair mutations in a gene by, e.g., PNA directed PCR clamping; as artificial 
restriction enzymes when used in combination with other enzymes, e.g., SI nucleases 
(Hyrup (1996), above); or as probes or primers for DNA sequence and hybridization 
(Hyrup etal, (1996), above; Perry-O'Keefe (1996), above). 

15 In another embodiment, PNAs of NgR can be modified, e.g., to enhance their 

stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by 
the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques 
of drug delivery known in the art. For example, PNA-DNA chimeras of NgR can be 
generated that may combine the advantageous properties of PNA and DNA Such 

20 chimeras allow DNA recognition enzymes, e.g., RNase H and DNA polymerases, to 
interact with the DNA portion while the PNA portion would provide high binding 
affinity and specificity. PNA-DNA chimeras can be linked using linkers of appropriate 
lengths selected in terms of base stacking, number of bonds between the nucleobases, 
and orientation (Hyrup (1996), above). The synthesis of PNA-DNA chimeras can be 

25 performed as described in Hyrup (1996), above and Finn et al. (1996) Nucleic Acids 
Res. 24, 3357-3363. For example, a DNA chain can be synthesized on a solid support 
using standard phosphoramidite coupling chemistry, and modified nucleoside analogs, 
eg* 5-(4-methoxytrityl) amino- 5' -deoxy-thymi dine phosphoramidite, can be used 
between the PNA and the 5' end of DNA (Mag et al. (1989) Nucleic Acids Res. 17, 

30 973-988). PNA monomers are then coupled in a stepwise manner to produce a 

chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al. (1996), 
above). Alternatively, chimeric molecules can be synthesized with a 5' DNA segment 
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and a 3' PNA segment. See, Petersen et al (1975) Bioorg. Med. Chem. Lett. 5:1119- 
1124. 

In other embodiments, the oligonucleotide may include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
5 transport across the cell membrane (see Letsinger et al, (1989) Proc. Natl Acad. Set 
USA 86, 6553-6556; Lemaitre etal, (1987) Proc. Natl Acad. Sci. USA 84, 648-652; 
PCT Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT 
Publication No. WO 89/10134). In addition, oligonucleotides can be modified with 
hybridization triggered cleavage agents (see, e.g., Krol et al, (1988) Biotechniques 6, 

10 958-976) or intercalating agents (see, e.g., Zon (1988) Pharm. Res. 5, 539-549). To 
this end, the oligonucleotide may be conjugated to another molecule, e.g., a peptide, a 
hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
cleavage agent, etc. 

Automated sequencing methods can be used to obtain or verify the nucleotide 

1 5 sequence of NgR. The NgR nucleotide sequences of the present invention are believed 
to be 100% accurate. However, as is known in the art, nucleotide sequence obtained 
by automated methods may contain some errors. Nucleotide sequences determined by 
automation are typically at least about 90%, more typically at least about 95% to at 
least about 99.9% identical to the actual nucleotide sequence of a given nucleic acid 

20 molecule. The actual sequence may be more precisely determined using manual 

sequencing methods, which are well known in the art. An error in a sequence which 
results in an insertion or deletion of one or more nucleotides may result in a frame shift 
in translation such that the predicted amino acid sequence will differ from that which 
would be predicted from the actual nucleotide sequence of the nucleic acid molecule, 

25 starting at the point of the mutation. 

Polypeptides 

The invention also provides purified and isolated mammalian NgR polypeptides 
encoded by a polynucleotide of the invention. Presently preferred is a human NgR 
30 polypeptide comprising the amino acid sequence set forth in SEQ ID NO:2 or SEQ ID 
NO: 14. Another preferred embodiment is a mouse NgR polypeptide comprising the 
amino acid sequence of NgR3, as set forth in SEQ ID NO:4. 
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One aspect of the invention pertains to isolated NgR proteins, and biologically 
active portions thereof, or derivatives, fragments, analogs or homologs thereof. Also 
provided are polypeptide fragments suitable for use as immunogens to raise anti-NgR 
antibodies. Preferably, fragments of NgR proteins comprise at least one biological 
5 activity of NgR. In one embodiment, native NgR proteins can be isolated from cells or 
tissue sources by an appropriate purification scheme using standard protein purification 
techniques. In another embodiment, NgR proteins are produced by recombinant DNA 
techniques. Alternative to recombinant expression, a NgR protein or polypeptide can 
be synthesized chemically using standard peptide synthesis techniques. 

10 The invention also embraces polypeptides that have at least 99%, at least 95%, 

at least 90%, at least 85%, at least 80%, at least 75%, at least 70%, at least 65%, at 
least 60%, at least 55%, at least 50% or at least 45% identity and/or homology to the 
preferred polypeptide of the invention. In addition, the invention embraces 
polypeptides having the consensus sequence shown in SEQ ID NO: 6, shown in Table 

15 5) excluding the previously characterized NgR ("NgRl 11 ), and polypeptides comprising 
at least about 90% of the consensus sequence. 

The term "percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over that region of comparison, determining the number of 
positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case 

20 of nucleic acids) occurs in both sequences to yield the number of matched positions, 

dividing the number of matched positions by the total number of positions in the region 
of comparison (i.e., the window size), and multiplying the result by 100 to yield the 
percentage of sequence identity. The term "substantial identity" as used herein denotes 
a characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a 

25 sequence that has at least 80 percent sequence identity, preferably at least 85 percent 
identity and often 90 to 95 percent sequence identity, more usually at least 99 percent 
sequence identity as compared to a reference sequence over a comparison region. 

In one aspect, percent homology is calculated as the percentage of amino acid 
residues in the smaller of two sequences which align with identical amino acid residue 

30 in the sequence being compared, when four gaps in a length of 100 amino acids may be 
introduced to maximize alignment (Dayhoff, in ATLAS OF PROTEIN SEQUENCE AND 
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Structure, Vol. 5, p. 124, National Biochemical Research Foundation, Washington, 
D.C. (1972), incorporated herein by reference). 

A determination of homology or identity is typically made by a computer 
homology program known in the art. An exemplary program is the Gap program 
5 (Wisconsin Sequence Analysis Package, Version 8 for UNIX, Genetics Computer 

Group, University Research Park, Madison, WI) using the default settings, which uses 
the algorithm of Smith and Waterman (Adv. Appl Math.^ 1981, 2, 482-489, which in 
incorporated herein by reference in its entirety). Employing the GAP software 
provided in the GCG program package, (see Needleman and Wunsch (1970) J. Mol 

10 Biol. 48, 443-453) the following settings for nucleic acid sequence comparison may be 
used: GAP creation penalty of 5.0 and GAP extension penalty of 0.3, the coding 
region of the analogous nucleic acid sequences referred to above exhibits a degree of 
identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99%, with 
the CDS (encoding) part of the DNA sequence shown in SEQ ID NOs:l, 3 or 13. 

1 5 BestFit was originally written for Version 1 .0 by Paul Haeberli from a careful reading 
of the papers by Needleman and Wunsch (1970), above, and Smith and Waterman 
(1981), above. The following Bestfit settings for nucleic acid sequence comparison 
may be used: GAP creation penalty of 8.0 and GAP extension penalty of 2, the coding 
region of the analogous nucleic acid sequences referred to above exhibits a degree of 

20 identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99%, with the 
CDS (encoding) part of the amino acid sequence shown in SEQ ID NOs:2, 4 or 14. 

Alternatively, homology may be determined by hybridization analysis wherein a 
nucleic acid sequence is hybridized to the complement of a sequence encoding the 
aforementioned proteins under stringent, moderately stringent, or low stringent 

25 conditions. See e.g. Ausubel, et al. , (Eds.) Current Protocols in Molecular 
Biology, John Wiley & Sons, New York, NY, 1993, and below. 

Polypeptides of the invention may be isolated from natural cell sources or may 
be chemically synthesized, but are preferably produced by recombinant procedures 
involving host cells of the invention. 

30 An "isolated" or "purified" protein or biologically active portion thereof is 

substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the NgR protein is derived, or substantially free from 
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chemical precursors or other chemicals when chemically synthesized. The language 
"substantially free of cellular material 11 includes preparations of NgR protein in which 
the protein is separated from cellular components of the cells from which it is isolated 
or recombinantly produced. In one embodiment, the language "substantially free of 

5 cellular material" includes preparations of NgR protein having less than about 30% (by 
dry weight) of non-NgR protein (also referred to herein as a "contaminating protein"), 
more preferably less than about 20% of non-NgR protein, still more preferably less 
than about 10% of non-NgR protein, and most preferably less than about 5% non-NgR 
protein. When the NgR protein or biologically active portion thereof is recombinantly 

10 produced, it is also preferably substantially free of culture medium, i.e., culture 

medium represents less than about 20%, more preferably less than about 10%, and 
most preferably less than about 5% of the volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" 
includes preparations of NgR protein in which the protein is separated from chemical 

1 5 precursors or other chemicals that are involved in the synthesis of the protein. In one 
embodiment, the language "substantially free of chemical precursors or other 
chemicals" includes preparations of NgR protein having less than about 30% (by dry 
weight) of chemical precursors or non-NgR chemicals, more preferably less than about 
20% chemical precursors or non-NgR chemicals, still more preferably less than about 

20 10% chemical precursors or non-NgR chemicals, and most preferably less than about 
5% chemical precursors or non-NgR chemicals. 

Biologically active portions of a NgR protein include peptides comprising 
amino acid sequences sufficiently homologous to or derived from the amino acid 
sequence of the NgR protein, e.g., the amino acid sequence shown in SEQ ID NO:2, 4 

25 or 14 that include fewer amino acids than the full length NgR proteins, and exhibit at 
least one activity of a NgR protein. Typically, biologically active portions comprise a 
domain or motif with at least one activity of the NgR protein. A biologically active 
portion of a NgR protein can be a polypeptide which is, for example, 10, 25, 50, 100 
or more amino acids in length. 

30 A biologically active portion of a NgR protein of the present invention may 

contain at least one of the features that is conserved between the NgR proteins (e.g., a 
conserved cysteine as the N-terminus of the mature protein, four conserved cysteines 
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in the N-terminus before a leucine-rich region, four conserved cysteines C-terminal 
with respect to a leucine repeat region, eight leucine-rich repeats, and a hydrophobic 
C-terminus). An alternative biologically active portion of a NgR protein may contain 
at least two of the above-identified domains. Another biologically active portion of a 
5 NgR protein may contain at least three of the above-identified domains. Yet another 
biologically active portion of a NgR protein of the present invention may contain at 
least four of the above-identified domains. 

Moreover, other biologically active portions, in which other regions of the 
protein are deleted, can be prepared by recombinant techniques and evaluated for one 

1 0 or more of the functional activities of a native NgR protein. 

In an embodiment, the NgR protein has an amino acid sequence shown in SEQ 
ID NO: 2, 4 or 14. In other embodiments, the NgR protein is substantially homologous 
to SEQ ID NO:2, 4 or 14 and retains the functional activity of the protein of SEQ ID 
NO: 2, 4 or 14, yet differs in amino acid sequence due to natural allelic variation or 

15 mutagenesis, as described in detail below. 

Accordingly, in another embodiment, the NgR protein is a protein that 
comprises an amino acid sequence at least about 45% homologous to the amino acid 
sequence of SEQ ID NO.2 or SEQ ID NO:4 or SEQ ID NO: 14 and retains the 
functional activity of the NgR proteins of SEQ ID NO:2, 4 or 14. 

20 Use of mammalian host cells is expected to provide for such post-translational 

modifications (e.g., glycosylation, truncation, lipidation and phosphorylation) as may 
be needed to confer optimal biological activity on recombinant expression products of 
the invention. Glycosylated and non-glycosylated forms of NgR polypeptides are 
embraced by the invention. 

25 The invention also embraces variant (or analog) NgR polypeptides. In one 

example, insertion variants are provided wherein one or more amino acid residues 
supplement a NgR amino acid sequence. Insertions may be located at either or both 
termini of the protein, or may be positioned within internal regions of the NgR amino 
acid sequence. Insertional variants with additional residues at either or both termini 

30 can include, for example, fusion proteins and proteins including amino acid tags or 
labels. 
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Insertion variants include NgR polypeptides wherein one or more amino acid 
residues are added to a NgR acid sequence or to a biologically active fragment thereof. 

Variant products of the invention also include mature NgR products, i.e., NgR 
products wherein leader or signal sequences are removed, with additional amino 
5 terminal residues. The additional amino terminal residues may be derived from another 
protein, or may include one or more residues that are not identifiable as being derived 
from specific proteins. NgR products with an additional methionine residue at 
position -1 (Met^-NgR) are contemplated, as are variants with additional methionine 
and lysine residues at positions -2 and -1 (Met" 2 -Lys _1 -NgR). Variants of NgR with 
10 additional Met, Met-Lys, Lys residues (or one or more basic residues in general) are 
particularly useful for enhanced recombinant protein production in bacterial host cells. 

Polypeptide Variants 

The invention also embraces NgR variants having additional amino acid 

15 residues which result from use of specific expression systems. 

As used herein, a NgR "chimeric protein" or "fusion protein" comprises a NgR 
polypeptide operatively linked to a non-NgR polypeptide. A lf NgR polypeptide" refers 
to a polypeptide having an amino acid sequence corresponding to NgR, whereas a 
"non-NgR polypeptide" refers to a polypeptide having an amino acid sequence 

20 corresponding to a protein that is not homologous to the NgR protein, e.g., a protein 
that is different from the NgR protein and that is derived from the same or a different 
organism. Within a NgR fusion protein the NgR polypeptide can correspond to all or a 
portion of a NgR protein. In one embodiment, a NgR fusion protein comprises at least 
one biologically active portion of a NgR protein. In another embodiment, a NgR 

25 fusion protein comprises at least two biologically active portions of a NgR protein. In 
yet another embodiment, a NgR fusion protein comprises at least three biologically 
active portions of a NgR protein. Within the fusion protein, the term "operatively 
linked" is intended to indicate that the NgR polypeptide and the non-NgR polypeptide 
are fused in-frame to each other. The non-NgR polypeptide can be fused to the N- 

30 terminus or C-terminus of the NgR polypeptide. 

For example, in one embodiment a NgR fusion protein comprises a NgR 
domain operably linked to the extracellular domain of a second protein. Such fusion 
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proteins can be further utilized in screening assays for compounds which modulate 
NgR activity (such assays are described in detail below). 

For example, use of commercially available vectors that express a desired 
polypeptide as part of a glutathione-S-transferase (GST) fusion product provides the 
5 desired polypeptide having an additional glycine residue at position -1 after cleavage of 
the GST component from the desired polypeptide. 

In another embodiment, the fusion protein is a NgR protein containing a 
heterologous signal sequence at its N-terminus. For example, the native NgR signal 
sequence (i.e., amino acids 1-30 of SEQ DO NO:2 and amino acids 1-40 of SEQ ID 

10 NO:4) can be removed and replaced with a signal sequence from another protein. In 
certain host cells (e.g., mammalian host cells), expression and/or secretion NgR can be 
increased through use of a heterologous signal sequence. 

In yet another embodiment, the fusion protein is a NgR-immunoglobulin fusion 
protein in which the NgR sequences comprising one or more domains are fused to 

15 sequences derived from a member of the immunoglobulin protein family. The NgR- 
immunoglobulin fusion proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject to inhibit an interaction 
between NgR ligand and a NgR protein on the surface of a cell, to thereby suppress 
NgR-mediated signal transduction in vivo. NgR-immunoglobulin fusion proteins can be 

20 used to affect the bioavailability of a NgR cognate ligand. Inhibition of the NgR 
ligand/NgR interaction may be useful therapeutically for both the treatment of 
proliferative and differentiative disorders, as well as modulating (e.g., promoting or 
inhibiting) cell survival. Moreover, the NgR-immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce anti-NgR antibodies in a subject, to 

25 purify NgR ligands, and in screening assays to identify molecules that inhibit the 
interaction of NgR with NgR ligand. 

A NgR chimeric or fusion protein of the invention can be produced by standard 
recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 

30 techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, 

restriction en2yme digestion to provide for appropriate termini, filling-in of cohesive 
ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining and 
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enzymatic ligation. In another embodiment, the fusion gene can be synthesized by 
conventional techniques including automated DNA synthesizers. Alternatively, PCR 
amplification of gene fragments can be carried out using anchor primers that give rise 
to complementary overhangs between two consecutive gene fragments that can 
5 subsequently be annealed and reamplified to generate a chimeric gene sequence (see, 
for example, Ausubel et ah (Eds.) Current Protocols in Molecular Biology, 
John Wiley & Sons, 1992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide). A NgR- 
encoding nucleic acid can be cloned into such an expression vector such that the fusion 
10 moiety is linked in-frame to the NgR protein. 

Variants resulting from expression in other vector systems are also 
contemplated. 

Insertional variants also include fusion proteins wherein the amino terminus 
and/or the carboxy terminus of NgR is/are fused to another polypeptide. 

15 In another aspect, the invention provides deletion variants wherein one or more 

amino acid residues in a NgR polypeptide are removed. Deletions can be effected at 
one or both termini of the NgR polypeptide, or with removal of one or more 
non-terminal amino acid residues of NgR. Deletion variants, therefore, include all 
fragments of a NgR polypeptide. 

20 The invention also embraces polypeptide fragments of the sequence set forth in 

SEQ ID NO:2, 4 or 14 wherein the fragments maintain biological (e.g., ligand binding 
and/or intracellular signaling) immunological properties of a NgR polypeptide. 
Fragments comprising at least 4, 5, 10, 15, 20, 25, 30, 35, or 40 consecutive amino 
acids of SEQ ID NO:2, 4 or 14 are contemplated by the invention. Preferred 

25 polypeptide fragments display antigenic properties unique to, or specific for, human 
NgR and its allelic and species homologs. Fragments of the invention having the 
desired biological and immunological properties can be prepared by any of the methods 
well known and routinely practiced in the art. 

In still another aspect, the invention provides substitution variants of NgR 

30 polypeptides. Substitution variants include those polypeptides wherein one or more 
amino acid residues of a NgR polypeptide are removed and replaced with alternative 
residues. In one aspect, the substitutions are conservative in nature; however, the 
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invention embraces substitutions that are also non-conservative. Conservative 
substitutions for this purpose may be defined as set out in Tables 2, 3, or 4 below. 



Table 1. 





Column I 


Column II 


(based on a NTLRRCT 


(Rl, R2, R3) 


(R2+R3 only) 


domain) 






x, 


G, R, M 




x 2 


A,D,C 




x 3 


V, T 




x 4 


N,P,S 




x 5 


E.A.S 




x,. 


nothing, K 


nothing 


x 7 


V,M,P 




X 8 


T, V 


V 


X, 


Q,P 


Q 




Q,A 


Q 


x„ 


Q,H,N 






G,N 


N 


x 13 


L,F 


F 


X 14 


Q, A,S 




x 15 


A,S 




x 16 


V,I 






V, T, E, L 






S,G 




X 19 


L,I 






A, E, V, P 






A,S,D 




x 22 


S,T 




X23 


Q, E 
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Column I 


Column II 


(based on a NTLRRCT 


(Rl, R2, R3) 


(R2+R3 only) 


domain) 






x 24 


ivl 




.X* 


Q,H 


Q 


^26 


N,G 


N 


^27 


R,L 




^28 


T,G,R,S 




^29 


F,L,T,H 




^30 


L,V 


L 


X31 


Q,RP 






Q,P,A 


P 


^33 


G,A 


G 


X34 


H,T,S 




X35 


S.G.R 






P,S,A 




X37 


C, nothing 


nothing 


X 38 


R, nothing 


nothing 


X39 


A,N 






M,L 




X 41 


V,L,T 




' ^Q2 


T,I 


T 


X 43 


L,I 




X44 


Y,F,H 




X4S 


N,V 


N 




I,L 




X47 


T,S,A 






F,Y,T,R 




X 49 


A,H J Y J D 
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Column I 


Column II 


(based on a NTLRRCT 


(Rl, R2, R3) 


(R2+R3 only) 


domain) 








P,A 


P 


x 51 


N,S,G,A 




^52 


T,A 


T 




E,R,T 




X 54 


G,H 




x 5S 


F,L 




x 56 


V,Q,H 




x 57 


H.AJL 






E,Q 


E 




G,S 


G 




R,A 


R 


x« 


Q,H 


Q 


X 62 


R,H 


H 


x 63 


T,S 




x 64 


L,V 


L 


x 65 


AE,D 




x 66 


E,D,A 




X6 7 


Q,H 


Q 




V,E,G 






K,R 






H,Q 




x 71 


AS,T 






Y,H 




X73 


Y,D 


Y 


X 74 


K,R 




x 75 


G,Q 
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Xaa$ 


Column I 


Column II 


(based on a NTLRRCT 


(Rl, R2, R3) 


(R2+R3 only) 


domain) 






X76 


S,Q 


S 




ASJE 






P,G 


P 


X 79 


AGP 






G,N 




X 8 l 


I,V,L 




Xg2 


G,R 




X 8 3 


H.V.A 




^84 


S,A 


s 


X 85 


D,E 




X 86 


H,S,A 




X 87 


I.L 




^88 


E,L,Q 




X 89 


Y,H,A 






Q,P 


Q 


X 9l 


D,N 






I,L,T 






V,AR 






VAG 






S,T 


s 




K,R 




x, 7 


L,I 


L 




W,R,S 




-^99 


S,L 




X 100 


L,V 


L 


X 10 l 


G,T,P 
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Xaa^ 


Column I 


Column II 


(based on a NTLKRCT 


(Rl, R2 5 R3) 


(R2+R3 only) 


domain) 






^102 


Q,P,E 




■^103 


G,H,R 




^104 


I,T,V,A 




^J05 


V,G,H 




^106 


N,S 




■y 


E,G,Q 




^108 


Q,R 












Q,A 




X U L 


W,G,H 




^112 


H,RP 




^113 


K,A,H 




^114 


H,R 




^115 


D,G 




^116 


H,R,S,G 




^117 


T,M 




^118 


T,I 




x n9 


F,Y 




^120 


N,A 




^121 


S,N 




•^122 


T,AS 




^123 


E,S,A 




^124 


Q,P 




^125 


G,T 




^126 


D,E 


D 


^127 


C,A 
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Column I 


Column II 


(based on a NTLRRCT 


(Rl, R2, R3) 


(R2+R3 only) 


domain) 






X 128 


P,D 




X 129 


V,G,P,R 






A.S 




^131 


E,Q 


Q 


X132 


F,Y 


F 


X133 


GAD 




X134 


A,P 




X 13 5 


D,A,V 




X136 


G,D 




X i3 7 


A.E 




X138 


S,P 




X139 


E,A 




X 140 


L,F 




Xi4i 


RQ 




^142 


RK 


R 


X i4 3 


RK 


R 


X 144 


F,A 






G,V 




^146 


A,D,E 




X147 


T,P 




Xi4g 


A.V.S 




X149 


T,S,L 




X 15 o 


E,G,P,Q 




i X 1S i 


L,E,R 




Xis2 


RL 


R 


X 15 3 


G,D 
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Column I 


Column II 


(based on a NTLRRCT 


(Rl, R2, R3) 


(R2+R3 only) 


domain) 






^154 


Q,H,A 




Xiss 


Q,R 




^156 


K,R 




^157 


L,A,R 




■^158 


RA 


R 


^159 


V,A3 




^160 


EAN 




i -^161 


F,L 


F 


[ ^162 


RQ 




^163 


NAG 





Variant polypeptides include those wherein conservative substitutions have 
been introduced by modification of polynucleotides encoding polypeptides of the 
invention. Amino acids can be classified according to physical properties and 
1 5 contribution to secondary and tertiary protein structure. A conservative substitution is 
recognized in the art as a substitution of one amino acid for another amino acid that 
has similar properties. Exemplary conservative substitutions are set out in Table 2 
(from WO 97/09433, page 10, published March 13, 1997 (PCT/GB96/02197, filed 
9/6/96), immediately below. 

20 

Table 2 
Conservative Substitutions I 
SIDE CHAIN 

CHARACTERISTIC AMINO ACID 

Aliphatic 

Non-polar GAP 

ILV 
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Polar - uncharged C S T M 

NQ 

Polar - charged D E 

KR 

Aromatic HFWY 

Other N Q D E 

Alternatively, conservative amino acids can be grouped as described in 

Lehninger, [BIOCHEMISTRY, Second Edition; Worth Publishers, Inc. NY, NY 

(1975), pp. 71-77] as set out in Table 3, immediately below. 

Table 3 

Conservative Substitutions II 
SIDE CHAIN 



15 



CHARACTERISTIC 


AMINO ACID 


Non-polar (hydrophobic) 




A. Aliphatic: 


ALIVP 


B. Aromatic: 


FW 


C. Sulfur-containing: 


M 


D. Boderline: 


G 


Uncharged-polar 




A. Hydroxyl: 


STY 


B. Amides: 


NQ 


C. Sylfhydryl: 


C 


D. Boderline: 


G 


Positively Charged (Basic): 


KRH 


Negatively Charged (Acidic): 


DE 



20 



As still another alternative, exemplary conservative substitutions are set out in 
25 Table 4, below. 

Table 4 

Conservative Substitutions m 

Original Exemplary Substitution 

Residue 

Ala (A) Val, Leu, lie 

30 Arg(R) Lys, Gin, Asn 

Asn (N) Gin, His, Lys, Arg 

Asp (D) Glu 

Cys (C) Ser 

Gln(Q) Asn 

35 Glu(E) Asp 
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r\aiij vjin, i^ys, /vi 3 


lie (i; 


i^eu, vai, jviet, Ala, i^ne, 


i^eu yLt) 


lie, vai 3 iviet, Aia, -rue 




\Jill, /\SU 




1 Oil PU yt rl a 

JLrcU, rile, ue 


x lie [V J 


jLrCu, v «u, ue, /\ia 


Pro(P) 


Gly 


Ser (S) 


Thr 


Thr(T) 


Ser 


Trp (W) 


Tyr 


Tyr(Y) 


Trp, Phe, Thr, Ser 


Val(V) 


He, Leu, Met, Phe, Ala 



In addition, amino acid residues that are conserved among family members of 
15 the NgR proteins of the present invention, as indicated by the alignment presented 
herein, are also predicted to be particularly unamenable to alteration. For example, 
NgR proteins of the present invention can contain at least one domain that is a typically 
conserved region in NgRs. Examples of these conserved domains include, e.g., 
leucine-rich repeat domain. Amino acid residues that are not conserved or are only 
20 semi-conserved among members of the NgR proteins may be readily amenable to 
alteration. 

Full-length NgRs have an LRR region characterized by the amino acid 
consensus sequence shown in SEQ ID NO: 19. At least some full-length NgRs also 
include a CT signaling (CTS) domain and a GPI domain. 
25 The NgR domain designations used herein are defined as follows: 



Domain 


hNgRl 
SEQ ID: 5 


mNgRl 
SEQ ID 
NO: 17 


hNgR2 
SEQ ID: 2 


hNgR3 
SEQ ID: 14 


mNgR3 
SEQ ID: 4 


Signal Seq. 


1-26 


1-26 


1-30 




1-40 


LRRNT 


27-56 


27-56 


31-59 




41-69 


LRR1 


57-81 


57-81 


60-82 


5-27 


70-92 ' 


LRR2 


82-105 


82-105 


83-106 


28-51 


93-106 


LRR3 


106-130 


106-130 


107-131 


52-76 


106-141 


LRR4 


131-154 


131-154 


132-155 


77-100 


142-165 
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LRR5 


155-178 


155-178 


156-179 


101-124 


166-189 


LRR6 


179-202 


179-202 


180-203 


125-148 ! 


190-213 


LRR7 


203-226 


203-226 


204-227 


149-172 


214-237 


LRR8 


227-250 


227-250 


228-251 


173-196 


238-261 


LRRCT 


260-309 


260-309 


261-310 


206-255 


271-320 


CTS 
(CT 

Signaling) 


310^45 


310-445 


311-395 


256-396 


321-438 


GPI 


446-473 


456-473 


396-420 


370-392 


439-462 



10 

In some embodiments of the invention, the above domains are modified. 
Modification can be in a manner that preserves domain functionality. Modification can 
include addition, deletion or substitution of certain amino acids. Exemplary 
modifications include conservative amino acid substitutions. Preferably such 

15 substitutions number 20 or fewer per 100 residues. More preferably, such 

substitutions number 10 or fewer per 100 residues. Further exemplary modifications 
include addition of flanking sequences of up to five amino acids at the N terminus 
and/or C terminus of one or more of the domains. 

In some embodiments, the isolated nucleic acid molecule encodes a polypeptide 

20 at least about 70%, 80%, 90%, 95%, 98%, and most preferably at least about 99% 
homologous to SEQ ID NO:2, 4 or 14. 

Mutations can be introduced into SEQ ID NOS:l, 3 or 13 by standard 
techniques, e.g., site-directed mutagenesis and PCR-mediated mutagenesis. 
Conservative amino acid substitutions can be made at one or more amino acid residues 

25 predicted to be non-essential. Alternatively, mutations can be introduced randomly 
along a NgR coding sequence. This can be accomplished, e.g., by saturation 
mutagenesis. The resulting mutants can be screened for Ng^R. biological activity. 
Biological activities of NgR may include but are not limited to: (1) protein:protein 
interactions, e.g., with other NgRs or other cell-surface proteins involved in Nogo- 

30 related signaling; (2) complex formation with a NgR ligand; (3) binding to an anti- 
NgR antibody. 
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It should be understood that the definition of polypeptides of the invention is 
intended to include polypeptides bearing modifications other than insertion, deletion, 
or substitution of amino acid residues. By way of example, the modifications may be 
covalent in nature, and include for example, chemical bonding with polymers, lipids, 
5 other organic and inorganic moieties. Such derivatives may be prepared to increase 
circulating half-life of a polypeptide, or may be designed to improve the targeting 
capacity of the polypeptide for desired cells, tissues or organs. Similarly, the invention 
further embraces NgR polypeptides that have been covalently modified to include one 
or more water-soluble polymer attachments such as polyethylene glycol, 

10 polyoxyethylene glycol or polypropylene glycol. Variants that display ligand binding 
properties of native NgR and are expressed at higher levels, as well as variants that 
provide for constitutively active receptors, are particularly useful in assays of the 
invention; the variants are also useful in providing cellular, tissue and animal models of 
diseases/conditions characterized by aberrant NgR activity. 

15 Chemically modified NgR polypeptide compositions in which the NgR 

polypeptide is linked to a polymer are included within the scope of the present 
invention. The polymer may be water soluble to prevent precipitation of the protein in 
an aqueous environment, such as a physiological environment. Suitable water-soluble 
polymers may be selected from the group consisting o£ for example, polyethylene 

20 glycol (PEG), monomethoxypolyethylene glycol, dextran, cellulose, or other 
carbohydrate based polymers, poly-(N-vinyl pyrrolidone) polyethylene glycol, 
polypropylene glycol homopolymers, a polypropylene oxide/ethylene oxide copolymer 
polyoxyethylated polyols (e.g. glycerol) and polyvinyl alcohol. The selected polymer is 
usually modified to have a single reactive group, such as an active ester for acylation or 

25 an aldehyde for alkylation, so that the degree of polymerization may be controlled. 
Polymers may be of any molecular weight, and may be branched or unbranched, and 
mixtures of such polymers may also be used. When the chemically modified NgR 
polymer is destined for therapeutic use, pharmaceutically acceptable polymers will be 
selected for use. 

30 When the polymer is to be modified by an acylation reaction, the polymer 

should have a single reactive ester group. Alternatively, if the polymer is to be 
modified by reductive alkylation, the polymer should have a single reactive aldehyde 
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group. A preferred reactive aldehyde is polyethylene glycol propionaldehyde, which is 
water stable, or mono Cl-CIO alkoxy or aryloxy derivatives thereof (see U.S. Patent 
No. 5,252,714, incorporated by reference herein in its entirety). 

Pegylation of NgR polypeptides may be carried out by any of the pegylation 
5 reactions known in the art, as described, for example, in the following references: 

Focus on Growth Factors 3, 4-10 (1992); EP 0 154 316; and EP 0 401 384 (each of 
which is incorporated by reference herein in its entirety). Preferably, the pegylation is 
carried out via an acylation reaction or an alkylation reaction with a reactive 
polyethylene glycol molecule (or an analogous reactive water-soluble polymer). A 

10 preferred water-soluble polymer for pegylation of polypeptides such as NgR is 
polyethylene glycol (PEG). As used herein, "polyethylene glycol" is meant to 
encompass any of the forms of PEG that have been used to derivatize other proteins, 
such as mono (Cl-CIO) alkoxy- or aryloxy-polyethylene glycol. 

Chemical derivatization of NgR polypeptides may be performed under any 

1 5 suitable conditions used to react a biologically active substance with an activated 

polymer molecule. Methods for preparing pegylated NgR polypeptides will generally 
comprise the steps of (a) reacting the polypeptide with polyethylene glycol, such as a 
reactive ester or aldehyde derivative of PEG, under conditions whereby NgR 
polypeptide becomes attached to one or more PEG groups, and (b) obtaining the 

20 reaction products. It will be apparent to one of ordinary skill in the art to select the 

optimal reaction conditions or the acylation reactions based on known parameters and 
the desired result. 

Pegylated and other polymer:NgR polypeptides may generally be used to treat 
conditions that may be alleviated or modulated by administration of the NgR 

25 polypeptides described herein. However, the chemically-derivatized polymer:NgR 
polypeptide molecules disclosed herein may have additional activities, enhanced or 
reduced biological activity, or other characteristics, such as increased or decreased 
half-life, as compared to the nonderivatized molecules. The NgR polypeptides, 
fragments thereof^ variants and derivatives, may be employed alone, together, or in 

30 combination with other pharmaceutical compositions. The cytokines, growth factors, 
antibiotics, antiinflammatories and/or chemotherapeutic agents as is appropriate for the 
indication being treated. 
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The present invention provides compositions comprising purified polypeptides 
of the invention. Preferred compositions comprise, in addition to the polypeptide of 
the invention, a pharmaceutical^ acceptable (i.e., sterile and non-toxic) liquid, 
semisolid, or solid diluent that serves as a pharmaceutical vehicle, excipient or medium. 
5 Any diluent known in the art may be used. Exemplary diluents include, but are not 
limited to, water, saline solutions, polyoxyethylene sorbitan monolaurate, magnesium 
stearate, methyl- and propylhydroxybenzoate, talc, alginates, starches, lactose, sucrose, 
dextrose, sorbitol, mannitol, glycerol, calcium phosphate, mineral oil and cocoa butter. 
Variants that display ligand binding properties of native NgR and are expressed 

10 at higher levels, as well as variants that provide for constitutively active receptors, are 
particularly useful in assays of the invention; the variants are also useful in assays of the 
invention and in providing cellular, tissue and animal models of diseases/conditions 
characterized by aberrant NgR activity. 

With the knowledge of the nucleotide sequence information disclosed in the 

15 present invention, one skilled in the art can identify and obtain nucleotide sequences 

which encode NgR from different sources (i.e., different tissues or different organisms) 
through a variety of means well known to the skilled artisan and as disclosed by, for 
example, Sambrook etal, MOLECULAR CLONING: A LABORATORY MANUAL, Second 
Edition, Cold Spring Harbor Press, Cold Spring Harbor, NY (1989), which is 

20 incorporated herein by reference in its entirety. 

For example, DNA that encodes NgR may be obtained by screening of mRNA, 
cDNA, or genomic DNA with oligonucleotide probes generated from the NgR gene 
sequence information provided herein. Probes may be labeled with a detectable group, 
such as a fluorescent group, a radioactive atom or a chemiluminescent group in 

25 accordance with procedures known to the skilled artisan and used in conventional 
hybridization assays, as described by, for example, Sambrook et ah (1989) above. 

A nucleic acid molecule comprising any of the NgR nucleotide sequences 
described above can alternatively be synthesized by use of the polymerase chain 
reaction (PCR) procedure, with the PCR oligonucleotide primers produced from the 

30 nucleotide sequences provided herein. See U.S. Patent Nos. 4,683, 195 to Mullis et at 
and 4,683,202 to Mullis. The PCR reaction provides a method for selectively 
increasing the concentration of a particular nucleic acid sequence even when that 
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sequence has not been previously purified and is present only in a single copy in a 
particular sample. The method can be used to amplify either single- or 
double-stranded DNA The essence of the method involves the use of two 
oligonucleotide probes to serve as primers for the template-dependent, polymerase- 
5 mediated replication of a desired nucleic acid molecule. 

A wide variety of alternative cloning and in vitro amplification methodologies 
are well known to those skilled in the art. Examples of these techniques are found in, 
for example, Berger et al, Guide to Molecular Cloning Techniques, METHODS IN 
ENZYMOLOGY 152 Academic Press, San Diego, CA, which is incorporated herein by 
10 reference in its entirety. 

The nucleic acid molecules of the present invention, and fragments derived 
therefrom, are useful for screening for restriction fragment length polymorphism 
(RFLP) associated with certain disorders, as well as for genetic mapping. 

15 Antibodies 

Also comprehended by the present invention are antibodies (e.g., monoclonal 
and polyclonal antibodies, single chain antibodies, chimeric antibodies, 
bifunctional/bispecific antibodies, humanized antibodies, human antibodies, and 
complementary determining region (CDR)-grafted antibodies, including compounds 

20 which include CDR sequences which specifically recognize a polypeptide of the 

invention) specific for NgR or fragments thereof Preferred antibodies of the invention 
are human antibodies which are produced and identified according to methods 
described in W093/1 1236, published June 20, 1993, which is incorporated herein by 
reference in its entirety. Antibody fragments, including Fab, Fab', F(ab') 2 , and F v , are 

25 also provided by the invention. The term "specific for," when used to describe 

antibodies of the invention, indicates that the variable regions of the antibodies of the 
invention recognize and bind NgR polypeptides exclusively (i.e., are able to distinguish 
NgR polypeptides from other known NgR polypeptides by virtue of measurable 
differences in binding affinity, despite the possible existence of localized sequence 

30 identity, homology, or similarity between NgR and such polypeptides). 

The antigenic peptide of NgR comprises at least 8 amino acid residues of the 
amino acid sequence shown in SEQ ID NO:2, 4 or 14 and encompasses an epitope of 
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NgR such that an antibody raised against the peptide forms a specific immune complex 
with NgR. Preferably, the antigenic peptide comprises at least 10 amino acid residues, 
more preferably at least 15 amino acid residues, even more preferably at least 20 amino 
acid residues, and most^preferably at least 30 amino acid residues. Preferred epitopes 

5 encompassed by the antigenic peptide are regions of NgR that are located on the 
surface of the protein, e.g., hydrophilic regions. 

It will be understood that specific antibodies may also interact with other 
proteins (for example, S. aureus protein A or other antibodies in ELIS A techniques) 
through interactions with sequences outside the variable region of the antibodies, and, 

10 in particular, in the constant region of the molecule. Screening assays to determine 
binding specificity of an antibody of the invention are well known and routinely 
practiced in the art. For a comprehensive discussion of such assays, see Harlow et al 
in Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press; Cold 
Spring Harbor, NY (1988), Chapter 6. Antibodies that recognize and bind fragments 

15 of the NgR polypeptides of the invention are also contemplated, provided that the 
antibodies are specific for NgR polypeptides. Antibodies of the invention can be 
produced using any method well known and routinely practiced in the art. 

For the production of polyclonal antibodies, various suitable host animals {e.g., 
rabbit, goat, mouse or other mammal) may be immunized by injection with the native 

20 protein, or a synthetic variant thereof or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, recombinantly expressed NgR 
protein or a chemically synthesized NgR polypeptide. The preparation can further 
include an adjuvant. Various adjuvants used to increase the immunological response 
include, but are not limited to, Freund ! s (complete and incomplete), mineral gels (e.g., 

25 aluminum hydroxide), surface active substances (e.g., lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, dinitrophenol, etc.), human adjuvants such as 
Bacille Calmette-Guerin and Corynebacterium pannim or similar immunostimulatory 
agents. If desired, the antibody molecules directed against NgR can be isolated from 
the mammal (e.g., from the blood) and further purified by well known techniques, such 

30 as protein A chromatography to obtain the IgG fraction. 

The term "monoclonal antibody" or "monoclonal antibody composition," as 
used herein, refers to a population of antibody molecules that contain only one species 
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of an antigen binding site capable of immunoreacting with a particular epitope of NgR. 
A monoclonal antibody composition thus typically displays a single binding affinity for 
a particular NgR protein with which it immunoreacts. For preparation of monoclonal 
antibodies directed towards a particular NgR protein, or derivatives, fragments, 
5 analogs or homologs thereof, any technique that provides for the production of 

antibody molecules by continuous cell line culture may be utilized. Such techniques 
include, but are not limited to, the hybridoma technique (see Kohler and Milstein 
(1975) Nature 256, 495-497); the trioma technique; the human B-cell hybridoma 
technique (see Kozbor et al, (1983) Immunol Today 4, 72) and the EBV hybridoma 

10 technique to produce human monoclonal antibodies (see Cole et al, (1985) in 

MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96). 
Human monoclonal antibodies may be utilized in the practice of the present invention 
and may be produced by using human hybridomas (see Cote et al, (1983) Proc. Natl 
Acad. Set USA 80, 2026-2030) or by transforming human B-cells with Epstein Barr 

15 Virus in vitro (see Cole et al, (1985), above). 

According to the invention, techniques can be adapted for the production of 
single-chain antibodies specific to a NgR protein (see e.g., U.S. Patent No. 4,946,778). 
In addition, methods can be adapted for the construction of Fab expression libraries 
(see e.g., Huse et al, (1989) Science 246, 1275-1281) to allow rapid and effective 

20 identification of monoclonal F^ fragments with the desired specificity for a NgR 

protein or derivatives, fragments, analogs or homologs thereof. Non-human antibodies 
can be "humanized" by techniques well known in the art. See e.g., U.S. Patent No. 
5,225,539. In one method, the non-human CDRs are inserted into a human antibody 
or consensus antibody framework sequence. Further changes can then be introduced 

25 into the antibody framework to modulate affinity or immunogenicity. Antibody 

fragments that contain the idiotypes to a NgR protein may be produced by techniques 
known in the art including, but not limited to: (i) an F(ab') 2 fragment produced by 
pepsin digestion of an antibody molecule; (ii) an Fab fragment generated by reducing 
the disulfide bridges of an F(ab , ) 2 fragment; (iii) an Fab fragment generated by the 

30 treatment of the antibody molecule with papain and a reducing agent and (iv) F v 
fragments. 
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Additionally, recombinant anti-NgR antibodies, such as chimeric and 
humanized monoclonal antibodies, comprising both human and non-human portions, 
which can be made using standard recombinant DNA techniques, are within the scope 
of the invention. Such chimeric and humanized monoclonal antibodies can be 
5 produced by recombinant DNA techniques known in the art, for example using 

methods described in PCT International Application No. PCT/US86/02269; European 
Patent Application No. 184,187; European Patent Application No. 171,496; European 
Patent Application No. 173,494; PCT International Publication No. WO 86/01533; 
U.S.Pat. No. 4,816,567; European Patent Application No. 125,023; Better et al 9 

10 (1988) Science 240, 1041-1043; Liu etal, (1987) Proa Natl Acad Sci. USA 84, 
3439-3443; Liu et al, (1987) J. Immunol. 139, 3521-3526; Sun et al, (1987) Proa 
Natl Acad. Set USA 84, 214-218; Nishimura et al, (1987) Cancer Res. 47, 999-1005; 
Woodetal, (1985) Nature 314, 446-449; Shaw etal,. (1988)./ Natl. Cancer Inst. 
80, 1553-1559); Morrison (1985) Science 229, 1202-1207; Oi etal, (1986) 

15 BioTechniques 4, 214; U.S. Patent. No. 5,225,539; Jones et al, (1986) Nature 321, 
552-525; Verhoeyan et al, (1988) Science 239, 1534; and Beidler et al, (1988) J. 
Immunol 141, 4053-4060. 

In a preferred embodiment of the invention a portion of a NgR is joined to an 
Fc portion of an antibody to form a NgR/Fc fusion protein. Preferably, the Ig fusion 

20 protein is soluble. The NgR/Fc fusion protein may be formed by recombinant 

techniques as described above. In one embodiment, a portion of a NgR including the 
entire amino acid sequence of NgR except the C-terminal hydrophobic region is fused 
to an Fc portion of an antibody. In preferred embodiments, the NgR is a human NgR 
and the Fc is also human. More preferably, the human Fc portion is derived from an 

25 IgG antibody. In other embodiments, the N-terminal signal sequence is omitted. Such 
antibodies are useful in binding Nogo to prevent Nogo signaling through the NgR. 

In one embodiment, methods for the screening of antibodies that possess the 
desired specificity include, but are not limited to, enzyme-linked immunosorbent assay 
(ELISA) and other immunologically-mediated techniques known within the art. In a 

30 specific embodiment, selection of antibodies that are specific to a particular domain of 
a NgR protein is facilitated by generation of hybridomas that bind to the fragment of a 
NgR protein possessing such a domain. Antibodies that are specific for one or more 

BNSDOCiD: <WO 0229059A2_I_> 



WO 02/29059 



PCT/US01/31488 



-66- 

domains within a NgR protein, e.g., domains spanning the above-identified conserved 
regions of NgRs, or derivatives, fragments analogs or homologs thereof, are also 
provided herein. 

Anti-NgR antibodies may be used in methods known within the art relating to 
5 the localization and/or quantitation of a NgR protein (e.g. , for use in measuring levels 
of the NgR protein within appropriate physiological samples, for use in diagnostic 
methods, for use in imaging the protein, and the like). In a given embodiment, 
antibodies for NgR proteins, or derivatives, fragments analogs or homologs thereof, 
that contain the antibody derived binding domain, are utilized as pharmacologically- 

10 active compounds [hereinafter 'Therapeutics"]. 

An anti-NgR antibody (e.g., monoclonal antibody) can be used to isolate NgR 
by standard techniques, such as affinity chromatography or immunoprecipitation. An 
anti-NgR antibody can facilitate the purification of natural NgR from cells and of 
recombinantly produced NgR expressed in host cells. Moreover, an anti-NgR antibody 

15 can be used to detect NgR protein (e.g., in a cellular lysate or cell supernatant) in order 
to evaluate the abundance and pattern of expression of the NgR protein. Anti-NgR 
antibodies can be used diagnostically to monitor protein levels in tissue as part of a 
clinical testing procedure, e.g., to, for example, determine the efficacy of a given 
treatment regimen. Detection can be facilitated by coupling (i.e., physically linking) 

20 the antibody to a detectable substance. Examples of detectable substances include 
various enzymes, prosthetic groups, fluorescent materials, luminescent materials, 
bioluminescent materials and radioactive materials. Examples of suitable enzymes 
include horseradish peroxidase, alkaline phosphatase, p-galactosidase, or 
acetylcholinesterase; examples of suitable prosthetic group complexes include 

25 streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a 
luminescent material includes luminol; examples of bioluminescent materials include 
luciferase, luciferin and aequorin, and examples of suitable radioactive material include 

30 "V'V'Sor 3 !!. 

Another aspect of the present invention is directed to methods of inducing an 
immune response in a mammal against a polypeptide of the invention by administering 
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to the mammal an amount of the polypeptide sufficient to induce an immune response. 
The amount will be dependent on the animal species, size of the animal, and the like 
but can be determined by those skilled in the art. 

Another aspect of the invention is directed to anti-idiotypic antibodies and 
5 anti-anti-idiotypic antibodies. An anti-idiotypic antibody is an antibody that recognizes 
determinants of another antibody (a target antibody). Generally, the anti-idiotypic 
antibody recognizes determinants of the antigen-binding site of the target antibody. 
Typically, the target antibody is a monoclonal antibody. An anti-idiotypic antibody is 
generally prepared by immunizing an animal (particularly, mice) of the same species 

10 and genetic type as the source of the target monoclonal antibody, with the target 
monoclonal antibody. The immunized animal mounts an immune response to the 
idiotypic determinants of the target monoclonal antibody and produces antibodies 
against the idiotypic determinants of the target monoclonal antibody. 
Antibody-producing cells, such as splenic cells, of the immunized animal may be used 

15 to generate anti-idiotypic monoclonal antibodies. Furthermore, an anti-idiotypic 
antibody may also be used to immunize animals to produce anti-anti-idiotypic 
antibodies. These immunized animals may be used to generate anti-anti-idiotypic 
monoclonal antibodies using standard techniques. The anti-anti-idiotypic antibodies 
may bind to the same epitope as the original, target monoclonal antibody used to 

20 prepare the anti-idiotypic antibody. The anti-anti-idiotypic antibodies represent other 
monoclonal antibodies with the same antigen specificity as the original target 
monoclonal antibody. 

If the binding of the anti-idiotypic antibody with the target antibody is inhibited 
by the relevant antigen of the target antibody, and if the anti-idiotypic antibody induces 

25 an antibody response with the same specificity as the target antibody, it mimics the 
antigen of the target antibody. Such an anti-idiotypic antibody is an "internal image 
anti-idiotype" and is capable of inducing an antibody response as if it were the original 
antigen. (Bona and Kohler (1984) ANTI-IDIOTYPIC ANTIBODIES AND INTERNAL IMAGE, 
in Monoclonal and anti-idiotypic antibodies: Probes for receptor structure 

30 AND FUNCTION, Venter J.C. et al (Eds), Alan R. Liss, New York, NY, pp 141-149, 
1984). Vaccines incorporating internal image anti-idiotype antibodies have been 
shown to induce protective responses against viruses, bacteria, and parasites (Kennedy 
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etal, (1986)232, 220-223; 1047; McNamarae/tf/., (1985) Science 226, 1325-1326). 
Internal image anti-idiotypic antibodies have also been shown to induce immunity to 
tumor related antigens (Raychauhuri et aL, (1986) J. Immunol 137, 1743-1749; 
Raychauhuri et aL, (1987) J. Immunol 139, 3902-3910; Bhattacharya-Chatterjee et 
5 aL, (1987) J. Immunol 139, 1354-1360; Bhattacharya-Chatterjee et aL, (1988) J. 
Immunol 141, 1398-1403; Herlyn. etal (19S9) Intern. Rev. Immunol 4, 347-357; 
Chen et aL (1990) Celllmm. Immunother. Cancer 351-359; Herlyn et aL, (1991) in 
vivo 5, 615-624; Furuya e/a/. (1992) Anticancer Res. 12, 27-32; Mittelman, A. etal. 
(1992) Proc. Mtf/. ^cad CAS4 89, 466-470; Durrani etal, (1994) Cancer Res. 
10 54, 4837-4840; Mittelman. e/a/. (1994) Cancer Res. 54, 415-421; Schmitt. etal 

(1994) Hybridoma 13, 389-396; Chakrobarty. efa/. (1995) J. Immunother. 18, 
95-103; Chakrobarty. a/. (1995) Cancer Res. 55, 1525-1530; Foon, K. A. etal 

(1995) Clin. Cancer Res. 1, 1205-1294; Herlyn a/. (1995) Hybridoma 14, 159-166; 
Sclebuschefa/. (1995) Hybridoma 14, 167-174; Herlyn. efa/. (1996) Cancer 

15 Immunol Immunother. 43, 65-76). 

Anti-idiotypic antibodies for NgR may be prepared, for example, by 
immunizing an animal, such as a mouse, with a immunogenic amount of a composition 
comprising NgR2 (SEQ ID NO:2), NgR3 (SEQ ID NOs:4 or 14), or immunogenic 
portion thereof, containing at least one antigenic epitope of NgR. The composition 

20 may also contain a suitable adjuvant, and any carrier necessary to provide 

immunogenicity. Monoclonal antibodies recognizing NgR may be prepared from the 
cells of the immunized animal as described above. A monoclonal antibody recognizing 
an epitope of NgR is then selected and used to prepare a composition comprising an 
immunogenic amount of the anti-NgR monoclonal antibody. Typically, a 25 to 200 |ig 

25 dose of purified anti-NgR monoclonal would be sufficient in a suitable adjuvant. 

Animals may be immunized 2-6 times at 14 to 30 day intervals between doses. 
Typically, animals are immunized by any suitable route of administration, such as 
intraperitoneal, subcutaneous, intravenous or a combination of these. Anti-idiotypic 
antibody production may be monitored during the immunization period using standard 

30 immunoassay methods. Animals with suitable titers of antibodies reactive with the 

target monoclonal antibodies may be reimmunized with the monoclonal antibody used 
as the immunogen three days before harvesting the antibody producing cells. 
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Preferably, spleen cells are used, although other antibody producing cells may be 
selected. Antibody-producing cells are harvested and fused with myeloma cells to 
produce Hybridomas, as described above, and suitable anti-idiotypic 
antibody-producing cells are selected. 
5 Anti-anti-idiotypic antibodies are produced by another round of immunization 

and Hybridoma production by using the anti-idiotypic monoclonal antibody as the 
immunogen. 

Antibodies of the invention are useful for, e.g., therapeutic purposes (by 
modulating activity of NgR), diagnostic purposes to detect or quantitate NgR, and 
10 purification of NgR. Therefore, kits comprising an antibody of the invention for any of 
the purposes described herein are also comprehended. 

Kits 

The present invention is also directed to kits, including pharmaceutical kits. 

15 The kits can comprise any of the nucleic acid molecules described above, any of the 
polypeptides described above, or any antibody which binds to a polypeptide of the 
invention as described above, as well appropriate controls, such as positive and/or 
negative controls. The kit preferably comprises additional components, such as, for 
example, instructions, solid support, reagents helpful for quantification, and the like. 

20 For example, the kit can comprise: a labeled compound or agent capable of detecting 
NgR protein or mRNA in a biological sample; means for determining the amount of 
NgR in the sample; and means for comparing the amount of NgR in the sample with a 
standard. The compound or agent can be packaged in a suitable container. 

25 Screening Assays 

The DNA and amino acid sequence information provided by the present 
invention also makes possible identification of binding partner compounds with which 
a NgR polypeptide or polynucleotide will interact. Methods to identify binding partner 
compounds include solution assays, in vitro assays wherein NgR polypeptides are 

30 immobilized and cell-based assays. Identification of binding partner compounds of 
NgR polypeptides provides candidates for therapeutic or prophylactic intervention in 
pathologies associated with NgR normal and aberrant biological activity. 
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The invention also provides a method (also referred to herein as a "screening 
assay") for identifying modulators, i.e., candidate or test compounds or agents (e.g., 
peptides, peptidomimetics, small molecules (e.g., molecules of less than 1,000 Daltons) 
or other drugs) that bind to NgR proteins or have a stimulatory- or inhibitory effect on, 
5 for example, NgR expression or NgR activity. 

In one embodiment, the invention provides assays for screening candidate or 
test compounds which bind to or modulate the activity of a NgR protein or 
polypeptide or biologically active portion thereof. The test compounds of the present 
invention can be obtained using any of the numerous approaches in combinatorial 

10 library methods known in the art, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 
deconvolution; the "one-bead one-compound" library method; and synthetic library 
methods using affinity chromatography selection. The biological library approach is 
limited to peptide libraries, while the other four approaches are applicable to peptide, 

15 non-peptide oligomer or small molecule libraries of compounds (Lam (1997) 
Anticancer Drug Des. 12, 145). 

Examples of methods for the synthesis of molecular libraries can be found in 
the art, for example in: DeWitt et al, (1993) Proc. Natl. Acad. Sci. USA 90, 6909; Erb 
et al, (1994) Proc. Natl. Acad. Sci. USA 91,1 1422; Zuckermann et al. (1994) J. Med. 

20 Chem 37, 2678; Cho et al., (1993) Science 261, 1303; Carrell et al., (1994) Angew 
Chem. Int. Ed Engl. 33, 2059; Carell et al, (1994) Angew Chem. Int. Ed. Engl. 33, 
2061; and Gallop et al, (1994) J. Med Chem 37, 1233. 

Libraries of compounds may be presented in solution (e.g., Houghten (1992) 
BioTechniques 13, 412-421), or on beads (Lam (1991) Nature 354, 82-84), on chips 

25 (Fodor (1993) Nature 364, 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), 
spores (Ladner, above), plasmids (Cull etal (1992) Proc. Natl Acad. Sci. USA 89, 
1865-1869) or on phage (Scott and Smith (1990) Science 249, 386-390; Devlin (1990) 
Science 249, 404-406; Cwirla etal (1990) Proc. Natl Acad Sci. USA 87, 6378-6382; 
Felici (1991) J. Mol. Biol. 222, 301-310; Ladner, above). 

30 



BNSDOC1D: <WO 0229059A2_I_> 



WO 02/29059 



PCT/US01/31488 



-71 - 

1. Cell-based Assays 

The invention also provides cell-based assays to identify binding partner 
compounds of a NgR polypeptide. In one embodiment, the invention provides a 
method comprising the steps of contacting a NgR polypeptide expressed on the surface 

5 of a cell with a candidate binding partner compound and detecting binding of the 

candidate binding partner compound to the NgR polypeptide. In another embodiment, 
an assay is a cell-based assay comprising contacting a cell expressing a membrane- 
bound form of NgR protein, or a biologically active portion thereof, on the cell surface 
with a test compound and determining the ability of the test compound to modulate 

10 (e.g., stimulate or inhibit) the activity of the NgR protein or biologically active portion 
thereof. 

In one embodiment, an assay is a cell-based assay in which a cell which 
expresses a membrane-bound form of NgR protein, or a biologically active portion 
thereof, on the cell surface is contacted with a test compound and the ability of the test 

15 compound to bind to a NgR protein determined. The cell, for example, can be of 

mammalian origin or a yeast cell. Determining the ability of the test compound to bind 
to the NgR protein can be accomplished, for example, by coupling the test compound 
with a radioisotope or enzymatic label such that binding of the test compound to the 
NgR protein or biologically active portion thereof can be determined by detecting the 

20 labeled compound in a complex. For example, test compounds can be labeled with 
125 I, 35 S, 14 C, or 3 H, either directly or indirectly, and the radioisotope detected by 
direct counting of radioemission or by scintillation counting. Alternatively, test 
compounds can be enzymatically labeled with, for example, horseradish peroxidase, 
alkaline phosphatase or luciferase, and the enzymatic label detected by determination 

25 of conversion of an appropriate substrate to product. In one embodiment, the assay 
comprises contacting a cell which expresses a membrane-bound form of NgR protein 
or a biologically active portion thereof, on the cell surface with a known compound 
which binds NgR to form an assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test compound to interact with a NgR 

30 protein, wherein determining the ability of the test compound to interact with a NgR 

protein comprises determining the ability of the test compound to preferentially bind to 
NgR or a biologically active portion thereof as compared to the known compound. 
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Determining the ability of the test compound to modulate the activity of NgR 
or a biologically active portion thereof can be accomplished, for example, by 
determining the ability of the NgR protein to bind to or interact with a NgR target 
molecule. As used herein, a "target molecule" is a molecule with which a NgR protein 
5 binds or interacts in nature, for example, a molecule on the surface of a cell which 

expresses a NgR protein, a molecule on the surface of a second cell, a molecule in the 
extracellular milieu, a molecule associated with the internal surface of a cell membrane 
or a cytoplasmic molecule. A NgR target molecule can be a non-NgR molecule or a 
NgR protein or polypeptide of the present invention. In one embodiment, a NgR 

10 target molecule is a component of a signal transduction pathway that facilitates 
transduction of an extracellular signal (e.g., a signal generated by binding of a 
compound to a membrane-bound NgR molecule) through the cell membrane and into 
the cell. The target, for example, can be a second intercellular protein that has 
catalytic activity or a protein that facilitates the association of downstream signaling 

15 molecules with NgR. In a preferred embodiment, the detection comprises detecting a 
calcium flux or other physiological event in the cell caused by the binding of the 
molecule. 

Specific binding molecules, including natural ligands and synthetic compounds, 
can be identified or developed using isolated or recombinant NgR products, NgR 
20 variants, or preferably, cells expressing such products. Binding partners are useful for 
purifying NgR products and detection or quantification of NgR products in fluid and 
tissue samples using known immunological procedures. Binding molecules are also 
manifestly useful in modulating (i.e., blocking, inhibiting or stimulating) biological 
activities of NgR, especially those activities involved in signal transduction. 

25 

2. Cell-free Assays 

(a) Direct binding: 

The invention includes several assay systems for identifying NgR binding 
partners. In solution assays, methods of the invention comprise the steps of (a) 
30 contacting a NgR polypeptide with one or more candidate binding partner compounds 
and (b) identifying the compounds that bind to the NgR polypeptide. Identification of 
the compounds that bind the NgR polypeptide can be achieved by isolating the NgR 
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polypeptide/binding partner complex and separating the binding partner compound 
from the NgR polypeptide. An additional step of characterizing the physical, biological 
and/or biochemical properties of the binding partner compound is also comprehended 
in another embodiment of the invention. In one aspect, the NgR polypeptide/binding 
5 partner complex is isolated using an antibody immuno specific for either the NgR 
polypeptide or the candidate binding partner compound. 

In still other embodiments, either the NgR polypeptide or the candidate binding 
partner compound comprises a label or tag that facilitates its isolation, and methods of 
the invention to identify binding partner compounds include a step of isolating the NgR 
10 polypeptide/binding partner complex through interaction with the label or tag. An 
exemplary tag of this type is a poly-histidine sequence, generally around six histidine 
residues, that permits isolation of a compound so labeled using nickel chelation. Other 
labels and tags, such as the FLAG® tag (Eastman Kodak, Rochester, NY), well known 
and routinely used in the art, are embraced by the invention. 

15 

(b) Immobilized NgR 

In one variation of an in vitro assay, the invention provides a method 
comprising the steps of (a) contacting an immobilized NgR polypeptide, or a 
biologically active fragment thereof with a candidate binding partner compound and (b) 

20 detecting binding of the candidate compound to the Ng|R polypeptide. In an 

alternative embodiment, the candidate binding partner compound is immobilized and 
. binding of NgR is detected. Immobilization is accomplished using any of the methods 
well known in the art, including covalent bonding to a support, a bead or a 
chromatographic resin, as well as non-covalent, high affinity interactions such as 

25 antibody binding, or use of streptavidin/biotin binding wherein the immobilized 
compound includes a biotin moiety. Binding of a test compound to NgR, or 
interaction of NgR with a target molecule in the presence and absence of a candidate 
compound, can be accomplished in any vessel suitable for containing the reactants. 
Examples of such vessels include microtiter plates, test tubes, and micro-centrifuge 

30 tubes. In one embodiment, a fusion protein can be provided that adds a domain that 
allows one or both of the proteins to be bound to a matrix. For example, and not by 
way of limitation, GST-NgR fusion proteins or GST-target fusion proteins can be 
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adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or 
glutathione derivatized microtiter plates, that are then combined with the test 
compound or the test compound and either the non-adsorbed target protein or NgR 
protein, and the mixture is incubated under conditions conducive to complex formation 
5 (e.g., at physiological conditions for salt and pH). Following incubation, the beads or 
microtiter plate wells are washed to remove any unbound components, the matrix 
immobilized in the case of beads, and the complexes determined either directly or 
indirectly, for example, as described above. Alternatively, the complexes can be 
dissociated from the matrix, and the level of NgR binding or activity determined using 

10 standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either NgR or its target molecule can 
be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated NgR or 
target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using 

15 techniques well known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, 
IL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce 
Chemical). Alternatively, antibodies reactive with NgR or target molecules, but which 
do not interfere with binding of the NgR protein to its target molecule, can be 
derivatized to the wells of the plate, and unbound target or NgR trapped in the wells 

20 by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of 
complexes using antibodies reactive with the NgR or target molecule, as well as 
enzyme-linked assays that rely on detecting an enzymatic activity associated with the 
NgR or target molecule. 

25 Detection of binding can be accomplished (i) using a radioactive label on the 

compound that is not immobilized, (ii) using of a fluorescent label on the 
non-immobilized compound, (iii) using an antibody immunospecific for the 
non-immobilized compound, (iv) using a label on the non-immobilized compound that 
excites a fluorescent support to which the immobilized compound is attached, (v) 

30 determining the activity of the NgR, as well as other techniques well known and 
routinely practiced in the art. 
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Determining the activity of the target molecule, for example, may be 
accomplished by detecting induction of a cellular second messenger of the target (i.e. 
intracellular Ca 2+ , diacylglycerol, IP 3 , etc.), detecting catalytic/enzymatic activity of the 
target an appropriate substrate, detecting the induction of a reporter gene (comprising 
5 a NgR-responsive regulatory element operatively linked to a nucleic acid encoding a 
detectable marker, e.g., luciferase), or detecting a cellular response, for example, cell 
survival, cellular differentiation, or cell proliferation. 

(c) Competition experiments 

10 In yet another embodiment, the assay comprises contacting the NgR protein or 

biologically active portion thereof with a known compound which binds NgR to form 
an assay mixture, contacting the assay mixture with a test compound, and determining 
the ability of the test compound to interact with a NgR protein, wherein determining 
the ability of the test compound to interact with a NgR protein comprises deteimining 

15 the ability of the test compound to preferentially bind to NgR or biologically active 
portion thereof as compared to the known compound. 

In yet another embodiment, the cell-free assay comprises contacting the NgR 
protein or biologically active portion thereof with a known compound which binds 
NgR to form an assay mixture, contacting the assay mixture with a test compound, and 

20 determining the ability of the test compound to interact with a NgR protein, wherein 
determining the ability of the test compound to interact with a NgR protein comprises 
determining the ability of the NgR protein to modulate the activity of a NgR target 
molecule. 

The cell-free assays of the present invention are amenable to use of both the 
25 soluble form or the membrane-bound form of NgR. In the case of cell-free assays 
comprising the membrane-bound form of NgR, it may be desirable to utilize a 
solubilizing agent such that the membrane-bound form of NgR is maintained in 
solution. Examples of such solubilizing agents include non-ionic detergents such as n- 
octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, 
30 decanoyl-N-methylglucamide, Triton® X-100, Triton® X-l 14, Thesit®, 

Isotridecypoly(ethylene glycol ether) n , 3-(3-cholamidopropyl)dimethylamminiol-l- 
propane sulfonate (CHAPS), 3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy-l- 
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propane sulfonate (CHAPSO), or N-dodecyl-N,N-dimethyl-3-ammonio-l -propane 
sulfonate. 

Modulators 

5 Agents that modulate (i.e., increase, decrease, or block) NgR activity or 

expression may be identified by incubating a putative modulator with a cell containing 
a NgR polypeptide or polynucleotide and determining the effect of the putative 
modulator on NgR activity or expression. The selectivity of a compound that 
modulates the activity of NgR can be evaluated by comparing its effects on NgR to its 

0 effect on other NgR compounds. Selective modulators may include, for example, 

antibodies and other proteins, peptides or organic molecules which specifically bind to 
a NgR polypeptide or a NgR-encoding nucleic acid. Modulators of NgR activity will 
be therapeutically useful in treatment of diseases and physiological conditions in which 
normal or aberrant NgR activity is involved. NgR polynucleotides, polypeptides and 

5 modulators may be used in the treatment of such diseases and conditions associated 
with demyelination. NgR polynucleotides and polypeptides, as well as NgR 
modulators, may also be used in diagnostic assays for such diseases or conditions. 

Methods of the invention to identify modulators include variations on any of 
the methods described above to identify binding partner compounds, the variations 

0 including techniques wherein a binding partner compound has been identified and the 
binding assay is carried out in the presence and absence of a candidate modulator. A 
modulator is identified in those instances where binding between the NgR polypeptide 
and the binding partner compound changes in the presence of the candidate modulator 
compared to binding in the absence of the candidate modulator compound. A 

5 modulator that increases binding between the NgR polypeptide and the binding partner 
compound is described as an enhancer or activator, and a modulator that decreases 
binding between the NgR polypeptide and the binding partner compound is described 
as an inhibitor. 

In another embodiment, modulators of NgR expression may be identified in a 
0 method wherein a cell is contacted with a candidate compound and the expression of 
NgR mRNA or protein in the cell is determined. The level of expression of NgR 
mRNA or protein in the presence of the candidate compound is compared to the level 
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of expression of NgR mRNA or protein in the absence of the candidate compound. 
The candidate compound can then be identified as a modulator of NgR expression 
based on this comparison. For example, when expression of NgR mRNA or protein is 
greater (statistically significantly greater) in the presence of the candidate compound 
5 than in its absence, the candidate compound is identified as a stimulator of NgR mRNA 
or protein expression. Alternatively, when expression of NgR mRNA or protein is less 
(statistically significantly less) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as an inhibitor of NgR mRNA or protein 
expression. The level of NgR mRNA or protein expression in the cells can be 
10 determined by methods described herein for detecting NgR mRNA or protein. 

High Throughput Screening 

The invention also comprehends high-throughput screening (HTS) assays to 
identify compounds that interact with or inhibit biological activity {i.e., affect 

15 enzymatic activity, binding activity, etc.) of a NgR polypeptide. HTS assays permit 
screening of large numbers of compounds in an efficient manner. Cell-based HTS 
systems are contemplated to investigate NgR receptor-ligand interaction. HTS assays 
are designed to identify "hits" or "lead compounds" having the desired property, from 
which modifications can be designed to improve the desired property. Chemical 

20 modification of the "hit" or "lead compound" is often based on an identifiable 
structure/activity relationship between the "hit" and the NgR polypeptide. 

Another aspect of the present invention is directed to methods of identifying 
compounds that bind to either NgR or nucleic acid molecules encoding NgR, 
comprising contacting NgR, or a nucleic acid molecule encoding the same, with a 

25 compound, and determining whether the compound binds NgR or a nucleic acid 

molecule encoding the same. Binding can be determined by binding assays which are 
well known to the skilled artisan, including, but not limited to, gel-shift assays, 
Western blots, radiolabeled competition assay, phage-based expression cloning, 
(^-fractionation by chromatography, co-precipitation, cross linking, interaction 

30 trap/two-hybrid analysis, southwestern analysis, ELISA, and the like, which are 

described in, for example, Ausubel et ah (Eds.), CURRENT PROTOCOLS IN MOLECULAR 
BIOLOGY, 1999, John Wiley & Sons, NY, which is incorporated herein by reference in 
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its entirety. The NgR proteins, for example, can be used as "bait proteins" in a two- 
hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,317; Zervos etal, 
(1993) Cell!!, 223-232; Madura etal., (1993) J. Biol. Chem. 268, 12046-12054; 
Bartel etal, (1993) BioTechniques 14, 920-924; Iwabucki etaL, (1993) Oncogene 8, 
5 1693-1696; and Brent WO 94/10300), to identify other proteins that bind to or interact 
with NgR ("NgR-binding proteins" or "NgR-bp") and modulate NgR activity. Such 
NgR-binding proteins are also likely to be involved in the propagation of signals by the 
NgR proteins as, for example, upstream or downstream elements of the NgR pathway. 
Other assays may be used to identify specific ligands of a Ngjt receptor, 

10 including assays that identify ligands of the target protein through measuring direct 
binding of test ligands to the target protein, as well as assays that identify ligands of 
target proteins through affinity ultrafiltration with ion spray mass spectroscopy/HPLC 
methods or other physical and analytical methods. Alternatively, such binding 
interactions are evaluated indirectly using the yeast two-hybrid system described in 

1 5 Fields et al, (1989) Nature 340, 245-246, and Fields et al 9 (1994) Trends Genet 10, 
286-292, both of which are incorporated herein by reference. The two-hybrid system 
is a genetic assay based on the modular nature of most transcription factors used for 
detecting interactions between two proteins or polypeptides. It can be used to identify 
proteins that bind to a known protein of interest, or to delineate domains or residues 

20 critical for an interaction. Variations on this methodology have been developed to 
clone genes that encode DNA binding proteins, to identify peptides that bind to a 
protein, and to screen for drugs. The two-hybrid system exploits the ability of a pair of 
interacting proteins to bring a transcription activation domain into close proximity with 
a DNA binding domain that binds to an upstream activation sequence (UAS) of a 

25 reporter gene, and is generally performed in yeast. The assay requires the construction 
of two hybrid genes encoding (1) a DNA-binding domain that is fused to a first protein 
and (2) an activation domain fused to a second protein. The DNA-binding domain 
targets the first hybrid protein to the UAS of the reporter gene; however, because most 
proteins lack an activation domain, this DNA-binding hybrid protein does not activate 

30 transcription of the reporter gene. The second hybrid protein, which contains the 

activation domain, cannot by itself activate expression of the reporter gene because it 
does not bind the UAS. However, when both hybrid proteins are present, the 
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noncovalent interaction of the first and second proteins tethers the activation domain 
to the UAS, activating transcription of the reporter gene. For example, when the first 
protein is a NgR gene product, or fragment thereof, that is known to interact with 
another protein or nucleic acid, this assay can be used to detect agents that interfere 
5 with the binding interaction. Expression of the reporter gene is monitored as different 
test agents are added to the system. The presence of an inhibitory agent results in lack 
of a reporter signal. The compounds to be screened include (which may include 
compounds that are suspected to bind NgR, or a nucleic acid molecule encoding the 
same), but are not limited to, extracellular, intracellular, biological or chemical origin. 

10 The function of the NgR gene product is unclear and no ligands have yet been 

found which bind the gene product. The yeast two-hybrid assay is useful to identify 
proteins that bind to the gene product. In an assay to identify proteins that bind to a 
NgR receptor, or fragment thereof, a fusion polynucleotide encoding both a NgR 
receptor (or fragment) and a UAS binding domain (i.e., a first protein) may be used. 

15 In addition, a large number of hybrid genes each encoding a different second protein 
fused to an activation domain are produced and screened in the assay. Typically, the 
second protein is encoded by one or more members of a total cDNA or genomic DNA 
fusion library, with each second protein-coding region being fused to the activation 
domain. This system is applicable to a wide variety of proteins, and it is not even 

20 necessary to know the identity or function of the second binding protein. The system 
. is highly sensitive and can detect interactions not revealed by other methods; even 
transient interactions may trigger transcription to produce a stable mRNA that can be 
repeatedly translated to yield the reporter protein. 

Other assays may be used to search for agents that bind to the target protein. 

25 One such screening method to identify direct binding of test ligands to a target protein 
is described in U.S. Patent No. 5,585,277, incorporated herein by reference. This 
method relies on the principle that proteins generally exist as a mixture of folded and 
unfolded states, and continually alternate between the two states. When a test ligand 
binds to the folded form of a target protein (i.e., when the test ligand is a ligand of the 

30 target protein), the target protein molecule bound by the ligand remains in its folded 

state. Thus, the folded target protein is present to a greater extent in the presence of a 
test ligand which binds the target protein, than in the absence of a ligand. Binding of 
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the ligand to the target protein can be determined by any method which distinguishes 
between the folded and unfolded states of the target protein. The function of the 
target protein need not be known in order for this assay to be performed. Virtually any 
agent can be assessed by this method as a test ligand, including, but not limited to, 
5 metals, polypeptides, proteins, lipids, polysaccharides, polynucleotides and small 
organic molecules. 

Another method for identifying ligands of a target protein is described in 
Wieboldt et al (1997) Anal Chem. 69:1683-1691, incorporated herein by reference. 
This technique screens combinatorial libraries of 20-30 agents at a time in solution 

10 phase for binding to the target protein. Agents that bind to the target protein are 
separated from other library components by simple membrane washing. The 
specifically selected molecules that are retained on the filter are subsequently liberated 
from the target protein and analyzed by HPLC and pneumatically assisted electrospray 
(ion spray) ionization mass spectroscopy. This procedure selects library components 

15 with the greatest affinity for the target protein, and is particularly useful for small 
molecule libraries. 

The methods of the invention also embrace ligands, especially neuropeptides, 
that are attached to a label, such as a radiolabel (e.g., 125 I, 35 S, 32 P, 33 P, 3 H), a 
fluorescence label, a chemiluminescent label, an enzymic label and an immunogenic 

20 label. Modulators falling within the scope of the invention include, but are not limited 
to, non-peptide molecules such as non-peptide mimetics, non-peptide allosteric 
effectors, and peptides. The NgR polypeptide or polynucleotide employed in such a 
test may either be free in solution, attached to a solid support, borne on a cell surface 
or located intracellularly or associated with a portion of a cell. One skilled in the art 

25 can, for example, measure the formation of complexes between NgR and the 
compound being tested. Alternatively, one skilled in the art can examine the 
diminution in complex formation between NgR and its substrate caused by the 
compound being tested. 

Another aspect of the present invention is directed to methods of identifying 

30 compounds which modulate (z.e., increase or decrease) activity of NgR comprising 
contacting NgR with a compound, and determining whether the compound modifies 
activity of NgR. The activity in the presence of the test compared is measured to the 
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activity in the absence of the test compound. Where the activity of the sample 
containing the test compound is higher than the activity in the sample lacking the test 
compound, the compound will have increased activity. Similarly, where the activity of 
the sample containing the test compound is lower than the activity in the sample 
5 lacking the test compound, the compound will have inhibited activity. 

The present invention is particularly useful for screening compounds by using 
NgR in any of a variety of drug screening techniques. The compounds to be screened 
include (which may include compounds which are suspected to modulate NgR 
activity), but are not limited to, extracellular, intracellular, biologic or chemical origin. 

10 The NgR polypeptide employed in such a test may be in any form, preferably, free in 
solution, attached to a solid support, borne on a cell surface or located intracellularly. 
One skilled in the art can, for example, measure the formation of complexes between 
NgR and the compound being tested. Alternatively, one skilled in the art can examine 
the diminution in complex formation between Nogo-R and its substrate caused by the 

15 compound being tested. 

The activity of NgR polypeptides of the invention can be determined by, for 
example, examining the ability to bind or be activated by chemically synthesized 
peptide ligands. Alternatively, the activity of the NgR can be assayed by examining 
their ability to bind calcium ions, hormones, chemokines, neuropeptides, 

20 neurotransmitters, nucleotides, lipids, odorants and photons. Alternatively, the activity 
of the NgR can be determined by examining the activity of effector molecules 
including, but not limited to, adenylate cyclase, phospholipases and ion channels. 
Thus, modulators of NgR activity may alter a NgR receptor function, such as a binding 
property of a receptor or an activity. In various embodiments of the method, the assay 

25 may take the form of an ion flux assay, a yeast growth assay, a non-hydrolyzable GTP 
assay such as a [ 35 S]-GTP S assay, a cAMP assay, an inositol triphosphate assay, a 
diacylglycerol assay, an Aequorin assay, a Luciferase assay, a FLIPR assay for 
intracellular Ca concentration, a mitogenesis assay, a MAP Kinase activity assay, an 
arachidonic acid release assay (e.g., using [ 3 H]-arachidonic acid) and an assay for 

30 extracellular acidification rates, as well as other binding or function-based assays of 
NgR activity that are generally known in the art. NgR activity can be determined by 
methodologies that are used to assay for FaRP activity, which is well known to those 
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skilled in the art. Biological activities of NgR receptors according to the invention 
include, but are not limited to, the binding of a natural or an unnatural iigand, as well 
as any one of the functional activities of NgRs known in the art. Non-limiting 
examples of NgR activities include transmembrane signaling of various forms, which 
5 may involve phosphatidylinositol (PI) association and/or the exertion of an influence 
over PI; another exemplary activity of NgRs is the binding of accessory proteins or 
polypeptides that differ from known GPI proteins. 

The modulators of the invention exhibit a variety of chemical structures, which 
can be generally grouped into non-peptide mimetics of natural NgR receptor ligands, 

10 peptide and non-peptide allosteric effectors of NgR receptors, and peptides that may 
function as activators or inhibitors (competitive, uncompetitive and non-competitive) 
(e.g. , antibody products) of NgR receptors. The invention does not restrict the 
sources for suitable modulators, which may be obtained from natural sources such as 
plant, animal or mineral extracts, or non-natural sources such as small molecule 

1 5 libraries, including the products of combinatorial chemical approaches to library 
construction, and peptide libraries. 

Other assays can be used to examine enzymatic activity including, but not 
limited to, photometric, radiometric, HPLC, electrochemical, and the like, which are 
described in, for example, Enzyme Assays: A Practical Approach, Eisenthal and 

20 Danson (Eds.), 1992, Oxford University Press, which is incorporated herein by 
reference in its entirety. 

The use of cDNAs in drug discovery programs is well-known; assays capable 
of testing thousands of unknown compounds per day in high-throughput screens 
(HTSs) are thoroughly documented. The literature is replete with examples of the use 

25 of radiolabeled ligands in HTS binding assays for drug discovery (see Williams (1991) 
Med Res. Rev., 11, 147-184; Sweetnam etal, (1993) J. Nat. Prod. 56, 441-455 for 
review). Recombinant receptors are preferred for binding assay HTS because they 
allow for better specificity (higher relative purity), provide the ability to generate large 
amounts of receptor material, and can be used in a broad variety of formats (see 

30 Hodgson (1992) Bio/Technology 10, 973-980; each of which is incorporated herein by 
reference in its entirety). 
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A variety of heterologous systems is available for functional expression of 
recombinant receptors that are well known to those skilled in the art. Such systems 
include bacteria (Strosberg et al. (1992) Trends Pharmacol. Scl 13, 95-98), yeast 
(Pausch (1997) Trends Biotechnol 15, 487-494), several kinds of insect cells (Vanden 

5 Broeck (1996) Int. Rev. Cytol 164, 189-268), amphibian cells (Jayawickreme et al 
(1997) Curr. Opin. Biotechnol. 8, 629-634) and several mammalian cell lines (CHO, 
HEK293, COS, etc.; see Gerhardt et al. (1997) Eur. J. Pharmacol 334, 1-23). These 
examples do not preclude the use of other possible cell expression systems, including 
cell lines obtained from nematodes (PCT application WO 98/37177). 

0 In preferred embodiments of the invention, methods of screening for 

compounds which modulate NgR activity comprise contacting test compounds with 
NgR and assaying for the presence of a complex between the compound and NgR. In 
such assays, the ligand is typically labeled. After suitable incubation, free ligand is 
separated from that present in bound form, and the amount of free or uncomplexed 

5 label is a measure of the ability of the particular compound to bind to NgR. 

In another embodiment of the invention, high throughput screening for 
compounds having suitable binding affinity to NgR is employed. Briefly, large 
numbers of different small peptide test compounds are synthesized on a solid substrate. 
The peptide test compounds are contacted with NgR and washed. Bound NgR is then 

0 detected by methods well known in the art. Purified polypeptides of the invention can 
also be coated directly onto plates for use in the aforementioned drug screening 
techniques. In addition, non-neutralizing antibodies can be used to capture the protein 
and immobilize it on the solid support. 

Generally, an expressed NgR can be used for HTS binding assays in 

5 conjunction with its defined ligand. The identified peptide is labeled with a suitable 

125 3 35 32 

radioisotope, including, but not limited to, I, H, S or P, by methods that are 
well known to those skilled in the art. Alternatively, the peptides may be labeled by 
well-known methods with a suitable fluorescent derivative (Baindur et al (1994) Drug 
Dev. Res. 33, 373-398; Rogers (1997) DrugDiscov. Today 2, 156-160). Radioactive 
0 ligand specifically bound to the receptor in membrane preparations made from the cell 
line expressing the recombinant protein can be detected in HTS assays in one of 
several standard ways, including filtration of the receptor-ligand complex to separate 
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bound ligand from unbound ligand (Williams (1991) Med Res. Rev. 1 1, 147-184; 
Sweetnam et al (1993) J. Nat. Prod 56, 441-455). Alternative methods include a 
scintillation proximity assay (SPA) or a FlashPlate format in which such separation is 
unnecessary (Nakayama (1998) Curr. Opin. Drug Disc. Dev. 1, 85-91 Bosse et al. 
5 (1998) J. Biomol. Screening 3, 285-292). Binding of fluorescent ligands can be 
detected in various ways, including fluorescence energy transfer (FRET), direct 
spectrophotofluorometric analysis of bound ligand, or fluorescence polarization 
(Rogers (1997) DrugDiscov. Today 2, 156-160; Hill (1998) Curr. Opin. Drug Disc. 
Dev. 1, 92-97). 

10 Examples of such biological responses include, but are not limited to, the 

following: the ability to survive in the absence of a limiting nutrient in specifically 
engineered yeast cells (Pausch (1997) Trends in BiotechnoL 15, 487-494); changes in 
intracellular Ca 2+ concentration as measured by fluorescent dyes (Murphy etal. (1998) 
Cur. Opin. Drug Disc. Dev. 1, 192-199). Fluorescence changes can also be used to 

15 monitor ligand-induced changes in membrane potential or intracellular pH; an 

automated system suitable for HTS has been described for these purposes (Schroeder 
etal. (1996) J. Biomol. Screening 1, 75-80). Melanophores prepared from Xenopus 
laevis show a ligand-dependent change in pigment organization in response to 
heterologous NgR activation; this response is adaptable to HTS formats 

20 (Jayawickreme etal. (1997) Curr. Opin. BiotechnoL 8, 629-634). Assays are also 
available for the measurement of common second messengers, including cAMP, 
phosphoinositides and arachidonic acid, but these are not generally preferred for HTS. 

Preferred methods of HTS employing these receptors include permanently 
transfected CHO cells, in which agonists and antagonists can be identified by the ability 

25 to transduce the signal for the binding of Nogo in membranes prepared from these cells 
through the putative GPI anchor. In another embodiment of the invention, 
permanently transfected CHO cells could be used for the preparation of membranes 
which contain significant amounts of the recombinant receptor proteins; these 
membrane preparations would then be used in receptor binding assays, employing the 

30 radiolabelled ligand specific for the particular receptor. Alternatively, a functional 
assay, such as fluorescent monitoring of ligand-induced changes in internal Ca 
concentration or membrane potential in permanently transfected CHO cells containing 

BNSDOCID: <WO 0229059A2_I_> 



WO 02/29059 PCI7US0 1/3 1488 

-85- 

each of these receptors individually or in combination would be preferred for HTS. 
Equally preferred would be an alternative type of mammalian cell, such as HEK293 or 
COS cells, in similar formats. More preferred would be permanently transfected insect 
cell lines, such as Drosophila S2 cells. Even more preferred would be recombinant 
5 yeast cells expressing the Drosophila melanogaster receptors in HTS formats well 
known to those skilled in the art {e.g., Pausch (1997), above). 

The invention contemplates a multitude of assays to screen and identify 
inhibitors of ligand binding to NgR receptors. In one example, the NgR receptor is 
immobilized and interaction with a binding partner is assessed in the presence and 

10 absence of a candidate modulator such as an inhibitor compound. In another example, 
interaction between the NgR receptor and its binding partner is assessed in a solution 
assay, both in the presence and absence of a candidate inhibitor compound. In either 
assay, an inhibitor is identified as a compound that decreases binding between the NgR 
receptor and its binding partner. Another contemplated assay involves a variation of 

15 the di-hybrid assay wherein an inhibitor of protein/protein interactions is identified by 
detection of a positive signal in a transformed or transfected host cell, as described in 
PCT publication number WO 95/20652, published August 3, 1995. 

Candidate modulators contemplated by the invention include compounds 
selected from libraries of either potential activators or potential inhibitors. There are a 

20 number of different libraries used for the identification of small molecule modulators, 
including: (1) chemical libraries, (2) natural product libraries, and (3) combinatorial 
libraries comprised of random peptides, oligonucleotides or organic molecules. 
Chemical libraries consist of random chemical structures, some of which are analogs of 
known compounds or analogs of compounds that have been identified as "hits" or 

25 "leads" in other drug discovery screens, some of which are derived from natural 
products, and some of which arise from non-directed synthetic organic chemistry. 
Natural product libraries are collections of microorganisms, animals, plants, or marine 
organisms that are used to create mixtures for screening by: (1) fermentation and 
extraction of broths from soil, plant or marine microorganisms or (2) extraction of 

30 plants or marine organisms. Natural product libraries include polyketides, 

non-ribosomal peptides, and variants (non-naturally occurring) thereof. For a review, 
see Cane etal, Science (1998) 282, 63-68. Combinatorial iibraries are composed of 
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large numbers of peptides, oligonucleotides, or organic compounds as a mixture. 
These libraries are relatively easy to prepare by traditional automated synthesis 
methods, PCR, cloning, or proprietary synthetic methods. Of particular interest are 
non-peptide combinatorial libraries. Still other libraries of interest include peptide, 
5 protein, peptidomimetic, multiparallel synthetic collection, recombinatorial, and 

polypeptide libraries. For a review of combinatorial chemistry and libraries created 
therefrom, see Myers (1997) Curr. Opin. Biotechnol. 8, 701-707. Identification of 
modulators through use of the various libraries described herein permits modification 
of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to modulate 
10 activity. 

Still other candidate inhibitors contemplated by the invention can be designed 
and include soluble forms of binding partners, as well as such binding partners as 
chimeric, or fusion, proteins. A "binding partner" as used herein broadly encompasses 
non-peptide modulators, as well as such peptide modulators as neuropeptides other 

1 5 than natural ligands, antibodies, antibody fragments, and modified compounds 

comprising antibody domains that are immunospecific for the expression product of the 
identified NgR gene. 

Other embodiments of the invention comprise using competitive screening 
assays in which neutralizing antibodies capable of binding a polypeptide of the 

20 invention specifically compete with a test compound for binding to the polypeptide. In 
this manner, the antibodies can be used to detect the presence of any peptide that 
shares one or more antigenic determinants with NgR. Radiolabeled competitive 
binding studies are described in Lin et ah y (1997) Antimicroh Agents Chemother. 41, 
2127-213 1, the disclosure of which is incorporated herein by reference in its entirety. 

25 In other embodiments of the invention, the polypeptides of the invention are 

employed as a research tool for identification, characterization and purification of 
interacting, regulatory proteins. Appropriate labels are incorporated into the 
polypeptides of the invention by various methods known in the art and the 
polypeptides are used to capture interacting molecules. For example, molecules are 

30 incubated with the labeled polypeptides, washed to remove unbound polypeptides, and 
the polypeptide complex is quantified. Data obtained using different concentrations of 
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polypeptide are used to calculate values for the number, affinity, and association of 
polypeptide with the protein complex. 

Labeled polypeptides are also useful as reagents for the purification of 
molecules with which the polypeptide interacts including, but not limited to, inhibitors. 
5 In one embodiment of affinity purification, a polypeptide is covalently coupled to a 

chromatography column. Cells and their membranes are extracted, and various cellular 
subcomponents are passed over the column. Molecules bind to the column by virtue 
of their affinity to the polypeptide. The polypeptide-complex is recovered from the 
column, dissociated and the recovered molecule is subjected to protein sequencing. 
10 This amino acid sequence is then used to identify the captured molecule or to design 
degenerate oligonucleotides for cloning the corresponding gene from an appropriate 
cDNA library. 

Alternatively, compounds may be identified which exhibit similar properties to 
the ligand for the NgR of the invention, but which are smaller and exhibit a longer half 

1 5 time than the endogenous ligand in a human or animal body. When an organic 
compound is designed, a molecule according to the invention is used as a "lead" 
compound. The design of mimetics to known pharmaceutically active compounds is a 
well-known approach in the development of pharmaceuticals based on such "lead" 
compounds. Mimetic design, synthesis and testing are generally used to avoid 

20 randomly screening a large number of molecules for a target property. Furthermore, 
structural data deriving from the analysis of the deduced amino acid sequences 
encoded by the DNAs of the present invention are useful to design new drugs, more 
specific and therefore with a higher pharmacological potency. 

Comparison of the protein sequence of the present invention with the 

25 sequences present in all the available databases showed a significant homology with the 
transmembrane portion of G protein coupled receptors. Accordingly, computer 
modeling can be used to develop a putative tertiary structure of the proteins of the 
invention based on the available information of the transmembrane domain of other 
proteins. Thus, novel ligands based on the predicted structure of NgR can be 

30 designed. 

This invention further pertains to novel agents identified by the above-described 
screening assays and uses thereof for treatments as described herein. 
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Corapositions and Pharmaceutical Compositions 

In a particular embodiment, the novel molecules identified by the screening 
methods according to the invention are low molecular weight organic molecules, in 
which case a composition or pharmaceutical composition can be prepared thereof for 
5 oral or parenteral administration. The compositions, or pharmaceutical compositions, 
comprising the nucleic acid molecules, vectors, polypeptides, antibodies and 
compounds identified by the screening methods described herein, typically comprise 
the nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable 
carrier. As used herein, "pharmaceutically acceptable carrier 1 ' is intended to include 

10 any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, 
isotonic and absorption delaying agents, and the like, compatible with pharmaceutical 
administration. The nature of the carrier or other ingredients will depend on the 
specific route of administration and particular embodiment of the invention to be 
administered. Examples of techniques and protocols that are useful in this context are, 

1 5 inter alia , found in Remington's PHARMACEUTICAL SCIENCES, 16th ed., (1980) Osol, 
A (Ed.), which is incorporated herein by reference in its entirety. Preferred examples 
of such carriers or diluents include, but are not limited to, water, saline, Ringer ! s 
solution, dextrose solution and 5% human serum albumin. Liposomes and 
non-aqueous vehicles such as fixed oils may also be used. The use of such media and 

20 agents for pharmaceutically active substances is well known in the art. Except insofar 
as any conventional media or agent is incompatible with the active compound, use 
thereof in the compositions is contemplated. Supplementary active compounds can 
also be incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 

25 with its intended route of administration. Examples of routes of administration include 
oral and parenteral (e.g., intravenous, intradermal, subcutaneous, inhalation, 
transdermal (topical), transmucosal and rectal administration). Solutions or 
suspensions used for parenteral, intradermal or subcutaneous application can include 
the following components: a sterile diluent such as water for injection, saline solution, 

30 fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
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acid; buffers such as acetates, citrates or phosphates, and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. The pH can be adjusted with acids or 
bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation can 
be enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
5 plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion. For 
intravenous administration, suitable carriers include physiological saline, bacteriostatic 

10 water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline (PBS). 
In all cases, the composition must be sterile and should be fluid to the extent that easy 
syringeability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 

15 example, water, ethanol, polyol (for example, glycerol, propylene glycol and liquid 
polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity 
can be maintained, for example, by the use of a coating such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action of microorganisms can be achieved by various 

20 antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 

ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium 
chloride in the composition. Prolonged absorption of the injectable compositions can 
be brought about by including in the composition an agent which delays absorption, for 

25 example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
compound (e.g., a NgR protein or anti-NgR antibody) in the required amount in an 
appropriate solvent with one or a combination of ingredients enumerated above, as 
required, followed by filtered sterilization. Generally, dispersions are prepared by 

30 incorporating the active compound into a sterile vehicle that contains a basic dispersion 
medium and the required other ingredients from those enumerated above. In the case 
of sterile powders for the preparation of sterile injectable solutions, methods of 
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preparation are vacuum drying and freeze-drying that yields a powder of the active 
ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 
5 can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients 
and used in the form of tablets, troches or capsules. Oral compositions can also be 
prepared using a fluid carrier for use as a mouthwash, wherein the compound in the 
fluid carrier is applied orally and swished and expectorated or swallowed. 

10 Pharmaceutical^ compatible binding agents, and/or adjuvant materials can be included 
as part of the composition. The tablets, pills, capsules, troches and the like can contain 
any of the following ingredients, or compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel or corn starch; a lubricant 

15 such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as 
peppermint, methyl salicylate or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable 

20 propellant, e.g., a gas such as carbon dioxide or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. 
For transmucosal or transdermal administration, penetrants appropriate to the barrier 
to be permeated are used in the formulation. Such penetrants are generally known in 
the art, and include, for example, for transmucosal administration, detergents, bile 

25 salts, and fiisidic acid derivatives. Transmucosal administration can be accomplished 
through the use of nasal sprays or suppositories. For transdermal administration, the 
active compounds are formulated into ointments, salves, gels or creams as generally 
known in the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
30 conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 
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In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
5 polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 

Methods for preparation of such formulations will be apparent to those skilled in the 
art. The materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutically 

10 acceptable carriers. These can be prepared according to methods known to those 
skilled in the art, for example, as described in U.S. Patent No. 4,522,811. It is 
especially advantageous to formulate oral or parenteral compositions in dosage unit 
form for ease of administration and uniformity of dosage. Dosage unit form as used 
herein refers to physically discrete units suited as unitary dosages for the subject to be 

15 treated; each unit containing a predetermined quantity of active compound calculated 
to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active 
compound and the particular therapeutic effect to be achieved. 

20 The nucleic acid molecules of the invention can be inserted into vectors and 

used as gene therapy vectors. Gene therapy vectors can be delivered to a subject by 
any of a number of routes, e.g., as described in U.S. Patent No. 5,703,055. Delivery 
can thus also include, e.g., intravenous injection, local administration (see U.S. Patent 
No. 5,328,470) or stereotactic injection (see e.g., Chen et al (1994) Proc. Natl Acad 

25 Set USA 91, 3054-3057). The pharmaceutical preparation of the gene therapy vector 
can include the gene therapy vector in an acceptable diluent, or can comprise a slow 
release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the 
complete gene delivery vector can be produced intact from recombinant cells, e.g., 
retroviral vectors, the pharmaceutical preparation can include one or more cells that 

30 produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack or 
dispenser together with instructions for administration. 
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The dosage of these low molecular weight compounds will depend on the 
disease state or condition to be treated and other clinical factors such as weight and 
condition of the human or animal and the route of administration of the compound. 
For treating human or animals, between approximately 0.5 mg/kg of body weight to 
5 500 mg/kg of body weight of the compound can be administered. Therapy is typically 
administered at lower dosages and is continued until the desired therapeutic outcome is 
observed. 

Another aspect of the present invention is the use of the NgR nucleotide 
sequences disclosed herein for identifying homologs of the Nogo-R, in other animals, 

10 including but not limited to humans and other mammals and invertebrates. Any of the 
nucleotide sequences disclosed herein, or any portion thereof, can be used, for 
example, as probes to screen databases or nucleic acid libraries, such as, for example, 
genomic or cDNA libraries, to identify homologs using screening procedures well 
known to those skilled in the art. Accordingly, homologs having at least 50%, more 

15 preferably at least 60%, more preferably at least 70%, more preferably at least 80%, 
more preferably at least 90%, more preferably at least 95%, and most preferably at 
least 100% homology with NgR sequences can be identified. 

The present compounds and methods, including nucleic acid molecules, 
polypeptides, antibodies, compounds identified by the screening methods described 

20 herein, have a variety of pharmaceutical applications and may be used, for example, to 
treat or prevent unregulated cellular growth, such as cancer cell and tumor growth. In 
a particular embodiment, the present molecules are used in gene therapy. For a review 
of gene therapy procedures, see e.g. Anderson Science (1992) 256, 808-813, which is 
incorporated herein by reference in its entirety. 

25 The present invention also encompasses a method of agonizing (stimulating) or 

antagonizing a NgR natural binding partner associated activity in a mammal comprising 
administering to said mammal an agonist or antagonist to one of the above disclosed 
polypeptides in an amount sufficient to effect said agonism or antagonism. One 
embodiment of the present invention, then, is a method of treating diseases in a 

30 mammal with an agonist or antagonist of the protein of the present invention 
comprising administering the agonist or antagonist to a mammal in an amount 
sufficient to agonize or antagonize NgR-associated functions. 
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Methods of determining the dosages of compounds to be administered to a 
patient and modes of administering compounds to an organism are disclosed in U.S. 
Application Serial No. 08/702,282, filed August 23, 1996, and International patent 
publication number WO 96/22976, published August 1, 1996, both of which are 
5 incorporated herein by reference in their entirety, including any drawings, figures or 
tables. Those skilled in the art will appreciate that such descriptions are applicable to 
the present invention and can be easily adapted to it. 

The proper dosage depends on various factors such as the type of disease being 
treated, the particular composition being used and the size and physiological condition 

10 of the patient. Therapeutically effective doses for the compounds described herein can 
be estimated initially from cell culture and animal models. For example, a dose can be 
formulated in animal models to achieve a circulating concentration range that initially 
takes into account the IC 50 as determined in cell culture assays. The animal model data 
can be used to more accurately determine useful doses in humans. 

15 Plasma half-life and biodistribution of the drug and metabolites in the plasma, 

tumors and major organs can also be determined to facilitate the selection of drugs 
most appropriate to inhibit a disorder. Such measurements can be carried out. For 
example, HPLC analysis can be performed on the plasma of animals treated with the 
drug and the location of radiolabeled compounds can be determined using detection 

20 methods such as X-ray, CAT scan and MRI. Compounds that show potent inhibitory 
activity in the screening assays, but have poor pharmacokinetic characteristics, can be 
optimized by altering the chemical structure and retesting. In this regard, compounds 
displaying good pharmacokinetic characteristics can be used as a model. 

Toxicity studies can also be carried out by measuring the blood cell 

25 composition. For example, toxicity studies can be carried out in a suitable animal 
model as follows: (1) the compound is administered to mice (an untreated control 
mouse should also be used); (2) blood samples are periodically obtained via the tail 
vein from one mouse in each treatment group; and (3) the samples are analyzed for red 
and white blood cell counts, blood cell composition and the percent of lymphocytes 

30 versus polymorphonuclear cells. A comparison of results for each dosing regime with 
the controls indicates if toxicity is present. 
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At the termination of each toxicity study, further studies can be carried out by 
sacrificing the animals (preferably, in accordance with the American Veterinary 
Medical Association guidelines Report of the American Veterinary Medical Assoc. 
Panel on Euthanasia, (1993) J. Am. Vet. Med Assoc. 202:229-249). Representative 
5 animals from each treatment group can then be examined by gross necropsy for 

immediate evidence of metastasis, unusual illness or toxicity. Gross abnormalities in 
tissue are noted and tissues are examined histologically. Compounds causing a 
reduction in body weight or blood components are less preferred, as are compounds 
having an adverse effect on major organs. In general, the greater the adverse effect the 
10 less preferred the compound. 

For the treatment of cancers the expected daily dose of a hydrophobic 
pharmaceutical agent is between 1 to 500 mg/day, preferably 1 to 250 mg/day, and 
most preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided 
plasma levels of the active moiety are sufficient to maintain therapeutic effectiveness. 
15 Plasma levels should reflect the potency of the drug. Generally, the more potent the 
compound the lower the plasma levels necessary to achieve efficacy. 

NgR mRNA transcripts have been found in the brain and heart. SEQ ID NOs: 
1 and/or, 3 will, as detailed above, enable screening the endogenous 
neurotransmitters/hormones/Ugands which activate, agonize, or antagonize NgR and 
20 for compounds with potential utility in treating disorders including CNS disorders 

{e.g., stroke) and degenerative disorders such as those associated with demyelination. 

For example, NgR receptor activation may mediate the prevention of neurite 
outgrowth. Inhibition would be beneficial in both chronic and acute brain injury. See, 
e.g., Donovan et al., (1997) J. Neunosci. 17, 5316-5326; Turgeon et aL, (1998) J. 
25 Neurosci. 18, 6882-6891; Smith-Swintosky et aL 9 (1997) J. Neurochem. 69, 

1890-1896; Gill etaL, (1998) Brain Res. 797, 321-327; Suidanet al., (1996) Semin. 
Thromb. Hemost. 22, 125-133. 
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Pharmacogenomics 

Agents, or modulators that have a stimulatory or inhibitory effect on NgR 
activity (e.g., NgR gene expression), as identified by a screening assay described herein 
can be administered to individuals to* treat (prophylactically or therapeutically) 
5 disorders (e.g., a disease condition such as a demyelination disorder) associated with 
aberrant NgR activity. In conjunction with such treatment, the pharmacogenomics 
(i.e., the study of the relationship between an individual's genotype and that individual's 
response to a foreign compound or drug) of the individual may be considered. 
Differences in metabolism of therapeutics can lead to severe toxicity or therapeutic 

10 failure by altering the relation between dose and blood concentration of the 

pharmacologically active drug. Thus, the pharmacogenomics of the individual permits 
the selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments 
based on a consideration of the individual's genotype. Such pharmacogenomics can 
further be used to determine appropriate dosages and therapeutic regimens. 

15 Accordingly, the activity of NgR protein, expression of NgR nucleic acid or mutation 
content of NgR genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected 

20 persons. See e.g., Eichelbaum (1996) Clin. Exp. Pharmacol. Physiol. 23, 983-985 and 
Linder (1997) Clin. Chem. 43, 254-266. In general, two types of pharmacogenetic 
conditions can be differentiated. Genetic conditions transmitted as a single factor 
altering the way drugs act on the body (altered drug action) or genetic conditions 
transmitted as single factors altering the way the body acts on drugs (altered drug 

25 metabolism). These pharmacogenetic conditions can occur either as rare defects or as 
polymorphisms. For example, glucose-6-phosphate dehydrogenase (G6PD) deficiency 
is a common inherited enzymopathy in which the main clinical complication is 
haemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics, 
nitrofiirans) and consumption of fava beans. 

30 As an illustrative embodiment, the activity of drug metabolizing enzymes is a 

major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 
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(NAT 2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an 
explanation as to why some patients do not obtain the expected drug effects or show 
exaggerated drug response and serious toxicity after taking the standard and safe dose 
• of a drug. These polymorphisms are expressed in two phenotypes in the population, 
5 the extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is 
different among different populations. For example, the gene coding for CYP2D6 is 
highly polymorphic and several mutations have been identified in PM, which all lead to 
the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 
quite frequently experience exaggerated drug response and side effects when they 

10 receive standard doses. If a metabolite is the active therapeutic moiety, PM show no 
therapeutic response, as demonstrated for the analgesic effect of codeine mediated by 
its CYP2D6-formed metabolite morphine. At the other extreme are the so called ultra- 
rapid metabolizers who do not respond to standard doses. Recently, the molecular 
basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene 

15 amplification. 

Thus, the activity of NgR protein, expression of NgR nucleic acid, or mutation 
content of NgR genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping of polymorphic alleles 

20 encoding drug-metabolizing enzymes to the identification of an individual's drug 

responsiveness phenotype. This knowledge, when applied to dosing or drug selection, 
can avoid adverse reactions or therapeutic failure and thus enhance therapeutic or 
prophylactic efficiency when treating a subject with a NgR modulator, such as a 
modulator identified by one of the exemplary screening assays described herein. 

25 

Monitoring Clinical Efficacy 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression 
or activity of NgR (e.g., the ability to modulate aberrant cell proliferation and/or 
differentiation) can be applied not only in basic drug screening, but also in clinical 
30 trials. For example, the effectiveness of an agent determined by a screening assay as 
described herein to increase NgR gene expression, protein levels or upregulate NgR 
activity, can be monitored in clinical trials of subjects exhibiting decreased NgR gene 
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expression, protein levels, or downregulated NgR activity. Alternatively, the 
effectiveness of an agent determined by a screening assay to decrease NgR gene 
expression, protein levels, or downregulate NgR activity, can be monitored in clinical 
trials of subjects exhibiting increased NgRj*ene expression, protein levels, or 
5 upregulated NgR activity. In such clinical trials, the expression or activity of NgR and, 
preferably, other genes that have been implicated in, for example, a disease or disorder, 
can be used as a "read out" or markers of the immune responsiveness of a particular 
cell. 

For example, genes, including NgR, that are modulated in cells by treatment 
10 with an agent (e.g., compound, drug or small molecule) that modulates NgR activity 
(e.g., identified in a screening assay as described herein) can be identified. Thus, to 
study the effect of agents on demyelination disorders, for example, in a clinical trial, 
cells can be isolated and RNA prepared and analyzed for the levels of expression of 
NgR and other genes implicated in the disorder. The levels of gene expression (i.e., a 
15 gene expression pattern) can be quantified by Northern blot analysis or RT-PCR, as 

described herein, or alternatively by measuring the amount of protein produced by one 
of the methods as described herein or by measuring the levels of activity of NgR or 
other genes. In this way, the gene expression pattern can serve as a marker, indicative 
of the physiological response of the cells to the agent. Accordingly, this response state 
20 may be determined before, and at various points during, treatment of the individual 
with the agent. 

In one embodiment, the invention provides a method for monitoring the 
effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, 
protein, peptide, peptidomimetic, nucleic acid, small molecule, or other drug candidate 

25 identified by the screening assays described herein) comprising the steps of (/) 

obtaining a pre-administration sample from a subject prior to administration of the 
agent; (//) detecting the level of expression of a NgR protein, mRNA, or genomic 
DNA in the preadministration sample; (///) obtaining one or more post-administration 
samples from the subject; (zv) detecting the level of expression or activity of the NgR 

30 protein, mRNA, or genomic DNA in the post-administration samples; (v) comparing 
the level of expression or activity of the NgR protein, mRNA or genomic DNA in the 
pre-administration sample with the NgR protein, mRNA or genomic DNA in the post 
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administration sample or samples; and (vz) altering the administration of the agent to 
the subject accordingly. For example, increased administration of the agent may be 
desirable to increase the expression or activity of NgR to higher levels than detected, 
i.e: to increase the effectiveness of the agent. Alternatively, decreased administration 
5 of the agent may be desirable to decrease expression or activity of NgR to lower levels 
than detected, i.e., to decrease the effectiveness of the agent. 



Methods of Treatment 

The present invention provides for both prophylactic and therapeutic methods 

10 of treating a subject at risk of (or susceptible to) a disorder or having a disorder 
associated with aberrant NgR expression or activity. 

Diseases and disorders that are characterized by increased (relative to a subject 
not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that 

1 5 antagonize activity may be administered in a therapeutic or prophylactic manner. 

Therapeutics that may be utilized include, but are not limited to, (i) a NgR polypeptide, 
or analogs, derivatives, fragments or homologs thereof; (if) antibodies to a NgR 
peptide; (Hi) nucleic acids encoding a NgR peptide; (iv) administration of antisense 
nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous 

20 insertion within the coding sequences of coding sequences to a NgR peptide) are 
utilized to "knockout" endogenous function of a NgR peptide by homologous 
recombination (see, e.g., Capecchi (1989) Science 244, 1288-1292); or (v) modulators 
(i.e., inhibitors, agonists and antagonists, including additional peptide mimetic of the 
invention or antibodies specific to a peptide of the invention) that alter the interaction 

25 between a NgR peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject 
not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that 
upregulate activity may be administered in a therapeutic or prophylactic manner. 

30 Therapeutics that may be utilized include, but are not limited to, a NgR peptide, or 
analogs, derivatives, fragments or homologs thereof; or an agonist that increases 
bioavailability. 



BNSDOCID: <WO 0229059A2_!_> 



WO 02/29059 



PCT/US01/31488 



-99- 

Increased or decreased levels can be readily detected by quantifying peptide 
and/or RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and 
assaying it in vitro for RNA or peptide levels, structure and/or activity of the expressed 
peptides (or mRNAs of a NgR peptide). Methods that are well-known within the art 
5 include, but are not limited to, immunoassays (e.g., by Western blot analysis, 

immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel 
electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect 
expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, etc.). 

In one aspect, the invention provides a method for preventing, in a subject, a 

10 disease or condition associated with an aberrant NgR expression or activity, by 

administering to the subject an agent that modulates NgR expression or at least one 
NgR activity. Subjects at risk for a disease that is caused or contributed to by aberrant 
NgR expression or activity can be identified by, for example, any or a combination of 
diagnostic or prognostic assays as described herein. Administration of a prophylactic 

15 agent can occur prior to the manifestation of symptoms characteristic of the NgR 

aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its 
progression. Depending on the type of NgR aberrancy, for example, a NgR agonist or 
NgR antagonist agent can be used for treating the subject. The appropriate agent can 
be determined based on screening assays described herein. 

20 Another aspect of the invention pertains to methods of modulating NgR 

expression or activity for therapeutic purposes. The modulatory method of the 
invention involves contacting a cell with an agent that modulates one or more of the 
activities of NgR protein activity associated with the cell. An agent that modulates 
NgR protein activity can be an agent as described herein, such as a nucleic acid or a 

25 protein, a naturally-occurring cognate ligand of a NgR protein, a peptide, a NgR 

peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one 
or more NgR protein activity. Examples of such stimulatory agents include active 
NgR protein and a nucleic acid molecule encoding NgR that has been introduced into 
the cell. In another embodiment, the agent inhibits one or more NgR protein activity. 

30 Examples of such inhibitory agents include antisense NgR nucleic acid molecules and 
anti-NgR antibodies. These modulatory methods can be performed in vitro (e.g., by 
culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the 
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agent to a subject). As such, the present invention provides methods of treating an 
individual afflicted with a disease or disorder characterized by aberrant expression or 
activity of a NgR protein or nucleic acid molecule. In one embodiment, the method 
involves administering an agent (e.g., an agent identified by a screening assay described 
5 herein), or combination of agents that modulates (e.g., upregulates or downregulates) 
NgR expression or activity. In another embodiment, the method involves 
administering a NgR protein or nucleic acid molecule as therapy to compensate for 
reduced or aberrant NgR expression or activity. 



10 Gene Therapy 

Mutations in the NgR gene that result in loss of normal function of the NgR 
gene product underlie NgR human disease states. The invention comprehends gene 
therapy to restore NgR activity to treat those disease states. Delivery of a functional 
NgR gene to appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, 

15 and more particularly viral vectors (e.g., adenovirus, adeno-associated virus, or a 
retrovirus), or ex vivo by use of physical DNA transfer methods (e.g., liposomes or 
chemical treatments). See, for example, Anderson (1998) Nature, supplement to 
392(6679):25-20. For additional reviews of gene therapy technology see Friedmann 
(1989) Science 244, 1275-1281; Verma (1990) Set Am. 68-84; and Miller (1992) 

20 Nature 357, 455-460. Alternatively, it is contemplated that in other human disease 

states, preventing the expression o£ or inhibiting the activity of, NgR will be useful in 
.'treating disease states. It is contemplated that antisense therapy or gene therapy could 
be applied to negatively regulate the expression of NgR. 

The present invention provides for both prophylactic and therapeutic methods 

25 of treating a subject at risk of (or susceptible to) a disorder or having a disorder 
associated with aberrant NgR expression or activity. 

Diseases and disorders that are characterized by increased (relative to a subject 
not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that 

30 antagonize activity may be administered in a therapeutic or prophylactic manner. 

Therapeutics that may be utilized include, but are not limited to, (/) a NgR polypeptide, 
or analogs, derivatives, fragments or homologs thereof; (//) antibodies to a NgR 
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peptide; (Hi) nucleic acids encoding a NgR peptide; (iv) administration of antisense 
nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous 
insertion within the coding sequences of coding sequences to a NgR peptide) are 
utilized to "knockout" endogenous function of a NgR peptide by homologous 
5 recombination (see, e.g., Capecchi (1989) , above); or (v) modulators (i.e., inhibitors, 
agonists and antagonists, including additional peptide mimetic of the invention or 
antibodies specific to a peptide of the invention) that alter the interaction between a 
NgR peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject 

10 not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that 
upregulate activity may be administered in a therapeutic or prophylactic manner. 
Therapeutics that may be utilized include, but are not limited to, a NgR peptide, or 
analogs, derivatives, fragments or homologs thereof; or an agonist that increases 

15 bioavailability. 

Increased or decreased levels can be readily detected by quantifying peptide 
and/or RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and 
assaying it in vitro for RNA or peptide levels, structure and/or activity of the expressed 
peptides (or mRNAs of a NgR peptide). Methods that are well-known within the art 

20 include, but are not limited to, immunoassays (e.g., by Western blot analysis, 

immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel 
electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect 
expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, etc.). 

In one aspect, the invention provides a method for preventing, in a subject, a 

25 disease or condition associated with an aberrant NgR expression or activity, by 

administering to the subject an agent that modulates NgR expression or at least one 
NgR activity. Subjects at risk for a disease that is caused or contributed to by aberrant 
NgR expression or activity can be identified by, for example, any or a combination of 
diagnostic or prognostic assays as described herein. Administration of a prophylactic 

30 agent can occur prior to the manifestation of symptoms characteristic of the NgR 

aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its 
progression. Depending on the type of NgR aberrancy, for example, a NgR agonist or 
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NgR antagonist agent can be used for treating the subject. The appropriate agent can 
be determined based on screening assays described herein. 

Another aspect of the invention pertains to methods of modulating NgR 
expression or activity for therapeutic purposes. The? modulatory method of the 
5 invention involves contacting a cell with an agent that modulates one or more of the 
activities of NgR protein activity associated with the cell. An agent that modulates 
NgR protein activity can be an agent as described herein, such as a nucleic acid or a 
protein, a naturally-occurring cognate ligand of a NgR protein, a peptide, a NgR 
peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one 

10 or more NgR protein activity. Examples of such stimulatory agents include active 

NgR protein and a nucleic acid molecule encoding NgR that has been introduced into 
the cell In another embodiment, the agent inhibits one or more NgR protein activity. 
Examples of such inhibitory agents include antisense NgR nucleic acid molecules and 
anti-NgR antibodies. These modulatory methods can be performed in vitro (e.g., by 

15 culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the 
agent to a subject). As such, the present invention provides methods of treating an 
individual afflicted with a disease or disorder characterized by aberrant expression or 
activity of a NgR protein or nucleic acid molecule. In one embodiment, the method 
involves administering an agent (e.g., an agent identified by a screening assay described 

20 herein), or combination of agents that modulates (e.g., upregulates or downregulates) 
NgR expression or activity. In another embodiment, the method involves 
administering a NgR protein or nucleic acid molecule as therapy to compensate for 
reduced or aberrant NgR expression or activity. 

The present invention is not to be limited in scope by the specific embodiments 

25 described herein. Indeed, various modifications of the invention in addition to those 
described herein will become apparent to those skilled in the art from the foregoing 
description and accompanying figure. Such modifications are intended to fall within 
the scope of the appended claims. 

The following Table 5 contains the sequences of exemplary polynucleotides 

30 and polypeptides of the invention. 
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TABLES 

The following DNA sequence NgR2 <SEQ ID NO. 1> was identified in humans: 



10 



15 



20 



25 



30 



35 



40 



ATGCTGCCCGGGCTCAGGCGCCTGCTGCAAGCTCCCGCCTCGGCCTGCCTCCTGCTGATG 
CTCCTGGCCCTGCCCCTGGCGGCCCCCAGCTGCCCCATGCTCTGCACCTGCTACTCATCC 
CCGCCCACCGTGAGCTGCCAGGCCAACAACTTCTCCTCTGTGCCGCTGTCCCTGCCACCC 
AGCACTCAGCGACTCTTCCTGCAGAACAACCTCATCCGCACGCTGCGGCCAGGCACCTTT 
GGGTCCAACCTGCTCACCCTGTGGCTCTTCTCCAACAACCTCTCCACCATCTACCCGGGC 
ACTTTCCGCCACTTGCAAGCCCTGGAGGAGCTGGACCTCGGTGACAACCGGCACCTGCGC 
TCGCTGGAGCCCGACACCTTCCAGGGCCTGGAGCGGCTGCAGTCGCTGCATTTGTACCGC 
TGCCAGCTCAGCAGCCTGCCCGGCAACATCTTCCGAGGCCTGGTCAGCCTGCAGTACCTC 
TACCTCCAGGAGAACAGCCTGCTCCACCTACAGGATGACTTGTTCGCGGACCTGGCCAAC 
CTGAGCCACCTCTTCCTCCACGGGAACCGCCTGCGGCTGCTCACAGAGCACGTGTTTCGC 
GGCCTGGGCAGCCTGOACCGGCTGCTGCTGCACGGGAACCGGCTGCAGGGCGTGCACCGC 
GCGGCCTTCCGCGGCCTCAGCCGCCTCACCATCCTCTACCTGTTCAACAACAGCCTGGCC 
TCGCTGCCCGGCGAGGCGCTCGCCGACCTGCCCTCGCTCGAGTTCCTGCGGCTCAACGCT 
AACCCCTGGGCGTGCGACTGCCGCGCGCGGCCGCTCTGGGCCTGGTTCCAGCGCGCGCGC 
GTGTCCAGCTCCGACGTGACCTGCGCCACCCCCCCGGAGCGCCAGGGCCGAGACCTGCGC 
GCGCTCCGCGAGGCCGACTTCCAGGCGTGTCCGCCCGCGGCACCCACGCGGCCGGGCAGC 
CGCGCCCGCGGCAACAGCTCCTCCAACCACCTGTACGGGGTGGCCGAGGCCGGGGCGCCC 
CCAGCCGATCCCTCCACCCTCTACCGAGATCTGCCTGCCGAAGACTCGCGGGGGCGCCAG 
GGCGGGGACGCGCCTACTGAGGACGACTACTGGGGGGGCTACGGGGGTGAGGACCAGCGA 
GGGGAGCAGATGTGCCCCGGCGCTGCCTGCCAGGCGCCCCCGGACTCCCGAGGCCCTGCG 
CTCTCGGCCGGGCTCCCCAGCCCTCTGCTTTGCCTCCTGCTCCTGGTGCCCCACCACCTC 

The following amino acid sequence <SEQ ID NO. 2> is the predicted amino acid 
sequence derived from the DNA sequence of SEQ ID NO. 1 : 



MLPGLRRLLQAPASACLLLMLLALPLAAPS 
PMLCTCYSSPPTVSCQANNFSSVPLSLPPS 
QRLFLQNNLIRTLRPGTFGSNLLTLWLFSNN 
LSTIYPGTFRHLQALEELDLGDNRHLRSLEP 
DTFQGLERLQSLHLYRCQLSSLPGNIFRGL 
SLQYLYLQENSLLHLQDDLFADLANLSHLF 
HGNRLRLLTEHVFRGLGSLDRLLLHGNRLQ 
VHRAAFRGLSRLTILYLFNNSLASLPGEAL 
DLPSLEFLRLNANPWACDCRARPLWAWFQR 
RVSSSDVTCATPPERQGRDLRALREADFQA 
PPAAPTRPGSRARGNSSSNHLYGVAEAGAPP 
ADPSTLYRDLPAED SRGRQGGDAPTEDDYWG 
GYGGEDQRGEQMCPGAACQAPPDSRGPALSA 
G LP SPL LCLLLL VPHHL 



C 
T 



V 
L 
G 
A 
A 
C 
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The following DNA sequence NgR3 <SEQ ID NO. 3> was identified in mouse: 

ATGTCTTGGCAGTCTGGAACCACAGTGACACAATCTCCCGTGCAGGCTGCTCAGGTCTCA 
GGGTGCTGTGTGGAATTGCTGCTGTTGCTGCTCGCTGGAGAGCTACCTCTGGGTGGTGGT 
TGTCCTCGAGACTGTGTGTGCTACCCTGCGCCCATGACTGTCAGCTGCCAGGCACACAAC 
TTTGCTGCCATCCCGGAGGGCATCCCAGAGGACAGTGAGCGCATCTTCCTGCAGAACAAT 
CGCATCACCTTCCTCCAGCAGGGCCACTTCAGCCCCGCCATGGTCACCCTCTGGATCTAC 
TCCAACAACATCACTTTCATTGCTCCCAACACCTTCGAGGGCTTTGTGCATCTGGAGGAG 
CTAGACCTTGGAGACAACCGACAGCTGCGAACGCTGGCACCCGAGACCTTCCAAGGCCTG 
GTGAAGCTTCACGCCCTCTACCTCTATAAGTGTGGACTGAGCGCCCTGCCCGCAGGCATC 
TTTGGTGGCCTGCACAGCCTGCAGTATCTCTACTTGCAGGACAACCATATCGAGTACCTC 
CAAGATGACATCTTTGTGGACCTGGTCAATCTCAGTCACTTGTTTCTCCATGGTAACAAG 
CTATGGAGCCTGGGCCAAGGCATCTTCCGGGGCCTGGTGAACCTGGACCGGTTGCTGCTG 
CATGAGAACCAGCTACAGTGGGTTCACCACAAGGCTTTCCATGACCTCCACAGGCTAACC 
ACCCTCTTTCTCTTCAACAACAGCCTCACTGAGCTGCAGGGTGACTGTCTGGCCCCCCTG 
GTGGCCTTGGAGTTCCTTCGCCTCAATGGGAATGCTTGGGACTGTGGCTGCCGGGCACGT 
TCCCTGTGGGAATGGCTGCGAAGGTTCCGTGGCTCTAGCTCTGCTGTCCCCTGCGCGACC 
CCCGAGCTGCGGCAAGGCCAGGATCTGAAGCTGCTGAGGGTGGAGGACTTCCGGAACTGC 
ACAGGACCAGTGTCTCCTCACCAGATCAAGTCTCACACGCTTACCACCTCTGACAGGGCT 
GCCCGCAAGGAGCACCATCCGTCCCATGGGGCCTCCAGGGACAAAGGCCACCCACATGGC 
CATCCGCCTGGCTCCAGGTCAGGTTACAAGAAGGCAGGCAAGAACTGCACCAGCCACAGG 
AACCGGAACCAGATCTCTAAGGTGAGCTCTGGGAAAGAGCTTACCGAACTGCAGGACTAT 
GCCCCCGACTATCAGCACAAGTTCAGCTTTGACATCATGCCCACCGCACGACCCAAGAGG 
AAGGGCAAGTGTGCTCGCAGGACCCCCATCCGTGCCCCCAGTGGGGTGCAGCAGGCATCC 
TCAGGCACGGCCCTTGGGGCCCCACTCCTGGCCTGGATACTGGGGCTGGCAGTCACTCTC 
CGC 



30 



35 



40 



The following protein sequence <SEQ ID NO. 4> is deduced protein of SEQ ID 

NO:3: 

MSWQSGTTVTQSPVQAAQVSGCCVELLLLLL 
AGELPLGGGCPRDCVCYPAPMTVSCQAHNFA 
AIPEGIPEDSERIFLQNNRITFLQQGHFSPA 
M VTL WI Y SNNITFI APNTFEGFVHLEELDL G 
DNRQLRTLAPETFQGLVKLHALYLYKCGLSA 
LPAGIFGGLHSLQYLYLQDNHIEYLQDDIFV 
DLVNLSHLFLHGNKL WSLGQGIFRGL VNLDR 
LLLHENQLQ W VHHK AFHDLHRLTTLFLFNNS 
LTELQGDCLAPLVALEFLRLNGNAWDCGCRA 
RSLWEWLRRFRGSSS AVPCATPELRQGQDLK 
LLRVEDFRNCTGPVSPHQIKSHTLTTSDRAA 
RKEHHPSHGASRDKGHPHGHPPGSRSGYKKA 
GKNCTSHRNRNQISKVSSGKELTELQDYAPD 
YQHKFSFDIMPT ARPKRKGKCARRTPIRAPS 
GVQQASSGTALGAPLLAWILGLAVTLR 
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The following protein sequence <SEQ ID NO. 5> is NgRl from humans: 

MKRAS AGGSRLLAWVLWLQAWQVAAPCPGA 
C 

CYNEPKVTTSCPQQGLQAVPVGIPAASQRI 
FLHGNRISHVPAASFRACRNLTILWLHSNVL 
ARIDAAAFTGLALLEQLDLSDNAQLRSVDPA 
TFHGLGRLHTLHLDRCGLQELGPGLFRGLAA 
LQYLYLQDNALQALPDDTFRDLGNLTHLFLH 
GNRISSVPERAFRGLHSLDRLLLHQNRVAHV 
HPHAFRDLGRLMTLYLFANNLS ALPTEALAP 
LRALQYLRLNDNPWVCDCRARPLWAWLQKFR 
GSSSEVPCSLPQRLAGRDLKRLAANDLQGCA 
VATGPYHPIWTGRATDEEPLGLPKCCQPD AA 
DKASVLEPGRPASAGNALKGRVPPGDSPPGN 
GSGPRHINDSPFGTLPGSAEPPLTAVRPEGS 
EPPGFPTSGPRRRPGCSRKNRTRSHCRLGQ A 
GSGGGGTGDSEGSGALPSLTCSLTPLGLALV 
LWTVLGPC 

The following amino acid sequence <SEQ ID NO:6> is a Consensus Sequence of 
NgR based on homology with NgRl 

CPXXCXCYXXPXXTXSCXXXXXXXXPX 
XXPXXXXRXFLXXNXIXXXXXXXFXXXXXXXXLWX 
XSNXXXXIXXXXFXXXXXLEXLDLXDNXXLR 
XXXPXTFXGLXXLXLXLXXCXLXXLXXXXFX 
GLXXLQYLYLQXNXXXXLXDDXFXDLXNLXH 
LFLHGNXXXXXXXXXFRGLXXLDRLLLHXNX 
XXXVHXXAFXXLXRLXXLXLFXNXLXXLXXX 
XLAXLXXLXXLRLNXNXWXCXCRARXLWXWX 
XXXRXSSSXVXCXXPXXXXGXDLXXLXXXDX 
XXCXXXXXPXXPXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXGXXXXXXXXXXXX 
PPXXXSXXXXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXRXXX 
XXXXXXXXXXXXXXXXXXXXXXXXLXXXXX 
XX X XX L 



The following protein sequence <SEQ ID NO:7> is the 66 amino acid active domain 
ofNogo: 

RIYKGVIQAIQKSDEGHPFRAYLESEVAISE 
ELVQKY SNSALGHVNCTIKELRRLFLVDDLV 
DS L K 
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The following protein sequence <SEQ ID NO:8> is the amino acid sequence of the 
mature NgR2: 

CPMLCTCYSSPPTVSC Q A NNFSSVPLSLPPS 
TQRLFLQNNLIRTLRPGTFGSNLLTLWLFSN 
NLSTIYPGTFRHLQALEELDLGDNRHlRSLE 
PDTFQGLERLQSLHLYRCQLSSLPGNIFRGL 
VSLQYLYLQENSLLHLQDDLFADLANLSHLF 
LHGNRLRLLTEHVFRGLGSLDRLLLHGNRLQ 
GVHRAAFRGLSRLTILYLFNNSLASLPGEAL 
ADLPSLEFLRLNANPWACD CRARPLWAWFQR 
ARVSSSDVTCATPPERQGRDLRALREADFQA 
CPPAAPTRPGSRARGNSSSNHLYGVAEAGAP 
PADPSTLYRDLPAEDSRGRQGGDAPTEDDYW 
GGYGGEDQRGEQMCPGAACQAPPDSRGPALS 
AGLPSPLLCLLLL VP H H L 



The following protein sequence <SEQ ID NO :9> is the amino acid sequence of the 
mature NgR3 : 

CPRDCVCYPAPMTVSCQAHNFAAIPEGIPED 
SERIFLQNNRITFLQQGHFSPAMVTLWIYSN 
NITFIAPNTFEGFVHLEELDLGDNRQLRTLA 
PETFQGLVKLHALYLYKCGLSALPAGIFGGL 
HSLQYLYLQDNHIEYLQDDIFVDLVNLSHLF 
LHGNKLWSLGQGIFRGLVNLDRLLLHENQLQ 
WVHHKAFHDLHRLTTLFLFNNSLTELQGDCL 
APLVALEFLRLNGNAWDCGCRARSLWEWLRR 
FRGSSSAVPCATPELRQGQDLKLLRVEDFRN 
CTGPVSPHQIKSHTLTTSDRAARKEHHPSHG 
ASRDKGHPHGHPPGSRSGYKKAGKNCTSHRN 
RNQISKVSSGKELTELQDYAPDYQHKFSFDI 
MPTARPKRKGKCARRTPIRAPSGVQQASSGT 
ALGA PL L AW I L G L A VT LR 



The following amino acid sequence <SEQ ID NO:10> is a conserved cysteine motif 
(Cysteine domain 1) of the NgR and homologs based on the Consensus Sequence: 
CPXXCXCYXXPXXTXSC 
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The following amino acid sequence <SEQ ID NO: 1 1> is a conserved cysteine motif 

(Cysteine domain 2) of the NgR and homologs based on the Consensus Sequence: 

NXWXCXCRARXLWXWXXXXRXSSSXVXCXXP 
XXX XGXDLXXLXXXDXXXC 

5 

The following amino acid sequence <SEQ ID NO:12> is a conserved Leu cine-rich 

domain of the NgR and homologs based on the Consensus Sequence: 

RXFLXXNXIXXXXXXXFXXXXXXXXLWXXSN 
10 XXXXIXXXXFXXXXXLEXLDLXDNXXLRXXX 
PXTFXGLXXLXLXLXXCXLXXLXXXXFXGLX 
XLQYLYLQXNXXXXLXDDXFXDLXNLXHLFL 
HGNXXXXXXXXXFRGLXXLDRLLLHXNXXXX 
VHXXAFXXLXRLXXLXLFXNXLXXLXXXXLA 
15 X L XX L XX LR L 



Unless otherwise indicated, X is any amino acid. For example, X where 
indicated may be no amino acid. Additional features of the invention will be apparent 
from the following Examples. Examples 1-5 are actual, while the remaining Examples 

20 are prophetic. 

As shown by the following Examples, a gene encoding novel NgRs have been 
identified by computational analysis of DNA sequence data. The proteins encoded by 
NgR2 and NgR3 have a putative signal sequence, eight leucine-rich repeat domains in 
a conserved leucine-rich region (SEQ ID NO: 12), a conserved cysteine-rich region 

25 (SEQ ID NO: 10) N-terminal to the leucine-rich region, a second cysteine-rich domain 
(SEQ ID NO: 11) C-terminal to the leucine-rich region, and a putative 
glycophosphatidylinositol-linkage (GPI-linkage) site. NgR2 and NgR3 differ from the 
previously identified NgR sequence. The NgR homologs, when compared to known 
NgRs, show a consensus sequence (SEQ ID NOs:6). The putative mature NgR2 and 

30 NgR3 are shown in Table 5 as SEQ ID NOs: 8 and 9, respectively. 

Example 1: Tblastn query of the HTG database 

The protein sequence for the human NgR (NgRl) (SEQ ID NO:5) was used to 
query the high throughput genomic (HTG) database the use of which is familiar to 
35 those skilled in the art. The HTG database is a part of GenBank, a comprehensive 
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NIH genetic sequence database, which includes an annotated collection of all publicly 
available DNA sequences (Nucleic Acids Res. (2000) 28, 15-8). The HTG database 
includes sequences obtained from genomic DNA. Within genomic DNA, genes are 
typically encoded by multiple segments of DNA called exons. Thus when one aligns a 
5 cDNA sequence (or a protein sequence encoded by a cDNA sequence) to a genomic 
sequence, the sequence will be broken up into segments depending on the number of 
exons in the gene. 

The BLAST algorithm, which stands for Basic Local Alignment Search Tool is 
suitable for determining sequence similarity (Altschul et al 9 (1990) J. Mol Biol 215, 

10 403-410, which is incorporated herein by reference in its entirety). Software for 
performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information (http://www.ncbi.nlm.nih.gov/). The basic BLAST 
algorithm involves first identifying high scoring sequence pair (HSPs) by identifying 
short words of length W in the query sequence that either match or satisfy some 

15 positive-valued threshold score T when aligned with a word of the same length in a 
database sequence. T is referred to as the neighborhood word score threshold 
(Altschul et aL, supra). These initial neighborhood word hits act as seeds for initiating 
searches to find HSPs containing them. The word hits are extended in both directions 
along each sequence for as far as the cumulative alignment score can be increased. 

20 Extension for the word hits in each direction are halted when: 1) the cumulative 

alignment score falls off by the quantity X from its maximum achieved value; 2) the 
cumulative score goes to zero or below, due to the accumulation of one or more 
negative-scoring residue alignments; or 3) the end of either sequence is reached. The 
Blast algorithm parameters W, T and X determine the sensitivity and speed of the 

25 alignment. The Blast program uses as defaults a word length (W) of 1 1, the 

BLOSUM62 scoring matrix (see Henikoff et aL, (1992) Proa Natl Acad Set USA 
89, 10915-10919, which is incorporated herein by reference in its entirety) alignments 
(B) of 50, expectation (E) of 10, M=5, N=4, and a comparison of both strands. 

The BLAST algorithm (Karlin et aL , (1993) Proa Natl Acad Set USA 90, 

30 5873-5787, which is incorporated herein by reference) and Gapped BLAST perform a 
statistical analysis of the similarity between two sequences. One measure of similarity 
provided by the BLAST algorithm is the smallest sum probability (P(N)), which 
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provides an indication of the probability by which a match between two nucleotide or 
amino acid sequences would occur by chance. For example, a nucleic acid is 
considered similar to a NgR gene or cDNA if the smallest sum probability in 
comparison of the test nucleic acid to a NgR nucleic acid is less than about 1, 
5 preferably less than about 0. 1 3 more preferably less than about 0.01, and most 
preferably less than about 0.001. 

To query the HTG database with the NgR protein sequence, we used a 
variation of the BLAST algorithm known as the tblastn program, which compares a 
protein query sequence against a nucleotide sequence database dynamically translated 

10 in all reading frames (J. Mol Biol. (1990) 215, 403-410: Nucleic Acids Res. (1997) 
25, 3389-3402). The results of the tblastn search indicated the presence of genes in 
the database with a significant identity to the NgR In addition to finding hits to 
genomic clones which contain the human and mouse NgR genes, we found hits to 
clones where the identity was not as high, but still very significant. Three human 

15 clones were found (Accession numbers: AC068514, AC016869, AC013606) with an 
e-value of 4e-43 and one mouse clone was found (Accession No. AC021768) with an 
e-value of le-78. The three human clones all appeared to encode the same gene, so 
further analysis was confined to AC013606. 

20 Example 2: Prediction of the human NgR2 protein sequence (AC013606) 

The human NgR protein sequence aligned with two regions of translated 
sequence from nucleotide sequence AC013606, indicating that the new gene was 
encoded by at least two exons. In order to define the complete gene, we used the 
computer program GENSCAN™ (J. Mol Biol (1997) 268, 78-94) which can identify 

25 complete exon/intron structures of genes in genomic DNA. The gene prediction by 
GENESCAN™ contained seven exons. By comparing these predicted exons to the 
NgR, it was concluded that the new human gene contains two of these exons and a 
part of another (containing the initiating methionine). The predicted cDNA (mRNA) 
encoded by these three exons was assembled from AC013606 (HTG1 1; deposited 

30 March 2000; length = 143899; GenBank release 1 18.0; SEQ ID NO: 15) by combining 
nucleotides from the three exons whose coordinates are: 123292-123322 (exon 1); 
130035-130516 (exon 2); and 138589-139335 (exon 3). The sequence for this cDNA 
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sequence is SEQ ID NO:l (nucleotide sequence of human NgR2; AC013606). The 
translation of this cDNA provides the protein sequence of human NgR2 (SEQ ID 
NO:2). 

We used the protein sequence of human NgR2 as a query sequence against the 
5 human EST database. A number of hits of high significance were found indicating that 
the NgR2 mRNA is expressed in a number of tissues including fetal brain. 
Furthermore, two of these ESTs provided support for the exon structure that we 
deduced. One EST (Accession No: GBJEST19:AI346757) contains 565 nucleotides 
corresponding to amino acids 84-271 of the human NgR2 (SEQ ED No:4). This spans 

10 the second intron located between amino acids 171 and 172, and provides positive 
evidence for the splicing of exons 2 and 3 at the mRNA level. Another EST 
(GB JEST26:AI929019) contains 545 nucleotides, part of which corresponds to amino 
acids 1-75 of the human NgR2 (SEQ ID NO:2). This spans the first intron located 
between amino acids 10 and 1 1, and provides positive evidence for the splicing of 

15 exons 1 and 2 at the mRNA level. 

Example 3: Prediction of the mouse NgR3 protein sequence (AC021768) 

The human NgR protein sequence aligned with only one region of translated 
sequence from nucleotide sequence AC021768, indicating that most of the new mouse 

20 gene was encoded by one large exon. However, upon inspection, the protein encoded 
by this exon was missing an initiating methionine. In order to define the complete 
gene, we used the computer program GENSCAN as described above. The gene 
prediction by GENSCAN contained two exons; the large one found by visual 
inspection and a short one at the 5 f end which provided an initiating methionine. The 

25 predicted cDNA (mRNA) encoded by these two exons was assembled from AC021768 
(HTG14; deposited March 2000; length = 215980; GenBank release 118.0; SEQ ID 
NO: 16) by combining nucleotides from the two exons whose coordinates are: the 
complement of 164265-164325 (exon 1); and the complement of 155671-156992 
(exon 2). The sequence for this cDNA sequence is SEQ ID NO:3 (nucleotide 

30 sequence of mouse NgR3; AC021768). The translation of this cDNA provides the 
protein sequence of mouse NgR3 (SEQ ID NO:4). 
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We used the protein sequence of mouse NgR3 as a query sequence against the 
mouse EST database. One hit of high significance was found indicating that the NgR2 
mRNA is expressed in the heart. This EST (GB_EST20:AI428334) contains 463 
nucleotides, part of which correspond to amino acids 45-193 of mouse NgR3 (SEQ ID 
5 NO:4). 

Example 4: Similarity between the NgRs 

An alignment between NgRl and the two new receptors is shown in Fig. 
1 A- IB. The similarities between these proteins include: 
10 (1) The SignalP program, which locates the signal sequence cleavage 

position, predicts a cleavage before the first conserved cysteine in all the proteins. 
Thus the mature protein in all cases will have a cysteine at the N-terminus. 

(2) All proteins contain eight Leucine Rich Repeats (LRR). LRRs are short 
sequence motifs present in a number of proteins with diverse functions and cellular 

15 locations. These repeats are usually involved in protein-protein interactions. Each 
LRR is composed of a beta-alpha unit. 

(3) All three proteins contain a leucine rich repeat N-terminal domain 
(LRRNT), in which four cysteines are conserved. LRRs are often flanked by cysteine 
rich domains at both their N and C termini. 

20 (4) All three proteins contain a LRR C-terminal domain (LRRCT). The 

LRRCTs of the three NgR proteins can be distinguished from those of other LRR 
containing proteins, by the pattern of typtophans and cysteines which are completely 
conserved in this domain. 

(5) All three proteins contain a conserved cysteine in the fourth LRR 
25 domain. 

(6) All three proteins contain a conserved potential glycosylation site in the 
eighth LRR domain. 

(7) NgR2 and NgR3 have a hydrophobic C-terminus, as does NgJU, an 
indication that they probably also undergo a modification similar to NgRl, where a 

30 GPI moiety is covalently linked to a C-terminal amino acid. This allows the protein to 
remain tethered to the cell. 
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Example 5: Preparation of Nogo Proteins 

A Nogo binding assay was developed which utilizes a method widely used in 
examining semaphorin and ephrin axonal guidance function (Flanagan & 
Vanderhaeghen(1998),4rcra/. Rev. Neurosci. 21,3 09-345; Takahashi etaL (1999) 
5 Cell 99, 59-69). It involves fusing a secreted placental alkaline phosphatase (AP) 

moiety to the ligand in question to provide a biologically active receptor binding agent 
which can be detected with an extremely sensitive colorimetric assay. For Nogo, an 
expression vector is created encoding a signal peptide, a His6 tag for purification, AP, 
and the 66 amino acid active domain of Nogo, The fusion protein can be purified from 

10 the conditioned medium of transfected cells in milligram amounts. This protein is 
biologically active as a growth cone collapsing agent with an EC 50 of 1 nM. 

Alternatively, a glutathione-S-transferase Nogo (GST-Nogo) fusion protein 
may be prepared. For GST-Nogo, an expression vector (e.g., a pGEX vector) is 
created encoding a signal peptide, GST, and the 66 amino acid active domain of Nogo. 

15 GST-Nogo may be purified from the culture medium and used as a GST fusion 

protein, or GST may be cleaved from the Nogo portion of the fusion protein with an 
enzyme that recognizes the specific amino acid cleavage sit engineered between the 
GST portion and the Nogo portion of the fusion protein. Such sites are part of the 
commercially available GST vectors. The specific cleavage sites and enzymes may be 

20 used in accordance with the Manufacturer's specifications. 

It has been found that AP-Nogo is actually slightly more potent than 
GST-Nogo, perhaps because the protein is synthesized in a eukaryotic rather than a 
prokaiyotic cell. 

Binding of Nogo to immobilized NgR homologs may be performed in an 
25 ELIS A-type assay in which AP-Nogo is allowed to react with an immobilized receptor 
homolog. Specificity of binding may be demonstrated in a competitive binding assay 
using increasing amounts of GST-Nogo in the type of assay to show a decreasing 
amount of binding of AP-Nogo (as judged in the colorimetric assay). 



30 



Example 6: Transfected COS Cell binding Assays 

The homologs of the present invention may be used in transfection studies in 
COS cells to demonstrate binding of Nogo. Specifically, nucleotide sequences 
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encoding NgR2 and NgR3 may be transfected into COS cells using a suitable vector. 
Non-transfected COS-7 cells do not bind AP-Nogo. However, transfection of COS 
cells with nucleic acid sequences encoding NgRs will make them capable of binding 
Nogo. AP alone does not bind with any stable affinity to these transfected cells, 
5 indicating that any affinity of Nogo for NgR2 or NgR3 would be due to the 66 amino 
acids derived from Nogo. Furthemore, specific affinity of Nogo for the NgR2 or 
NgR3 proteins may be tested in displacement of AP-Nogo assays using GST-Nogo. 
NgR2 and/or NgR3 may also bind homologs of Nogo, which may also be tested using 
this assay. 

10 

Example 7: Expression of NgR in Human Cell Lines using Northern Blot and a 
Random-Primed Probe 

A Northern blot is purchased from a commercial source, or RNA samples from 
cells of interest are run on an agarose gel and blotted to a membrane using any of the 

1 5 well known techniques for Northern blotting. The blot is probed with a fragment of 
NgR2 (SEQ ID NO: 1) or NgR3 (SEQ ID NO:3). The probe is prepared from 50 ng 
of cDNA labeled by a random-primed method (Feinberg and Vogelstein (1983) Anal 
Biochem. 132, 6-13). Hybridization is carried out at 68°C for 1 hour in 
ExpressHybTM solution (Clontech, Cat. No. 8015-1) followed by washing with 2X 

20 SSC/0.05% SDS at room temperature and two washes with 0.1X SSC/0.1% SDS at 
50°C. Expression of NgR2 and/or NgR3 can be assessed by the presence of an 
appropriately sized band on the blot. 



Example 8: Cloning of cDNA corresponding to NgRs 

25 

To obtain the full-length clone corresponding to NgR2 from a cDNA library, 
the following method may be used. A cDNA library is generated using standard 
methods from a tissue known to contain NgR2. Such a tissue was identified in 
Example 2. 1 x 10 6 plaque forming units from the cDNA library may be screened in 
30 duplicate on OPTITRAN™ filters. The filters are hybridized with 32 P-labeled 

oligonucleotides that are generated from the ESTs corresponding to portions of NgR2. 
The hybridization reaction may consist of 400 mis plaque screen buffer (50mM Tris pH 
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7.5, lMNaCl, 0.1% Sodium pyrophosphate, 0.2% Polyvinylpryolidine and 0.2% 
Ficoll) containing 10% Dextran sulfate and 100|xg/ml tRNA and 80 pmol each 
32 P-labeled oligonucleotide at 65°C overnight. The filters are washed twice with 2X 
SSC/1%SDS and twice with IX SSC/1%SDS and exposed to film. Duplicate positives 
5 are purified. DNA from each of these clones is analyzed by restriction enzyme digest 
followed by agarose gel electrophoresis and Southern blotting. The filters are 
hybridized to the 32 P-labeled oligonucleotides used for the original hybridization to 
confirm that inserts hybridize to the probe. The insert is then sequenced to confirm that 
it represents the cDNA for NgR2. Similar methods may be used to generate a 
10 full-length clone corresponding to NgR3. 

Alternatively, a full-length clone of NgR2 or NgR3 can be obtained by a 
person of ordinary skill in the art employing conventional PGR techniques. 

Example 9: Hybridization Analysis to demonstrate NgR expression in the brain 

15 The expression of NgR in mammals, such as the rat, may be investigated by in 

situ hybridization histochemistry. To investigate expression in the brain, for example, 
coronal and sagittal rat brain cryosections (20 \im thick) are prepared using a Reichert- 
Jung cryostat. Individual sections are thaw-mounted onto silanized, nuclease-free 
slides (CEL Associates, Inc., Houston, TX), and stored at -80°C. Sections are 

20 processed starting with post-fixation in cold 4% paraformaldehyde, rinsed in cold 

phosphate-buffered saline (PBS), acetylated using acetic anhydride in triethanolamine 
buffer, and dehydrated through a series of alcohol washes in 70%, 95%, and 100% 
alcohol at room temperature. Subsequently, sections are delipidated in chloroform, 
followed by rehydration through successive exposure to 100% and 95% alcohol at 

25 room temperature. Microscope slides containing processed cryosections are allowed 
to air dry prior to hybridization. Other tissues may be assayed in a similar fashion. 

A NgR-specific probe may be generated using PCR Following PCR 
amplification, the fragment is digested with restriction enzymes and cloned into 
pBluescript II cleaved with the same enzymes. For production of a probe specific for 

30 the sense strand of NgR, a cloned NgR fragment cloned in pBluescript II may be 
linearized with a suitable restriction enzyme, which provides a substrate for labeled 
run-off transcripts {i.e., cRNA riboprobes) using the vector-borne T7 promoter and 
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commercially available T7 RNA polymerase. A probe specific for the antisense strand 
of NgR may also be readily prepared using the NgR clone in pBluescript II by cleaving 
the recombinant plasmid with a suitable restriction enzyme to generate a linearized 
substrate for the production of labeled run-off cRNA transcripts using the T3 promoter 
5 and cognate polymerase. The riboprobes may be labeled with [ 35 S]-UTP to yield a 
specific activity of about 0.40 x 10 6 cpm/pmol for antisense riboprobes and about 0.65 
x 10 6 cpm/pmol for sense-strand riboprobes. Each riboprobe may be subsequently 
denatured and added (2 pmol/ml) to hybridization buffer which contains 50% 
formamide, 10% dextran, 0.3 MNaCl, 10 mM Tris (pH 8.0), 1 mM EDTA, IX 

10 Denhardt's Solution, and 10 mM dithiothreitol. Microscope slides containing 

sequential brain cryosections may be independently exposed to 45 \il of hybridization 
solution per slide and silanized cover slips may be placed over the sections being 
exposed to hybridization solution. Sections are incubated overnight (15-18 hours) at 
52°C to allow hybridization to occur. Equivalent series of cryosections are then 

1 5 exposed to sense or antisense NgR-specific cRNA riboprobes. 

Following the hybridization period, coverslips are washed off the slides in IX 
SSC, followed by RNase A treatment involving the exposure of slides to 20 jig/ml 
RNase A in a buffer containing 10 mM Tris-HCl (pH 7.4), 0.5 M EDTA, and 0.5 M 
NaCl for 45 minutes at 37 8 C. The cryosections are then subjected to three 

20 high-stringency washes in 0. 1 X SSC at 52°C for 20 minutes each. Following the 

series of washes, cryosections are dehydrated by consecutive exposure to 70%, 95%, 
and 100% ammonium acetate in alcohol, followed by air drying and exposure to 
Kodak BioMax™ MR-1 film. After 13 days of exposure, the film is developed, and 
any significant hybridization signal is detected. Based on these results, slides containing 

25 tissue that hybridized, as shown by film autoradiograms, are coated with Kodak NTB- 
2 nuclear track emulsion and the slides are stored in the dark for 32 days. The slides 
are then developed and counterstained with hematoxylin. Emulsion-coated sections 
are analyzed microscopically to determine the specificity of labeling. The signal is 
determined to be specific if autoradiographic grains (generated by antisense probe 

30 hybridization) are clearly associated with cresyl violate-stained cell bodies. 

Autoradiographic grains found between cell bodies indicate non-specific binding of the 
probe. 
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In some cases, such as using a probe to detect a NgR homolog in a 
heterologous species, in order to achieve optimal hybridization, it may be necessary to 
decrease the stringency conditions. Such conditions are well known to those of 
ordinary skill in the art and examples are provided above. 
5 Expression of NgR in the brain provides an indication that modulators of NgR 

activity have utility for treating neurological disorders. Some other diseases for which 
modulators of NgR may have utility include depression, anxiety, bipolar disease, 
epilepsy, neuritis, neurasthenia, neuropath}', neuroses, and the like. Use of NgR 
modulators, including NgR ligands and anti-NgR antibodies, to treat individuals having 
10 such disease states is intended as an aspect of the invention. 

Example 10: Northern Blot Analysis of NgR-RNA with a PCR-generated Probe 

Northern blot hybridizations may be performed to examine the expression of 
NgR mRNA. A clone containing at least a portion of the sequence of SEQ ID NO: 1 
15 may be used as a probe. Vector-specific primers are used in PCR to generate a 
hybridization probe fragment for 32 P-labeling. The PCR is performed as follows: 



Mix: lpl NgR-containing plasmid 
2[d fwd primer (1 0-50 pM) 
20 2jil rev primer (10-50 pM) 

10^x1 lOxPCR buffer (such as that provided with the enzyme, 

Amersham Pharmacia Biotech) 
l|il lOmM dNTP (such as #1 969 064 from Boehringer Mannheim) 
0.5pl Taq polymerase (such as #27-0799-62, Amersham Pharmacia 
25 Biotech) 

83.5|xl water 
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PCR is performed in a Thermocycler using the following program: 



94'C 



5min 



94°C 
55°C 
1TC 



lmin 
lmin 
lmin 



30 cycles 



72°C 



lOmin | 



10 



15 



20 



25 



The PCR product may be purified using QIAquick PCR Purification Kit 
(#28104) from Qiagen, and radictively labeled with 32 P-dCTP (#AA0005/250, 
Amersham Pharmacia Biotech)) may be done by random priming using "Ready-to-go 
DNA Labeling Beads" (#27-9240-01) from Amersham Pharmacia Biotech. 
Hybridization is carried out on Human Multiple Tissue Northern Blot from Clontech as 
described in manufacturer's protocol, or on a Northern Blot prepared by running RNA 
samples from cells of interest on an agarose gel and blotting to a membrane using any 
of the known Northern blotting protocols. After exposure overnight on Molecular 
Dynamics Phosphor Imager screen (#MD 146-8 14) bands of an appropriate size are 
visualized. 

Example 11: Recombinant Expression of NgR in Eukaryotic Host Cells 
A. Expression of NgR in Mammalian Cells 

To produce NgR protein, a NgR-encoding polynucleotide is expressed in a 
suitable host cell using a suitable expression vector and standard genetic engineering 
techniques. For example, a NgR-encoding sequence described in Table 4 is subcloned 
into the commercial expression vector pzeoS V2 (Invitrogen, San Diego, CA) and 
transfected into Chinese Hamster Ovary (CHO) cells using the transfection reagent 
FuGENE6™ (Boehringer-Mannheim) and the transfection protocol provided in the 
product insert. Other eukaryotic cell lines, including human embryonic kidney (HEK 
293) and COS cells, are suitable as well. Cells stably expressing NgR are selected by 
growth in the presence of 100 ng/ml zeocin (Stratagene, LaJolla, CA). As an 
alternative to FuGENE6™, the expression vector may carry the gene for dihydrofolate 
reductase (dhfr) and selection of clones with methotrexate (MTX) drug pressure 
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allows for stable transformation of CHO cells. Optionally, NgR may be purified from 
the cells using standard chromatographic techniques. To facilitate purification, antisera 
is raised against one or more synthetic peptide sequences that correspond to portions 
of the NgR amino acid sequence, and the antisera is used to affinity purify Nogo-R. 
5 The NgR also may be expressed in-frame with a tag sequence (e.g. , polyhistidine, 
hemaglutinin, FLAG) to facilitate purification. Moreover, it will be appreciated that 
many of the uses for NgR polypeptides, such as assays described below, do not require 
purification of NgR from the host cell. 

1 0 B. Expression of NgR in CHO cells 

For expression of NgR in Chinese hamster ovary (CHO) cells, a plasmid 
bearing the relevant NgR coding sequence is prepared, using a vector which also bears 
the selectable marker dihydrofolate reductase (DHFR). The plasmid is transfected into 
CHO cells. Selection under MTX drug pressure allows for preparation of stable 
15 transformants of a NgR (NgR2 or NgR3) in an expression plasmid carrying a 
selectable marker such as DHFR. 

C. Expression of NgR in 293 cells 

For expression of NgR in mammalian cells 293 (transformed human, primary 
20 embryonic kidney cells), a plasmid bearing the relevant NgR coding sequence is 
prepared, using vector pSecTag2A (Invitrogen). Vector pSecTag2A contains the 
murine IgK chain leader sequence for secretion, the c-myc epitope for detection of the 
recombinant protein with the anti-myc antibody, a C-terminal polyhistidine for 
purification with nickel chelate chromatography, and a Zeocin resistant gene for 
25 selection of stable transfectants. The forward primer for amplification of this NgR 
cDNA is determined by routine procedures and preferably contains a 5 1 extension of 
nucleotides to introduce the Hindin cloning site and nucleotides matching the NgR 
sequence. The reverse primer is also determined by routine procedures and preferably 
contains a 5' extension of nucleotides to introduce an Xhol restriction site for cloning 
30 and nucleotides corresponding to the reverse complement of the NgR sequence. The 
PCR conditions are 55°C as the annealing temperature. The PCR product is gel 
purified and cloned into the HindfiR-Xhol sites of the vector. 
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The DNA is purified using Qiagen chromatography columns and transfected 
into 293 cells using DOTAP™ transfection media (Boehringer Mannheim, 
Indianapolis, IN). Transiently transfected cells are tested for expression after 24 hours 
of transfection, using western blots probed with anti-His and anti-NgR peptide 
5 antibodies. Permanently transfected cells are selected with Zeocin and propagated. 
Production of the recombinant protein is detected from both cells and media by 
Western blots probed with anti-His, anti-Myc or anti-NgR peptide antibodies. 

D. Transient Expression of Nogo-R in COS cells 

10 For expression of the NgR in COS7 cells, a polynucleotide molecule having a 

nucleotide sequence of SEQ ID NO: 1, for example, can be cloned into vector p3-CI. 
This vector is a pUC18-derived plasmid that contains the HCMV (human 
cytomegalovirus) promoter-intron located upstream from the bGH (bovine growth 
hormone) polyadenylation sequence and a multiple cloning site. 

15 The forward primer is determined by routine procedures and preferably 

contains a 5 ! extension which introduces an Xbal restriction site for cloning, followed 
by nucleotides which correspond to a nucleotide sequence of SEQ ID NO: 1 . The 
reverse primer is also determined by routine procedures and preferably contains 5- 
extension of nucleotides which introduces a Sail cloning site followed by nucleotides 

20 which correspond to the reverse complement of a nucleotide sequence of SEQ ID 
NO:l. 

The PCR consists of an initial denaturation step of 5 min at 95 °C, 30 cycles of 
30 sec denaturation at 95°C, 30 sec annealing at 58°C and 30 sec extension at 72°C, 
followed by 5 min extension at 72°C. The PCR product is gel purified and ligated into 

25 the Xbal and Sail sites of vector p3-CI. This construct is transformed into E. coli cells 
for amplification and DNA purification. The DNA is purified with Qiagen 
chromatography columns and transfected into COS 7 cells using Lipofectamine™ 
reagent from BRL, following the manufacturer's protocols. Forty-eight and 72 hours 
after transfection, the media and the cells are tested for recombinant protein 

30 expression. 

NgR expressed from a COS cell culture can be purified by concentrating the 
cell-growth media to about 10 mg of protein/ml, and purifying the protein by, for 
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example, chromatography. Purified NgR is concentrated to 0.5 mg/ml in an Amicon 
concentrator fitted with a YM-10 membrane and stored at -80°C. NgR3 may also be 
expressed using this method and the nucleotide sequence of SEQ ID NO:3 or SEQ ID 
NO:13. 

5 

E. Expression of NgR in Insect Cells 

For expression of NgR in a baculovirus system, a polynucleotide molecule 
having a nucleotide sequence of SEQ ED NO:l, 3 or 13 can be amplified by PCR. The 
forward primer is determined by routine procedures and preferably contains a 5 f 

10 extension which adds the Ndel cloning site, followed by nucleotides which correspond 
to a nucleotide sequence of SEQ ID NO: 1 (or SEQ ID NO:3 or SEQ ID NO: 13, 
respectively). The reverse primer is also determined by routine procedures and 
preferably contains a 5' extension which introduces the Kpnl cloning site, followed by 
nucleotides which correspond to the reverse complement of a nucleotide sequence of 

15 SEQ ID NO: 1 (or SEQ ID NO:3 or SEQ ID NO: 13, respectively). 

The PCR product is gel purified, digested with Ndel and Kpnl, and cloned into 
the corresponding sites of vector pACHTL-A (Pharmingen, San Diego, CA). The 
pAcHTL expression vector contains the strong polyhedrin promoter of the 
Autographa californica nuclear polyhedrosis virus (AcMNPV), and a 6XHis tag 

20 upstream from the multiple cloning site. A protein kinase site for phosphorylation and 
a thrombin site for excision of the recombinant protein precede the multiple cloning 
site is also present. Of course, many other baculovirus vectors could be used in place 
of pAcHTL-A, such as pAc373, pVL941 and pAcIMl. Other suitable vectors for the 
expression of NgR polypeptides can be used, provided that the vector construct 

25 includes appropriately located signals for transcription, translation, and trafficking, 

such as an in-frame AUG and a signal peptide, as required. Such vectors are described 
inLuckow etal, Virology 170:31-39, among others. 

The virus is grown and isolated using standard baculovirus expression methods, 
such as those described in Summers et al (1987) A MANUAL OF METHODS FOR 

30 Baculovirus Vectors and Insect Cell Culture Procedures, Texas Agricultural 
Experimental Station Bulletin No. 1555. 
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In a preferred embodiment, p AcHLT-A containing NgR gene is introduced into 
baculovirus using the "BaculoGold™" transfection kit (Pharmingen, San Diego, CA) 
using methods established by the manufacturer. Individual virus isolates are analyzed 
for protein production by radiolabeling infected cells with S-methionine at 24 hours 
5 post infection. Infected cells are harvested at 48 hours post infection, and the labeled 
proteins are visualized by SDS-PAGE. Viruses exhibiting high expression levels can 
be isolated and used for scaled up expression. 

For expression of a NgR polypeptide in a Sf9 cells, a polynucleotide molecule 
having the nucleotide sequence of SEQ ID NO: 1 (or SEQ ID NO:3 or SEQ ID 
10 NO: 13) can be amplified by PCR using the primers and methods described above for 
baculovirus expression. The NgR cDNA is cloned into vector pAcHLT-A 
(Pharmingen) for expression in Sf9 insect. The insert is cloned into the Ndel and Kpnl 
sites, after elimination of an internal Ndel site (using the same primers described above 
for expression in baculovirus). DNA is purified with Qiagen chromatography columns 
15 and expressed in Sf9 cells. Preliminary Western blot experiments from non-purified 
plaques are tested for the presence of the recombinant protein of the expected size 
which reacted with the NgR-specific antibody. These results are confirmed after 
further purification and expression optimization in HiG5 cells. 

20 F. Expression of soluble forms of NgR2 and NgR3 as NgR-Ig fusion 

proteins. 

To generate a NgR2-Ig fusion protein, standard methods may be used as 
described in the literature (e.g. Sanicola et al (1997) Proc. Natl Acad Set USA. 94, 
6238-6243). For example, a DNA fragment encoding NgR2 without the sequence 

25 encoding the hydrophobic C-terminus (GPI anchor signal) may be ligated to a DNA 
fragment encoding the Fc domain of IgGl (which may be human IgGl), and the 
chimeric fragment may be cloned into an expression vector to generate a plasmid. The 
plasmid may then be transfected into Chinese hamster ovary cells to generate a stable 
cell line producing the fusion protein. The fusion protein is then purified from 

30 conditioned media using standard methods. For example, clarified conditioned media 
from the cell line may be loaded by gravity directly onto Protein A Sepharose. The 
column may then be washed with five column volumes each of PBS, PBS containing 
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0.5 M NaCl, and 25 mM sodium phosphate, 100 mM NaCl (pH 5.0). The bound 
protein may then be eluted with 25 mM NaH 2 P0 4 , 100 mM NaCl (pH 2.8) and 
immediately neutralized with 1/10 fraction volume of 0.5 M Na 2 HP0 4 (pH 8.6). 
Similar methods may be used to generate a NgR3-Ig fusion protein. 

5 

Example 12: Interaction Trap/Two-Hybrid System 

In order to assay for NgR-interacting proteins, the interaction trap/two-hybrid 
library screening method can be used. This assay was first described in Fields et al 
(1989) Nature 340, 245, which is incorporated herein by reference in its entirety. A 

1 0 protocol is published in CURRENT PROTOCOLS IN MOLECULAR BIOLOGY 1 999, John 

Wiley & Sons, NY and Ausubel, F. M. etal. 1992, SHORT PROTOCOLS IN MOLECULAR 
BIOLOGY, fourth edition, Greene and Wiley-interscience, NY, which is incorporated 
herein by reference in its entirety. Kits are available from Clontech, Palo Alto, CA 
(Matchmaker Two-Hybrid System 3). 

15 A fusion of the nucleotide sequences encoding all or partial NgR and the yeast 

transcription factor GAL4 DNA-binding domain (DNA-BD) is constructed in an 
appropriate plasmid (i.e., pGBKT7) using standard subcloning techniques. Similarly, a 
GAL4 active domain (AD) fusion library is constructed in a second plasmid (i.e., 
pGADT7) from cDNA of potential NgR-binding proteins (for protocols on forming 

20 cDNA libraries, see Sambrook et al 1989, MOLECULAR CLONING: ALABORATORY 
Manual, second edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY), which is incorporated herein by reference in its entirety. The DNA-BD/NgR 
fusion construct is verified by sequencing, and tested for autonomous reporter gene 
activation and cell toxicity, both of which would prevent a successful two-hybrid 

25 analysis. Similar controls are performed with the AD/library fusion construct to ensure 
expression in host cells and lack of transcriptional activity. Yeast cells are transformed 
(ca. 105 transformants/mg DNA) with both the NgR and library fusion plasmids 
according to standard procedure (Ausubel, et al 9 1992, SHORT PROTOCOLS IN 
Molecular Biology, fourth edition, Greene and Wiley-interscience, NY, which is 

30 incorporated herein by reference in its entirety). In vivo binding of DNA-BD/NgR 
with AD/library proteins results in transcription of specific yeast plasmid reporter 
genes (i.e., lacZ, HIS3, ADE2, LEU2). Yeast cells are plated on nutrient-deficient 
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media to screen for expression of reporter genes. Colonies are dually assayed for 
p-galactosidase activity upon growth in Xgal 

(5-bromo-4-cWoro-3-indolyl-b-D-galactoside) supplemented media (filter assay for 
b-galactosidase activity is described in Breeden et al, (1985) Cold Spring Harb. Symp. 
5 Quant BioL, 50, 643, which is incorporated herein by reference in its entirety). 

Positive AD-library plasmids are rescued from transformants and reintroduced into the 
original yeast strain as well as other strains containing unrelated DNA-BD fusion 
proteins to confirm specific NgR/library protein interactions. Insert DNA is sequenced 
to verify the presence of an open reading frame fused to GAL4 AD and to determine 
1 0 the identity of the NgR-binding protein. 

Example 13: Antibodies to Nogo-R 

Standard techniques are employed to generate polyclonal or monoclonal 
antibodies to the NgR receptor, and to generate useful antigen-binding fragments 

1 5 thereof or variants thereof, including "humanized" variants. Such protocols can be 
found, for example, in Sambrook et al. (1989), above, and Harlow et al (Eds.), 
Antibodies A Laboratory manual; Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY (1988). In one embodiment, recombinant NgR polypeptides (or 
cells or cell membranes containing such polypeptides) are used as antigen to generate 

20 the antibodies. In another embodiment, one or more peptides having amino acid 

sequences corresponding to an immunogenic portion of NgR (e.g., 6, 7, 8, 9, 10, 11, 
12, 13, 14, 15, 16, 17, 18, 19, 20, or more amino acids) are used as antigen. Peptides 
corresponding to extracellular portions of Nogo-R, especially hydrophilic extracellular 
portions, are preferred. The antigen may be mixed with an adjuvant or linked to a 

25 hapten to increase antibody production. 

A. Polyclonal or Monoclonal antibodies 

As one exemplary protocol, recombinant NgR or a synthetic fragment thereof 
is used to immunize a mouse for generation of monoclonal antibodies (or larger 
30 mammal, such as a rabbit, for polyclonal antibodies). To increase antigenicity, 

peptides are conjugated to Keyhole Limpet Hemocyanin (Pierce), according to the 
manufacturer's recommendations. For an initial injection, the antigen is emulsified with 
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Freund's Complete Adjuvant and injected subcutaneously. At intervals of two to three 
weeks, additional aliquots of NgR antigen are emulsified with Freund's Incomplete 
Adjuvant and injected subcutaneously. Prior to the final booster injection, a serum 
cample is taken from the immunized mice and assayed by western blot to confirm the 
5 presence of antibodies that immunoreact with NgR. Serum from the immunized 

animals may be used as polyclonal antisera or used to isolate polyclonal antibodies that 
recognize NgR. Alternatively, the mice are sacrificed and their spleen removed for 
generation of monoclonal antibodies. 

To generate monoclonal antibodies, the spleens are placed in 10 ml serum-free 

10 RPMI 1640, and single cell suspensions are formed by grinding the spleens in serum- 
free RPMI 1640, supplemented with 2 mM L-glutamine, 1 mM sodium pyruvate, 100 
units/ml penicillin, and 100 jig/ml streptomycin (RPMI) (Gibco, Canada). The cell 
suspensions are filtered and washed by centrifiigation and resuspended in serum-free 
RPMI. Thymocytes taken from three naive Balb/c mice are prepared in a similar 

1 5 manner and used as a Feeder Layer. NS-1 myeloma cells, kept in log phase in RPMI 
with 10% fetal bovine serum (FBS) (Hyclone Laboratories, Inc., Logan, UT) for three 
days prior to fusion, are centrifiiged and washed as well. 

To produce hybridoma fusions, spleen cells from the immunized mice are 
combined with NS-1 cells and centrifiiged, and the supernatant is aspirated. The cell 

20 pellet is dislodged by tapping the tube, and 2 ml of 37°C PEG 1500 (50% in 75 mM 
HEPES, pH 8.0) (Boehringer-Mannheim) is stirred into the pellet, followed by the 
addition of serum-free RPMI. Thereafter, the cells are centrifiiged, resuspended in 
RPMI containing 15% FBS, 100 jiM sodium hypoxanthine, 0.4 \iM aminopterin, 16 
\lM thymidine (HAT) (Gibco), 25 units/ml IL-6 (Boehringer-Mannheim) and 1.5 x 10 6 

25 thymocytes/ml, and plated into 10 Corning flat-bottom 96-well tissue culture plates 
(Corning, Corning, NY). 

On days 2, 4, and 6 after the fusion, 100 pi of medium is removed from the 
wells of the fusion plates and replaced with fresh medium. On day 8, the fusions are 
screened by ELIS A, testing for the presence of mouse IgG that binds to NgR 

30 Selected fusion wells are further cloned by dilution until monoclonal cultures 
producing anti-NgR antibodies are obtained. 



BNSDOCID: <WO 0229059A2_I_> 



WO 02/29059 



PCT/US01/31488 



- 125 - 

B. Humanization of anti-NgR monoclonal antibodies 

The expression pattern of NgR as reported herein and the potential of NgRs as 
targets for therapeutic intervention suggest therapeutic indications for NgR inhibitors 
(antagonists). NgR-neutralizing antibodies comprise one class of therapeutics useful as 
5 NgR antagonists. Following are protocols to improve the utility of anti-NgR 

monoclonal antibodies as therapeutics in humans by "humanizing" the monoclonal 
antibodies to improve their serum half-life and render them less immunogenic in human 
hosts (i.e., to prevent human antibody response to non-human anti-NgR antibodies). 

The principles of humanization have been described in the literature and are 
10 facilitated by the modular arrangement of antibody proteins. To minimize the 
possibility of binding complement, a humanized antibody of the IgG4 isotype is 
preferred. 

For example, a level of humanization is achieved by generating chimeric 
antibodies comprising the variable domains of non-human antibody proteins of interest 

15 with the constant domains of human antibody molecules. (See, e.g., Morrison et aL, 
(1989) Adv. Immunol. 9 44, 65-92). The variable domains of NgR-neutralizing anti- 
NgR antibodies are cloned from the genomic DNA of a B-cell hybridoma or from 
cDNA generated from mRNA isolated from the hybridoma of interest. The V region 
gene fragments are linked to exons encoding human antibody constant domains, and 

20 the resultant construct is expressed in suitable mammalian host cells (e.g., myeloma or 
CHO cells). 

To achieve an even greater level of humanization, only those portions of the 
variable region gene fragments that encode antigen-binding complementarity 
determining regions ("CDR") of the non-human monoclonal antibody genes are cloned 

25 into human antibody sequences. (See, e.g., Jones et aL, (1986) Nature 321, 522-525; 
Riechmann et aL, (1988) Nature 332, 323-327; Verhoeyen et aL, (1988) Science 239, 
1534-1536 ; and Tempest et aL, (1991) Bio/Technology 9, 266-271). If necessary, the 
B-sheet framework of the human antibody surrounding the CDR3 regions also is 
modified to more closely mirror the three dimensional structure of the antigen-binding 

30 domain of the original monoclonal antibody. (See Kettleborough et aL, (1991) Protein 
Engin. 4, 773-783; and Foote et aL, (1992) J. Mol. Biol. 224, 487-499). 
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In an alternative approach, the surface of a non-human monoclonal antibody of 
interest is humanized by altering selected surface residues of the non-human antibody, 
e.g., by site-directed mutagenesis, while retaining all of the interior and contacting 
residues of the non-human antibody. See Padlan (1991) Mol Immunol 28, 489-498 . 
5 The foregoing approaches are employed using NgR-neutralizing anti-NgR 

monoclonal antibodies and the hybridomas that produce them to generate humanized 
NgR-neutralizing antibodies useful as therapeutics to treat or palliate conditions 
wherein NgR expression or ligand-mediated NgR signaling is detrimental. 

10 C. Human NgR-Neutralizing Antibodies from Phage Display 

Human NgR-neutralizing antibodies are generated by phage display techniques 
such as those described in Aujame et al (1997) Human Antibodies 8, 1 55-168; 
Hoogenboom (1997) TBBTECH 15, 62-70; and Rader etal (1997), Curr. Opin. 
Biotechnol. 8, 503-508, all of which are incorporated by reference. For example, 

1 5 antibody variable regions in the form of Fab fragments or linked single chain Fv 

fragments are fused to the amino terminus of filamentous phage minor coat protein 
pin. Expression of the fusion protein and incorporation thereof into the mature phage 
coat results in phage particles that present an antibody on their surface and contain the 
genetic material encoding the antibody. A phage library comprising such constructs is 

20 expressed in bacteria, and the library is screened for NgR-specific phage-antibodies 
using labeled or immobilized NgR as antigen-probe. 

D. Human NgR-neutralizing antibodies from transgenic mice 

Human NgR-neutralizing antibodies are generated in transgenic mice essentially 
25 as described in Bruggemann et al (1996) Immunol Today 17, 391-397 and 

Bruggemann et al (1997) Curr. Opin. Biotechnol 8, 455-458. Transgenic mice 
carrying human V-gene segments in germline configuration and that express these 
transgenes in their lymphoid tissue are immunized with a NgR composition using 
conventional immunization protocols, hybridomas are generated using B cells from the 
30 immunized mice using conventional protocols and screened to identify hybridomas 
secreting anti-NgR human antibodies {e.g., as described above). 
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Example 14: Assays to Identify Modulators of NgR Activity 

Set forth below are several nonlimiting assays for identifying modulators 
(agonists and antagonists) of NgR activity. Among the modulators that can be 
identified by these assays are natural ligand compounds of the receptor; synthetic 
5 analogs and derivatives of natural ligands; antibodies, antibody fragments, and/or 
antibody-like compounds derived from natural antibodies or from antibody-like 
combinatorial libraries; and/or synthetic compounds identified by high-throughput 
screening of libraries; and the like. All modulators that bind NgR are useful for 
identifying NgR in tissue samples {e.g., for diagnostic purposes, pathological purposes, 
10 and the like). Agonist and antagonist modulators are useful for up-regulating and 
down-regulating NgR activity, respectively, to treat disease states characterized by 
abnormal levels of NgR activity. The assays may be performed using single putative 
modulators, and/or may be performed using a known agonist in combination with 
candidate antagonists (or visa versa). 

15 

A. cAMP Assays 

In one type of assay, levels of cyclic adenosine monophosphate (cAMP) are 
measured in NgR-transfected cells that have been exposed to candidate modulator 
compounds. Protocols for cAMP assays have been described in the literature. (See, 

20 e.g., Sutherland et aL, (1968) Circulation 37, 279; Frandsen et aL, (1976) Life 

Sciences 18, 529-541; Dooley etaL, (1997)7. Pharmacol Exp. Therap. 283, 735-41; 
and George et aL, (1997) J. Biomol Screening 2, 235-40). An exemplary protocol for 
such an assay, using an Adenylyl Cyclase Activation FlashPlate® Assay from NEN™ 
Life Science Products, is set forth below. 

25 Briefly, the NgR coding sequence (e.g., a cDNA or intronless genomic DNA) 

is subcloned into a commercial expression vector, such as pzeoSV2 (Invitrogen), and 
transiently transfected into Chinese Hamster Ovary (CHO) cells using known methods, 
such as the transfection protocol provided by Boehringer-Mannheim when supplying 
the FuGENE 6 transfection reagent. Transfected CHO cells are seeded into 96-well 

30 microplates from the FlashPlate® assay kit, which are coated with solid scintillant to 
which antisera to cAMP has been bound. For a control, some wells are seeded with 
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wild type (untransfected) CHO cells. Other wells in the plate receive various amounts 
of a cAMP standard solution for use in creating a standard curve. 

One or more test compounds {i.e., candidate modulators) are added to the cells 
in each well, with water and/or compound-free medium/diluent serving as a control or 
5 controls. After treatment, cAMP is allowed to accumulate in the cells for exactly 15 
minutes at room temperature. The assay is terminated by the addition of lysis buffer 
containing [ 125 I]-labeled cAMP, and the plate is counted using a Packard Topcount™ 
96-well microplate scintillation counter. Unlabeled cAMP from the lysed cells (or 
from standards) and fixed amounts of [ I25 I]-cAMP compete for antibody bound to the 

10 plate. A standard curve is constructed, and cAMP values for the unknowns are 

obtained by interpolation. Changes in intracellular cAMP levels of cells in response to 
exposure to a test compound are indicative of NgR modulating activity. Modulators 
that act as agonists of receptors which couple to the G s subtype of G proteins will 
stimulate production of cAMP, leading to a measurable 3-10 fold increase in cAMP 

1 5 levels. Agonists of receptors which couple to the Gy Q subtype of G proteins will inhibit 
forskolin-stimulated cAMP production, leading to a measurable decrease in cAMP 
levels of 50-100%. Modulators that act as inverse agonists will reverse these effects at 
receptors that are either constitutively active or activated by known agonists. 

20 B. Aequorin Assays 

In another assay, cells (e.g., CHO cells) are transiently co-transfected with both 
a NgR expression construct and a construct that encodes the photoprotein apoaquorin. 
In the presence of the cofactor coelenterazine, apoaquorin will emit a measurable 
luminescence that is proportional to the amount of intracellular (cytoplasmic) free 

25 calcium. (See generally, Cobbold, et al "Aequorin measurements of cytoplasmic free 
calcium," In. McCormack J.G. and Cobbold P.H., eds., CELLULAR CALCIUM: A 
Practical Approach. Oxford:IRL Press (1991); Stables et al, (1997) Anal 
Biochem. 252, 1 15-26; and Haugland, Handbook of Fluorescent Probes and 
Research Chemicals. Sixth edition. Molecular Probes, Eugene, OR (1996)). 

30 In one exemplary assay, NgR is subcloned into the commercial expression 

vector pzeoSV2 (Invitrogen) and transiently co-transfected along with a construct that 
encodes the photoprotein apoaquorin (Molecular Probes, Eugene, OR) into CHO cells 
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using the transfection reagent FuGENE 6 (Boehringer-Mannheim) and the transfection 
protocol provided in the product insert. 

The cells are cultured for 24 hours at 37°C in MEM (Gibco/BRL, 
Gaithersburg, MD) supplemented with 10% fetal bovine serum, 2 mM glutamine, 10 
5 U/ml penicillin and 10 jig/ml streptomycin, at which time the medium is changed to 
serum-free MEM containing 5 [lM coelenterazine (Molecular Probes, Eugene, OR). 
Culturing is then continued for two additional hours at 37°C. Subsequently, cells are 
detached from the plate using VERSEN (Gibco/BRL), washed, and resuspended at 
200,000 cells/ml in serum-free MEM. 

10 Dilutions of candidate NgR modulator compounds are prepared in serum-free 

MEM and dispensed into wells of an opaque 96-well assay plate at 50 jil/well. Plates 
are then loaded onto an MLX microtiter plate luminometer (Dynex Technologies, Inc., 
Chantilly, VA). The instrument is programmed to dispense 50 \il cell suspensions into 
each well, one well at a time, and immediately read luminescence for 15 seconds. 

15 Dose-response curves for the candidate modulators are constructed using the area 
under the curve for each light signal peak. Data are analyzed with SlideWrite, using 
the equation for a one-site ligand, and EC 50 values are obtained. Changes in 
luminescence caused by the compounds are considered indicative of modulatory 
activity. Modulators that act as agonists at receptors which couple to the G q subtype 

20 of G proteins give an increase in luminescence of up to 100 fold. Modulators that act 
as inverse agonists will reverse this effect at receptors that are either constitutively 
active or activated by known agonists. 

C. Luciferase Reporter Gene Assay 

25 The photoprotein luciferase provides another useful tool for assaying for 

modulators of NgR activity. Cells (e.g., CHO cells or COS 7 cells) are transiently co- 
transfected with both a NgR expression construct (e.g., NgR in pzeoSV2) and a 
reporter construct which includes a gene for the luciferase protein downstream from a 
transcription factor binding site, such as the cAMP-response element (CRE), AP-1, or 

30 NF-kappa B. Expression levels of luciferase reflect the activation status of the 

signaling events. (See generally, George et al (1997) J. Biomol. Screening 2, 235- 
240; and Stratowa et al. (1995) Curr. Opin. BiotechnoL 6, 574-581). Luciferase 
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activity may be quantitatively measured using, e.g., luciferase assay reagents that are 
commercially available from Promega (Madison, WI). 

In one exemplary assay, CHO cells are plated in 24-well culture dishes at a 
density of 100,000 cells/well one day prior to transfection and cultured at 37°C in 
5 MEM (Gibco/BRL) supplemented with 10% fetal bovine serum, 2 mM glutamine, 10 
U/ml penicillin and 10 [ig/ml streptomycin. Cells are transiently co-transfected with 
both a NgR expression construct and a reporter construct containing the luciferase 
gene. The reporter plasmids CRE-luciferase, AP-1 -luciferase and NF-kappaB- 
luciferase may be purchased from Stratagene (Legally, CA). Transfections are 

10 performed using the FuGENE 6 transfection reagent (Boehringer-Mannheim) 

according to the supplier's instructions. Cells transfected with the reporter construct 
alone are used as a control. Twenty-four hours after transfection, cells are washed 
once with PBS pre-warmed to 37°C. Serum-free MEM is then added to the cells 
either alone (control) or with one or more candidate modulators and the cells are 

15 incubated at 37°C for five hours. Thereafter, cells are washed once with ice-cold PBS 
and lysed by the addition of 100 \i\ of lysis buffer per well from the luciferase assay kit 
supplied by Promega. After incubation for 15 minutes at room temperature, 15 pi of 
the lysate is mixed with 50 pi of substrate solution (Promega) in an opaque-white, 
96-well plate, and the luminescence is read immediately on a Wallace model 1450 

20 MicroBeta scintillation and luminescence counter (Wallace Instruments, Gaithersburg, 
MD). 

Differences in luminescence in the presence versus the absence of a candidate 
modulator compound are indicative of modulatory activity. Receptors that are either 
constitutively active or activated by agonists typically give a 3-20-fold stimulation of 
25 luminescence compared to cells transfected with the reporter gene alone. Modulators 
that act as inverse agonists will reverse this effect. 



D. Intracellular calcium measurement using FLJPR 

Changes in intracellular calcium levels are another recognized indicator of 
30 receptor activity, and such assays can be employed to screen for modulators of NgR 
activity. For example, CHO cells stably transfected with a NgR expression vector are 
plated at a density of 4 x 10 4 cells/well in Packard black-walled, 96-well plates 
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specially designed to discriminate fluorescence signals emanating from the various 
wells on the plate. The cells are incubated for 60 minutes at 37°C in modified 
Dulbecco's PBS (D-PBS) containing 36 mg/L pyruvate and 1 g/L glucose with the 
addition of 1% fetal bovine serum and one of four calcium indicator dyes (Fluo-3™ 
5 AM, Fluo-4™ AM, Calcium Green™- 1 AM, or Oregon Green™ 488 BAPTA-1 AM), 
each at a concentration of 4 jiM. Plates are washed once with modified D-PBS 
without 1% fetal bovine serum and incubated for 10 minutes at 37 a C to remove 
residual dye from the cellular membrane. In addition, a series of washes with modified 
D-PBS without 1% fetal bovine serum is performed immediately prior to activation of 

10 the calcium response. 

A calcium response is initiated by the addition of one or more candidate 
receptor agonist compounds, calcium ionophore A23187 (10 pM; positive control), or 
ATP (4 pM; positive control). Fluorescence is measured by Molecular Device's 
FLIPR with an argon laser (excitation at 488 nm). (See, e.g., Kuntzweiler et al. 

15 (1998) Drug Dev. Res. 44,14-20). The F-stop for the detector camera is set at 2.5 and 
the length of exposure is 0.4 milliseconds. Basal fluorescence of cells is measured for 
20 seconds prior to addition of candidate agonist, ATP, or A23 187, and the basal 
fluorescence level is subtracted from the response signal. The calcium signal is 
measured for approximately 200 seconds, taking readings every two seconds. Calcium 

20 ionophore A23 187 and ATP increase the calcium signal 200% above baseline levels. 
In general, activated NgRs increase the calcium signal at least about 10-15% above 
baseline signal. 

E. [ 35 S]GTPyS Binding Assay 

25 It is also possible to evaluate whether NgR signals through a G 

protein-mediated pathway. Because G protein-coupled receptors signal through 
intracellular G proteins whose activity involves GTP binding and hydrolysis to yield 
bound GDP, measurement of binding of the non-hydrolyzable GTP analog [ 35 S]- 
GTPyS in the presence and absence of candidate modulators provides another assay 

30 for modulator activity. (See, e.g., Kowal et al., (1998) Neuropharmacology 37, 179- 
187.). 
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In one exemplary assay, cells stably transfected with a NgR expression vector 
are grown in 10 cm tissue culture dishes to subconfluence, rinsed once with 5 ml of 
ice-cold Ca 2+ /Mg 2+ -free phosphate-buffered saline, and scraped into 5 ml of the same 
buffer. Cells are pelleted by centrifiigation (500 xg, 5 minutes), resuspended in TEE 
5 buffer (25 mM Tris, pH 7.5 , 5 mM EDTA, 5 mM EGTA), and frozen in liquid 

nitrogen. After thawing, the cells are homogenized using a Dounce homogenizer (1 ml 
TEE per plate of cells), and centrifuged at 1,000 x g for 5 minutes to remove nuclei 
and unbroken cells. 

The homogenate supernatant is centrifuged at 20,000 x g for 20 minutes to 

10 isolate the membrane fraction, and the membrane pellet is washed once with TEE and 
resuspended in binding buffer (20 mM HEPES, pH 7.5, 150 mM NaCl, 10 mM MgCl 2 , 
1 mM EDTA). The resuspended membranes can be frozen in liquid nitrogen and 
stored at -70 °C until use. 

Aliquots of cell membranes prepared as described above and stored at -70°C 

15 are thawed, homogenized, and diluted into buffer containing 20 mM HEPES, 10 mM 
MgCl 2 , 1 mM EDTA, 120 mMNaCl, 10 pM GDP, and 0.2 mM ascorbate, at a 
concentration of 10-50 p-g/ml. In a final volume of 90 pi, homogenates are incubated 
with varying concentrations of candidate modulator compounds or 100 [iM GTP for 
30 minutes at 30°C and then placed on ice. To each sample, 10 pi guanosine 

20 5 , -0-(3[ 35 S]thio) triphosphate (NEN, 1200 Ci/mmol; [ 35 S]-GTPyS), was added to a 

final concentration of 100-200 pM. Samples are incubated at 30°C for an additional 30 
minutes, 1 ml of 10 mM HEPES, pH 7.4, 10 mM MgCl 2 , at 4°C is added and the 
reaction is stopped by filtration. 

Samples are filtered over Whatman GF/B filters and the filters are washed with 

25 20 ml ice-cold 10 mM HEPES, pH 7.4, 10 mM MgCl 2 . Filters are counted by liquid 
scintillation spectroscopy. Nonspecific binding of [ S]-GTPyS is measured in the 
presence of 100 jiM GTP and subtracted from the total. Compounds are selected that 
modulate the amount of [ 35 S]-GTPyS binding in the cells, compared to untransfected 
control cells. Activation of receptors by agonists gives up to a five-fold increase in 

30 [ 35 S]-GTPyS binding. This response is blocked by antagonists. 

F. [ 3 H] Arachidonic Acid Release 
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The activation of NgRs may also potentiate arachidonic acid release in cells, 
providing yet another useful assay for modulators of NgR activity. (See, e.g., 
Kanterman et al, (1991) Mol Pharmacol 39,364-369.) For example, CHO cells that 
are stably transfected with a NgR expression vector are plated in 24-well plates at a 
5 density of 15,000 cells/well and grown in MEM medium supplemented with 10% fetal 
bovine serum, 2 mM glutamine, 10 U/ml penicillin and 10 |ig/ml streptomycin for 48 
hours at 37°C before use. Cells of each well are labeled by incubation with 
[ 3 H]-arachidonic acid (Amersham Corp., 210 Ci/mmol) at 0.5 |xCi/ml in 1 ml MEM 
supplemented with 10 mMHEPES, pH 7.5, and 0.5% fatty-acid-free bovine serum 
10 albumin for 2 hours at 37*C. The cells are then washed twice with 1 ml of the same 
buffer. 

Candidate modulator compounds are added in 1 ml of the same buffer, either 
alone or with 10 jiM ATP and the cells are incubated at 37°C for 30 minutes. Buffer 
alone and mock-transfected cells are used as controls. Samples (0.5 ml) from each 
15 well are counted by liquid scintillation spectroscopy. Agonists which activate the 
receptor will lead to potentiation of the ATP-stimulated release of [ 3 H]-arachidonic 
acid. This potentiation is blocked by antagonists. 

Extracellular Acidification Rate 

20 In yet another assay, the effects of candidate modulators of NgR activity are 

assayed by monitoring extracellular changes in pH induced by the test compounds (see, 
e.g., Dunlop et al. (1998) J. Pharmacol. Toxicol. Meth. 40, 47-55). In one 
embodiment, CHO cells transfected with a NgR expression vector are seeded into 12 
mm capsule cups (Molecular Devices Corp.) at 4 x 10 5 cells/cup in MEM 

25 supplemented with 10% fetal bovine serum, 2 mM L-glutamine, 10 U/ml penicillin, and 
10 p.g/ml streptomycin. The cells are incubated in this medium at 37°C in 5% C0 2 for 
24 hours. 

Extracellular acidification rates are measured using a Cytosensor 
microphysiometer (Molecular Devices Corp.). The capsule cups are loaded into the 
30 sensor chambers of the microphysiometer and the chambers are perfused with running 
buffer (bicarbonate-free MEM supplemented with 4 mM L-glutamine, 10 units/ml 
penicillin, 10 p.g/ml streptomycin, 26 mMNaCl) at a flow rate of 100 jil/minute. 
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Caadidate agonists or other agents are diluted into the running buffer and perfused 
through a second fluid path. During each 60-second pump cycle, the pump is run for 
38 seconds and is off for the remaining 22 seconds. The pH of the running buffer in 
the sensor chamber is recorded during the cycle from 43-58 seconds, and the pump is 
5 re-started at 60 seconds to start the next cycle. The rate of acidification of the running 
buffer during the recording time is calculated by the Cytosoft program. Changes in the 
rate of acidification are calculated by subtracting the baseline value (the average of 4 
rate measurements immediately before addition of a modulator candidate) from the 
highest rate measurement obtained after addition of a modulator candidate. The 
10 selected instrument detects 61 mV/pH unit. Modulators that act as agonists of the 
receptor result in an increase in the rate of extracellular acidification compared to the 
rate in the absence of agonist. This response is blocked by modulators which act as 
antagonists of the receptor. 

15 Example 15: mNgR3 does not bind hNogo-A(1055-1120) 

To functionally test the mouse NgR3 (hereinafter, mNgR3) for its ability to 
bind hNogo-A(1055-l 120), a cDNA expression vector for a myc epitope-tagged 
mNgR3 protein was created. The mouse NgR3 cDNA was amplified by PCR from 
mouse adult brain cDNA, from the signal sequence to the stop codon, and ligated into 

20 the pSecTag2 vector such that the vector encodes a signal sequence followed by a myc 
tag followed by the mature mNgR3 sequence. This plasmid was transfected into 
COS07cells, and expression of a myc-tagged protein of the predicted size was verified 
by immunoblot analysis. Alkaline phosphatase-hNogo-A(1055-1120) binding studies 
and myc immunohistology were conducted as described (Fournier et al., supra). 

25 The cells expressing mNgR3 express the myc-tagged protein but binding to 

AP-hNogo-A(1055-l 120) was not observed under the conditions employed (Fig. 8). 

Example 16: Identification of partial human NgR3 cDNA and protein sequences 
The tblastn program was used to search for the human homolog of mouse 
30 NgR3. The mouse NgR3 protein sequence (SEQ ID NO:4) was used to query a 

proprietary human expressed sequence tag (EST) database from Incyte yielding one 
highly significant hit: Incyte Template ID 190989.1. This sequence (937 nucleotides) 
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contains an open reading frame of 3 12 amino acids in the second reverse frame that 
exhibits 88% identity with residues 66 to 381 of mouse NgR3 (SEQ ID NO:4), 
strongly indicating that it is part of the human NgR3 homolog. 

A query of SEQ ID NO:4 against the public human EST database in Genbank 
5 also produced a hit with a 465-bp EST (Accession number: R35699; Version number: 
R35699.1; GI: 792600). There are a number of single nucleotide deletions and 
insertions within this sequence which cause frame shift errors. All of the reliable 
sequence contained in this public EST is present in the Incyte EST (Template ID 
190989.1). 

10 To obtain more nucleotide sequence that would extend the amino acid 

sequence at that carboxy terminal end, the I.M.A.G.E. Consortium clone No. 38319, 
which corresponds to Genbank accession No. R35699, was purchased from Incyte 
Genomics Inc. and subjected to further DNA sequence analysis. This clone consists of 
a Notl/HinD III fragment containing the sequence of interest, cloned into the 

1 5 Notl/HinD m sites of the vector Lafinid B A 

(http://image.llnL gov/image/^ The clone was received as an 

agar stab, which was streaked out on LB agar plates containing 50ug/ml ampicillin to 
isolate individual colonies. Six colonies were grown in LB medium with antibiotic, and 
plasmid DNA was prepared using the Promega Wizard Plus Miniprep DNA 

20 Purification System (Promega #A7500). These DNAs were subsequently digested 
with NotI and HinD HI restriction enzymes to confirm that the clones contained an 
insert. The insert of one isolate was sequenced using a combination of vector specific 
and gene specific primers yielding a partial nucleotide sequence of human NgR3 of 
1 176 nucleotides (SEQ ID NO: 13). A translation of this sequence provides a partial 

25 sequence for human NgR3 of 392 amino acids (SEQ ID NO: 14). 

The nucleotide sequence of SEQ ID NO: 13 differs from the Incyte EST 
sequence at three positions. Nucleotide positions 12-13 in SEQ ID NO: 13 are CG, 
whereas the corresponding nucleotides in the Incyte Template ID 190989.1 are GT 
(i.e., positions 12-13 of the complement of Incyte Template ID 190989.1). In 

30 addition, position 641 in SEQ ID NO: 13 is a C, whereas the corresponding nucleotide 
in the Incyte Template ID 190989. 1 sequence is an A (i.e., position 641 of the 
complement of Incyte Template ID 190989.1). This results in two changes in amino 
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acids when comparing SEQ ID NO: 14 to the ORF encoded by Incyte Template 
190989.1: SEQ ID NO: 14 contains a valine at position 5, whereas the ORF encoded 
by Incyte Template ID 190989.1 contains a leucine; SEQ ID NO: 14 contains an 
alanine at position 214, whereas the ORF encoded by Incyte Template ID 190989. 1 
5 contains a glutamic acid. 

The nucleotide sequence of SEQ ID NO: 13 differs from the public EST 
(Accession number: R35699; Version number: R3 5699.1; GI: 792600) sequence at 
two positions (within the first 200 nucleotides of reliable sequence). Nucleotide 
positions 12-13 in SEQ ID NO: 13 are CG, whereas the corresponding nucleotides in 
10 the public EST are GT (i.e., positions 12-13 of the public EST; Accession no: R35699; 
Version no: R3 5699.1; GI: 792600) This leads to a single amino acid change when 
comparing SEQ ID NO: 14 to the ORF encoded by the public EST: SEQ ID NO: 14 
contains a valine at position 5, while the ORF encoded by the public EST contains a 
leucine. 

15 A Bestfit analysis of the partial human amino acid sequence with the full-length 

mouse amino acid sequence indicates that the human NgR3 amino acid sequence is 
complete at the carboxy terminal end and that they share 89.54% identity. An 
alignment of all the NgR proteins is shown in Figure 9. Although the human NgR3 
amino acid sequence is missing the first 25 amino acids, it can be determined that the 

20 human NgR3 protein contains the following features in common with the other NgR 
sequences: (1) eight Leucine Rich Repeat (LRR) domains; (2) an LRR carboxy- 
terminal (LRR-CT) domain; (3) a conserved cysteine in the fourth LRR domain; (4) a 
conserved potential glycosylation site in the eighth LRR domain; and (5) a 
hydrophobic carboxyl terminus. 

25 As those skilled in the art will appreciate, numerous changes and modifications 

may be made to the preferred embodiments of the invention without departing from the 
spirit of the invention. It is intended that all such variations fall within the scope of the 
invention. 

The entire disclosure of each publication cited herein is hereby incorporated by 
30 reference. This application claims benefit from United States provisional application 
60/238,361, filed October 6, 2000, which is incorporated by reference herein in its 
entirety. 
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Key for Sequence Listing: 

SEQ ID NO: 1 human NgR2 cDNA sequence derived from genomic sequence 

AC013606 

SEQ ID NO:2 human NgR2 amino acid sequence 

5 SEQ ID NO:3 mouse NgR3 cDNA sequence derived from AC021768 

SEQ ID NO:4 a mouse NgR3 amino acid sequence 

SEQ ID NO: 5 a human NgRl amino acid sequence 

SEQ ID NO:6 a consensus amino acid sequence for NgRs 

SEQ ID NO:7 #1055-1 120 amino acid residues of hNogoA (Nogo-66) 

10 SEQ ID NO: 8 a mature human NgR2 amino acid sequence 

SEQ ID NO: 9 a mature mouse NgR3 amino acid sequence 

SEQ ID NO: 10 a consensus NgR LLRNT amino acid sequence 

SEQ ED NO: 11 a consensus NgR LRRCT domain amino acid sequence 

SEQ ID NO: 12 a consensus NgR LRR domain amino acid sequence 

1 5 SEQ ID NO: 13 a partial human NgR3 nucleotide sequence 

SEQ ID NO: 14 a partial human NgR3 amino acid sequence 

SEQ ID NO: 15 a genomic sequence encoding a human NgR2 sequence. 

SEQ ID NO: 16 a genomic sequence (complementary strand) encoding a mouse 

NgR3 

20 SEQ ID NO : 1 7 a mouse NgRl amino acid sequence 

SEQ ID NO: 18 a consensus sequence for the NTLRRCT domain of NgR 

SEQ ID NO: 19 an consensus NgR LRRCT domain amino acid sequence 



25 



30 
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CLAIMS 



What is claimed is: 

1 . An isolated nucleic acid comprising a nucleotide sequence encoding 
a polypeptide comprising an LRRCT domain consisting of the amino acid sequence: 

NX 1 WX 2 CX 3 CRARX 4 LWXsWX 6 X 7 X g X 9 R X 10 S S S X n V 

X 12 C X 13 X 14 P X 15 Xjg X 17 X 18 X 19 X 20 D L X 21 X 22 L X23 X 24 X 25 D 

X 26 X 27 X 28 C [SEQ ID NO: 19] 



15 wherein X is any amino acid or a gap and the polypeptide does not comprise 

the amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 (human NgRl) or 
SEQ ID NO: 17 (mouse NgRl). 

2. The isolated nucleic acid according to claim 1, wherein X 17 and X^ 
20 are independently selected from the group consisting of: arginine and lysine. 

3. The isolated nucleic acid according to claim 2, wherein the amino 
acid sequence of the LRRCT domain is selected from the group consisting of: residues 
#261-3 10 of SEQ ID NO:2 and residues 261-3 10 of SEQ ID NO: 2 with up to 10 

25 conservative amino acid substitutions. 

4. An isolated nucleic acid encoding the polypeptide of SEQ ID NO: 2. 

5. An isolated nucleic acid encoding the polypeptide of SEQ ID NO: 4 
30 (mouse NgR3) or SEQ ID NO: 14 (human NgR3). 

6. The isolated nucleic acid according to claim 1, wherein the 
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polypeptide comprises: (a) aNTLRRCT domain, and (b) less than a complete CTS 
domain, provided that a partial CTS domain, if present, consists of no more than the 
first 39 amino acids of the CTS domain. 

5 7. The isolated nucleic acid to claim 1, wherein the polypeptide does 

not comprise an intact GPI domain. 

8. An isolated nucleic acid consisting essentially of a nucleotide 
sequence complementary to a nucleotide sequence encoding a polypeptide selected 
10 from the group consisting of: a polypeptide consisting of residues 3 1 1-395 of SEQ LD 
NO: 2, a polypeptide consisting of residues 256-396 of SEQ ID NO: 14 and a 
polypeptide consisting of residues 321-438 of SEQ ID NO: 4, wherein the nucleic acid 
is from 8 to 100 nucleotides in length. 

15 9. A vector comprising the nucleic acid of any one of claims 1, 4 or 5. 

10. A host cell comprising a vector according to claim 9. 

11. A polypeptide comprising a LRRCT amino acid sequence: 

20 

N X x W X 2 C X 3 C R A- R X 4 L W Xs W X 7 X 8 X, R X 10 S S S X n V 

X 12 C X 13 X 14 P X 15 X 16 X 17 X 18 X 19 X 2 o D L X 21 X 22 L X^ X 24 X 25 D 

25 X 26 X 27 X 28 C [SEQ ID NO: 19] 

wherein X is any amino acid residue or a gap and the polypeptide does not 
comprise the amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 
(human NgRl) or SEQ ID NO: 17 (mouse NgRl). 

30 

12. The polypeptide according to claim 1 1, wherein X 17 and X s is 
selected from the group consisting of arginine and lysine. 
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13. The polypeptide according to claim 11, wherein X 19 is glycine. 
[SEQIDNO:ll] 

14. The polypeptide according to claim 11, wherein the amine acid 
sequence is selected from the group consisting of residues 261-310 of SEQ ID NO:2, 
residues 206-255 of SEQ ID NO: 14, residues 271-320 of SEQ ID NO:4 and amino 
acid sequences thereof comprising a conservative substitution. 

15. A polypeptide comprising a NTLRRCT amino acid sequence: 



C P Xj X 2 C X 3 C Y X 4 X^ P X^ X 7 T X 8 S C X9 X 10 X n X 12 X 13 X 14 X^ X 16 P 
^17 X 18 X 19 P X 20 X 21 X^ X23 R X 24 F L X 25 X 26 N X 27 1 X 2g X 29 X 30 X 31 X 32 X 33 
X 34 F X 3S X 36 X 37 X 38 X 39 X^ X 41 X 42 L W X 43 X^ S N X 45 X 46 X 47 X 48 1 X 49 
X 50 X 51 X 52 F X 53 Xs 4 X 55 X 56 X^ 7 L E X 58 LDL X 59 D N X 60 X^ L X^ Xg 3 X^ 

15 X 65 P X^ T F Xg 7 G L Xgg X^ 9 L X 70 X 71 L X 72 L X^ X 74 C X 75 L X 76 X^ L X 78 

X 79 X 80 X 81 F X^ G L X 83 X 84 LQYL YLQ X 85 N X 86 X 87 X 88 X 89 L X^ D 
X^! Xg 2 F X 93 D L X 94 N L X 95 HLFLHGN Xgg X 97 X 98 X^ X 100 X 101 X 102 
X103 X 104 FRGL X 105 X 106 LDRLLLH X 107 N X 108 X 109 X 110 X m V H X U2 
X 113 A F X 114 X ll5 L X n6 R L X n7 X 118 L X 119 L F X 120 N X 121 L X 122 X 123 L 

20 X 124 X 125 X 126 X 127 L X 128 X 129 L X 130 X 131 L X 132 X 133 L R L N X 134 N X 13S W 

X 136 C X 137 C R X 138 R X 139 L W X 140 W X 141 X 142 X 143 X 144 R X 145 S S S X 146 
V X 147 C X 14g X 149 P X 150 X 151 X 152 X 153 X 154 X 155 D L X 156 X 157 L X 158 X 159 X 160 
D X 161 X 162 X 163 C [SEQ ID NO:18] 

25 wherein X is any amino acid residue or a gap and wherein the polypeptide is 

not the polypeptide of SEQ ID NO: 5 (human NgRl) or SEQ ID NO: 17 
(mouse NgRl). 

16. The polypeptide according to claim 15, wherein Xg, X 37 and X 38 
3 0 represents a gap . 

17. A polypeptide comprising an amino sequence selected from the 
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group consisting of: SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO: 14. 

18. The polypeptide according any one of claims 1 1, 15 or 17, wherein 
the polypeptide comprises: (a) an NTLRRCT domain, and (b) less than a complete 

5 CTS domain, provided that a partial CTS domain, if present, consists of no more than 
the first 39 amino acids of the CTS domain. 

19. The polypeptide according to any one of claims 1 1, 15 or 17, 
wherein the polypeptide does not comprise an intact GPI domain. 

10 

20. The polypeptide according to any one of claims 1 1, 15 or 17, 
wherein the amino acid sequence of the polypeptide further comprises an amino acid 
sequence of a heterologous polypeptide. 

15 21. The polypeptide according to claim 20, wherein the heterologous 

polypeptide is an Fc portion of an antibody. 

22. A method of producing a polypeptide according to any one of 
claims 11, 15 or 17, comprising the steps of introducing an isolated nucleic acid 

20 according to any one of claims 1, 4, 5 or 8 or a vector according to claim 9 into a host 
cell, culturing said host cell under conditions suitable for expression of said 
polypeptide, and recovering said polypeptide. 

23. An antibody that binds to a polypeptide of any one of claims 11, 15 

25 or 17. 

24. A composition comprising the polypeptide of claim 1 1, 15 or 17 
and a pharmaceutically acceptable carrier. 

30 25. A composition comprising the antibody of claim 23 and a 

pharmaceutically acceptable carrier. 
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26. A method of decreasing inhibition of axonal growth of a CNS 
neuron, comprising the step of contacting the neuron with an effective amount of the 
polypeptide of claim 11, 15 or 17. 



injury, comprising administering to a mammal an effective amount of the polypeptide 
of claim 11, 15 or 17. 

28. A method of decreasing inhibition of axonal growth of a CNS 
10 neuron comprising the step of contacting the neuron with an effective amount of the 

antibody according to claim 23 . 

29. A method of treating a central nervous system disease, disorder or 
injury, comprising administering to a mammal an effective amount of the antibody 

1 5 according to claim 23 . 

30. A method for identifying a molecule that binds a polypeptide of 
claim 1 1, 15 or 17 comprising the steps of: 

(a) providing a polypeptide of claim 11, 15 or 17; 
20 (b) contacting the polypeptide with the candidate molecule; 



5 



27. A method of treating a central nervous system disease, disorder or 



and 



(c) 



detecting binding of the candidate molecule to the 
polypeptide. 
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SEQUENCE LISTING 



<110> BIOGEN, INC. 

YALE UNIVERSITY 
STRITTMATTER, STEPHEN M. 
CATE, RICHARD L. 
SAH, DINAH W.Y. 

<120> NOGO RECEPTOR HOMOLOGS 

<130> A116PCT 

<140> 
<141> 

<150> 60/238,361 
<151> 2000-10-06 

<160> 16 

<170> Patentln Ver. 2.1 

<210> 17 
<211> 1260 
<212> DNA 

<213> Homo sapiens 
<400> 1 

atgctgcccg ggctcaggcg cctgctgcaa 
ctcctggccc tgcccctggc ggcccccagc 
ccgcccaccg tgagctgcca ggccaacaac 
agcactcagc gactcttcct gcagaacaac 
gggtccaacc tgctcaccct gtggctcttc 
actttccgcc acttgcaagc cctggaggag 
tcgctggagc ccgacacctt ccagggcctg 
tgccagctca gcagcctgcc cggcaacatc 
tacctccagg agaacagcct gctccaccta 
ctgagccacc tcttcctcca cgggaaccgc 
ggcctgggca gcctggaccg gctgctgctg 
gcggccttcc gcggcctcag ccgcctcacc 
tcgctgcccg gcgaggcgct cgccgacctg 
aacccctggg cgtgcgactg ccgcgcgcgg 
gtgtccagct ccgacgtgac ctgcgccacc 
gcgctccgcg aggccgactt ccaggcgtgt 
cgcgcccgcg gcaacagctc ctccaaccac 
ccagccgatc cctccaccct ctaccgagat 
ggcggggacg cgcctactga ggacgactac 
ggggagcaga tgtgccccgg cgctgcctgc 
ctctcggccg ggctccccag ccctctgctt 



gctcccgcct cggcctgcct cctgctgatg 60 
tgccccatgc tctgcacctg ctactcatcc 120 
ttctcctctg tgccgctgtc cctgccaccc 180 
ctcatccgca cgctgcggcc aggcaccttt 24 0 
tccaacaacc tctccaccat ctacccgggc 300 
ctggacctcg gtgacaaccg gcacctgcgc 360 
gagcggctgc agtcgctgca tttgtaccgc 420 
ttccgaggcc tggtcagcct gcagtacctc 48 0 
caggatgact tgttcgcgga cctggccaac 540 
ctgcggctgc tcacagagca cgtgtttcgc 600 
cacgggaacc ggctgcaggg cgtgcaccgc 660 
atcctctacc tgttcaacaa cagcctggcc 720 
ccctcgctcg agttcctgcg gctcaacgct 780 
ccgctctggg cctggttcca gcgcgcgcgc 840 
cccccggagc gccagggccg agacctgcgc 900 
ccgcccgcgg cacccacgcg gccgggcagc 960 
ctgtacgggg tggccgaggc cggggcgccc 1020 
ctgcctgccg aagactcgcg ggggcgccag 1080 
tgggggggct acgggggtga ggaccagcga 114 0 
caggcgcccc cggactcccg aggccctgcg 1200 
tgcctcctgc tcctggtgcc ccaccacctc 1260 



<210> 2 
<211> 420 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Leu Pro Gly Leu Arg Arg Leu Leu Gin Ala Pro Ala Ser Ala Cys 
1 5 10 15 

Leu Leu Leu Met Leu Leu Ala Leu Pro Leu Ala Ala Pro Ser Cys Pro 
20 25 30 
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Met Leu Cys Thr Cys Tyr Ser Ser Pro Pro Thr Val Ser Cys Gin Ala 
35 ~ ~ 40 45 

Asn Asn Phe Ser Ser Val Pro Leu Ser Leu Pro Pro Ser Thr Gin Arg 
50 55 60 

Leu Phe Leu Gin Asn Asn Leu lie Arg Thr Leu Arg Pro Gly Thr Phe 
65 70 75 80 

Gly bar Asn Leu Leu Thr Leu Trp Leu Phe Ser Asn Asn Leu Ser Thr 
85 90 95 

lie Tyr Pro Gly Thr Phe Arg His Leu Gin Ala Leu Glu Glu Leu Asp 
100 105 110 

Leu Gly Asp Asn Arg His Leu Arg Ser Leu Glu Pro Asp Thr Phe Gin 
115 120 125 

Gly Leu Glu Arg Leu Gin Ser Leu His Leu Tyr Arg Cys Gin Leu Ser 
130 135 140 

Ser Leu Pro Gly Asn lie Phe Arg Gly Leu Val Ser Leu Gin Tyr Leu 
145 150 155 160 

Tyr Leu Gin Glu Asn Ser Leu Leu His Leu Gin Asp Asp Leu Phe Ala 
165 170 175 

Asp Leu Ala Asn Leu Ser His Leu Phe Leu His Gly Asn Arg Leu Arg 
180 185 190 

Leu Leu Thr Glu His Val Phe Arg Gly Leu Gly Ser Leu Asp Arg Leu 
195 200 205 

Leu Leu His Gly Asn Arg Leu Gin Gly Val His Arg Ala Ala Phe Arg 
210 "* 215 220 

Gly Leu Ser Arg Leu Thr lie Leu Tyr Leu Phe Asn Asn Ser Leu Ala 
225 230 235 240 

Ser Leu Pro Gly Glu Ala Leu Ala Asp Leu Pro Ser Leu Glu Phe Leu 
245 250 255 

Arg Leu Asn Ala Asn Pro Trp Ala Cys Asp Cys Arg Ala Arg Pro Leu 
260 265 270 

Trp Ala Trp Phe Gin Arg Ala Arg Val Ser Ser Ser Asp Val Thr Cys 
275 280 285 

Ala Thr Pro Pro Glu Arg Gin Gly Arg Asp Leu Arg Ala Leu Arg Glu 
290 295 300 

Ala Asp Phe Gin Ala Cys Pro Pro Ala Ala Pro Thr Arg Pro Gly Ser 
305 310 315 320 

Arg Ala Arg Gly Asn Ser Ser Ser Asn His Leu Tyr Gly Val Ala Glu 
325 330 335 

Ala Gly Ala Pro Pro Ala Asp Pro Ser Thr Leu Tyr Arg Asp Leu Pro 
340 345 350 

Ala Glu Asp Ser Arg Gly Arg Gin Gly Gly Asp Ala Pro Thr Glu Asp 
355 360 " 365 

Asp Tyr Trp Gly Gly Tyr Gly Gly Glu Asp Gin Arg Gly Glu Gin Met 
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370 



375 



380 



Cys Pro Gly Ala Ala Cys Gin Ala Pro Pro Asp Ser Arg Gly Pro Ala 
385 " 390 395 400 

Leu Ser Ala Gly Leu Pro Ser Pro Leu Leu Cys Leu Leu Leu Leu Val 
405 410 415 

Pro His His Leu 
420 



<210> 3 
<211> 1383 
<212> DNA 
<213> Mus sp. 



<400> 3 

atgtcttggc 

gggtgctgtg 

tgtcctcgag 

tttgctgcca 

cgcatcacct 

tccaacaaca 

ctagaccttg 

gtgaagcttc 

tttggtggcc 

caagatgaca 

ctatggagcc 

catgagaacc 

accctctttc 

gtggccttgg 

tccctgtggg 

cccgagctgc 

acaggaccag 

gcccgcaagg 

catccgcctg 

aaccggaacc 

gcccccgact 

aagggcaagt 

tcaggcacgg 

cgc 



agtctggaac 
tggaattgct 
actgtgtgtg 
tcccggaggg 
tcctccagca 
tcactttcat 
gagacaaccg 
acgccctcta 
tgcacagcct 
tctttgtgga 
tgggccaagg 
agctacagtg 
tcttcaacaa 
agttccttcg 
aatggctgcg 
ggcaaggcca 
tgtctcctca 
agcaccatcc 
gctccaggtc 
agatctctaa 
atcagcacaa 
gtgctcgcag 
cccttggggc 



cacagtgaca 
gctgttgctg 
ctaccctgcg 
catcccagag 
gggccacttc 
tgctcccaac 
acagctgcga 
cctctataag 
gcagtatctc 
cctggtcaat 
catcttccgg 
ggttcaccac 
cagcctcact 
cctcaatggg 
aaggttccgt 
ggatctgaag 
ccagatcaag 
gtcccatggg 
aggttacaag 
ggtgagctct 
gttcagcttt 
gacccccatc 
cccactcctg 



caatctcccg 
ctcgctggag 
cccatgactg 
gacagtgagc 
agccccgcca 
accttcgagg 
acgctggcac 
tgtggactga 
tacttgcagg 
ctcagtcact 
ggcctggtga 
aaggctttcc 
gagctgcagg 
aatgcttggg 
ggctctagct 
ctgctgaggg 
tctcacacgc 
gcctccaggg 
aaggcaggca 
gggaaagagc 
gacatcatgc 
cgtgccccca 
gcctggatac 



tgcaggctgc 
agctacctct 
tcagctgcca 
gcatcttcct 
tggtcaccct 
gctttgtgca 
ccgagacctt 
gcgccctgcc 
acaaccatat 
tgtttctcca 
acctggaccg 
atgacctcca 
gtgactgtct 
actgtggctg 
ctgctgtccc 
tggaggactt 
ttaccacctc 
acaaaggcca 
agaactgcac 
ttaccgaact 
ccaccgcacg 
gtggggtgca 
tggggctggc 



tcaggtctca 
gggtggtggt 
ggcacacaac 
gcagaacaat 
ctggatctac 
tctggaggag 
ccaaggcctg 
cgcaggcatc 
cgagtacctc 
tggtaacaag 
gttgctgctg 
caggctaacc 
ggcccccctg 
ccgggcacgt 
ctgcgcgacc 
ccggaactgc 
tgacagggct 
cccacatggc 
cagccacagg 
gcaggactat 
acccaagagg 
gcaggcatcc 
agtcactctc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1383 



<210> 4 
<211> 461 
<212> PRT 
<213> Mus sp. 

<400> 4 

Met Ser Trp Gin Ser Gly Thr Thr Val Thr Gin Ser Pro Val Gin Ala 
1 ~ 5 10 15 

Ala Gin Val Ser Gly Cys Cys Val Glu Leu Leu Leu Leu Leu Leu Ala 
20 ~ ~ ^ 25 30 

Gly Glu Leu Pro Leu Gly Gly Gly Cys Pro Arg Asp Cys Val Cys Tyr 
35 ~ 40 45 

Pro Ala Pro Met Thr Val Ser Cys Gin Ala His Asn Phe Ala Ala lie 
50 55 ~ 60 

Pro Glu Gly lie Pro Glu Asp Ser Glu Arg lie Phe Leu Gin Asn Asn 
65 70 75 80 
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Arg He Thr Phe Leu Gin Gin Gly His Phe Ser Pro Ala Met Val Thr 
85 90 95 

Leu Trp He Tyr Ser Asn Asn lie Thr Phe He Ala Pro Asn Thr Phe 
100 105 HO 

Glu Gly Phe Val His Leu Glu Glu Leu Asp Leu Gly Asp Asn Arg Gin 
115 120 125 

Leu Arg Thr Leu Ala Pro Glu Thr Phe Gin Gly Leu Val Lys Leu His 
130 135 140 

Ala Leu Tyr Leu Tyr Lys Cys Gly Leu Ser Ala Leu Pro Ala Gly He 
145 " 150 ~ 155 160 

Phe Gly Gly Leu His Ser Leu Gin Tyr Leu Tyr Leu Gin Asp Asn His 
165 170 175 

He Glu Tyr Leu Gin Asp Asp He Phe Val Asp Leu Val Asn Leu Ser 
180 185 190 

His Leu Phe Leu His Gly Asn Lys Leu Trp Ser Leu Gly Gin Gly He 
195 200 205 

Phe Arg Gly Leu Val Asn Leu Asp Arg Leu Leu Leu His Glu Asn Gin 
210 1 215 220 

Leu Gin Trp Val His His Lys Ala Phe His Asp Leu His Arg Leu Thr 
225 230 235 240 

Thr Leu Phe Leu Phe Asn Asn Ser Leu Thr Glu Leu Gin Gly Asp Cys 
245 250 255 

Leu Ala Pro Leu Val Ala Leu Glu Phe Leu Arg Leu Asn Gly Asn Ala 
260 265 270 

Trp Asp Cys Gly Cys Arg Ala Arg Ser Leu Trp Glu Trp Leu Arg Arg 
275 "* " 280 285 

Phe Arg Gly Ser Ser Ser Ala Val Pro Cys Ala Thr Pro Glu Leu Arg 
290 ^ 295 300 

Gin Gly Gin Asp Leu Lys Leu Leu Arg Val Glu Asp Phe Arg Asn Cys 
305 310 315 320 

Thr Gly Pro Val Ser Pro His Gin He Lys Ser His Thr Leu Thr Thr 
325 330 335 

Ser Asp Arg Ala Ala Arg Lys Glu His His Pro Ser His Gly Ala Ser 
340 345 350 

Arg Asp Lys Gly His Pro His Gly His Pro Pro Gly Ser Arg Ser Gly 
355 360 365 

Tyr Lys Lys Ala Gly Lys Asn Cys Thr Ser His Arg Asn Arg Asn Gin 
370 375 380 

He Ser Lys Val Ser Ser Gly Lys Glu Leu Thr Glu Leu Gin Asp Tyr 
385 390 395 400 

Ala Pro Asp Tyr Gin His Lys Phe Ser Phe Asp He Met Pro Thr Ala 
4 05 410 415 

Arg Pro Lys Arg Lys Gly Lys Cys Ala Arg Arg Thr Pro He Arg Ala 
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420 425 430 

Pro Ser Gly Val Gin Gin Ala Ser Ser Gly Thr Ala Leu Gly Ala Pro 
435 440 445 

Leu Leu Ala Trp lie Leu Gly Leu Ala Val Thr Leu Arg 
450 455 460 



<210> 5 

<211> 473 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Met Lys Arg Ala Ser Ala Gly Gly Ser Arg Leu Leu Ala Trp Val Leu 
15 10 15 

Trp Leu Gin Ala Trp Gin Val Ala Ala Pro Cys Pro Gly Ala Cys Val 
20 25 30 

Cys Tyr Asn Glu Pro Lys Val Thr Thr Ser Cys Pro Gin Gin Gly Leu 
35 4 0 45 

Gin Ala Val Pro Val Gly lie Pro Ala Ala Ser Gin Arg lie Phe Leu 
50 55 60 

His Gly Asn Arg lie Ser His Val Pro Ala Ala Ser Phe Arg Ala Cys 
65 " 7 0 75 80 

Arg Asn Leu Thr lie Leu Trp Leu His Ser Asn Val Leu Ala Arg lie 

85 90 95 

Asp Ala Ala Ala Phe Thr Gly Leu Ala Leu Leu Glu Gin Leu Asp Leu 
100 105 110 

Ser Asp Asn Ala Gin Leu Arg Ser Val Asp Pro Ala Thr Phe His Gly 
115 120 125 

Leu Gly Arg Leu His Thr Leu His Leu Asp Arg Cys Gly Leu Gin Glu 
130 " 135 * 140 

Leu Gly Pro Gly Leu Phe Arg Gly Leu Ala Ala Leu Gin Tyr Leu Tyr 
145 150 * 155 160 

Leu Gin Asp Asn Ala Leu Gin Ala Leu Pro Asp Asp Thr Phe Arg Asp 
165 170 175 

Leu Gly Asn Leu Thr His Leu Phe Leu His Gly Asn Arg lie Ser Ser 
180 185 190 

Val Pro Glu Arg Ala Phe Arg Gly Leu His Ser Leu Asp Arg Leu Leu 
195 ~ 200 205 

Leu His Gin Asn Arg Val Ala His Val His Pro His Ala Phe Arg Asp 
210 215 220 

Leu Gly Arg Leu Met Thr Leu Tyr Leu Phe Ala Asn Asn Leu Ser Ala 
225 ~ ~ 230 235 240 

Leu Pro Thr Glu Ala Leu Ala Pro Leu Arg Ala Leu Gin Tyr Leu Arg 
245 250 255 

Leu Asn Asp Asn Pro Trp Val Cys Asp Cys Arg Ala Arg Pro Leu Trp 
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260 265 270 

Ala Trp Leu Gin Lys Phe Arg Gly Ser Ser Ser Glu Val Pro Cys Ser 
275 280 285 

Leu Pro Gin Arg Leu Ala Gly Arg Asp Leu Lys Arg Leu Ala Ala Asn 
290 295 300 

Asp Leu Gin Gly Cys Ala Val Ala Thr Gly Pro Tyr His Pro He Trp 
305 310 315 320 

Thr Gly Arg Ala Thr Asp Glu Glu Pro Leu Gly Leu Pro Lys Cys Cys 
325 330 335 

Gin Pro Asp Ala Ala Asp Lys Ala Ser Val Leu Glu Pro Gly Arg Pro 
340 345 350 

Ala Ser Ala Gly Asn Ala Leu Lys Gly Arg Val Pro Pro Gly Asp Ser 
355 360 365 

Pro Pro Gly Asn Gly Ser Gly Pro Arg His He Asn Asp Ser Pro Phe 
370 "* 375 38 0 

Gly Thr Leu Pro Gly Ser Ala Glu Pro Pro Leu Thr Ala Val Arg Pro 
385 390 395 400 

Glu Gly Ser Glu Pro Pro Gly Phe Pro Thr Ser Gly Pro Arg Arg Arg 
405 410 415 

Pro Gly Cys Ser Arg Lys Asn Arg Thr Arg Ser His Cys Arg Leu Gly 
420 425 430 

Gin Ala Gly Ser Gly Gly Gly Gly Thr Gly Asp Ser Glu Gly Ser Gly 
435 440 445 

Ala Leu Pro Ser Leu Thr Cys Ser Leu Thr Pro Leu Gly Leu Ala Leu 
450 455 460 

Val Leu Trp Thr Val Leu Gly Pro Cys 
465 470 



<210> 6 
<211> 440 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> M0D_RES 

<222> (3) . • (4) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (6) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (9) . . (10) 
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<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (12) . . (13) 

<223> Variable amino acid 

<22 0> 

<221> MOD_RES 
<222> (15) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (18) . . (25) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (27) . . (29) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (31) . . (34) 

<223> Variable amino acid 

<22 0> 

<221> MOD_RES 
<222> (36) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (39) . . (40) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (42) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (44) . . (50) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (52) . . (59) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (62) . . (63) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (66) . - (69) 

<22 3> Variable amino acid 

<220> 

<221> MOD RES 
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<222> (71) . . (74) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (76) . . (80) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (83) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (87) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (90) . . (91) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (94) . . (96) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (98) 

<223> Variable amino acid 
<220> 

<221> MOD_JRES 
<222> (101) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (104) . . (105) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (107) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (109) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (111) . . (112) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (114) 

<223> Variable amino acid 
<220> 
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<221> MOD_RES 

<222> (116) . . (117) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (119) . . (122) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (124) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (127) . . (128) 

<223> Variable amino acid 

<220> 

<221> MODJRES 
<222> (136) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (138) . . (141) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (143) 

<223> Variable amino acid 
<220> 

<221> MOD__RES 
<222> (146) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (148) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (151) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (154) 

<22 3> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (162) . . (170) 

<223> Variable amino acid 

<220> 

<221> MODJRES 

<222> (175) . . (176) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 
<222> (184) 

<223> Variable ainino acid 
<220> 

<221> M0D_RES 

<222> (186) . . (189) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (192) . . (193) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (196) . . (197) 

<223> Variable amino acid 

<220> 

<221> M0D_RES 
<222> (199) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (202) . . (203) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (205) 

<223> Variable amino acid 
<220> 

<221> M0D_RES 
<222> (208) 

<223> Variable amino acid 
<220> 

<221> M0D_RES 
<222> (210) 

<223> Variable amino acid 
<220> 

<221> M0D_RES 

<222> (212) . . (213) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (215) . - (218) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (221) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (223) . . (224) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 

<222> (226) . . (227) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (232) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (234) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (236) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (238) 

<223> Variable amino acid 
<220> 

<221> MODJRES 
<222> (243) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (246) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (248) . . (251) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (253) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (257) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (259) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (261) . . (262) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (264) . . (267) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 
<222> (269) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (272) . . (273) 

<223> Variable amino acid 

<220> 

<221> MOD__RES 

<222> (275) . . (277) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (279) . , (281) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (283) . . (287) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (289) . . (290) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (292) . . (328) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (330) . . (341) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (344) . . (346) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (348) . . (399) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (401) . . (428) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (431) . . (439) 

<223> Variable amino acid 

<400> 6 

Cys Pro Xaa Xaa Cys Xaa Cys Tyr 
1 5 



Xaa Xaa Pro Xaa Xaa Thr Xaa Ser 
10 15 
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Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Xaa Xaa Xaa Pro Xaa Xaa 
20 25 30 

Xaa Xaa Arg Xaa Phe Leu Xaa Xaa Asn Xaa lie Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Trp Xaa Xaa Ser 
50 55 60 

Asn Xaa Xaa Xaa Xaa lie Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 

Leu Glu Xaa Leu Asp Leu Xaa Asp Asn Xaa Xaa Leu Arg Xaa Xaa Xaa 
85 90 95 

Pro Xaa Thr Phe Xaa Gly Leu Xaa Xaa Leu Xaa Leu Xaa Leu Xaa Xaa 
100 105 110 

Cys Xaa Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Phe Xaa Gly Leu Xaa Xaa 
115 120 125 

Leu Gin Tyr Leu Tyr Leu Gin Xaa Asn Xaa Xaa Xaa Xaa Leu Xaa Asp 
130 " 135 140 

Asp Xaa Phe Xaa Asp Leu Xaa Asn Leu Xaa His Leu Phe Leu His Gly 
145 150 155 160 

Asn Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe" Arg Gly Leu Xaa Xaa 
165 170 175 

Leu Asp Arg Leu Leu Leu His Xaa Asn Xaa Xaa Xaa Xaa Val His Xaa 
180 185 190 

Xaa Ala Phe Xaa Xaa Leu Xaa Arg Leu Xaa Xaa Leu Xaa Leu Phe Xaa 
195 200 205 

Asn Xaa Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Leu Ala Xaa Leu Xaa Xaa 
210 215 220 

Leu Xaa Xaa Leu Arg Leu Asn Xaa Asn Xaa Trp Xaa Cys Xaa Cys Arg 
225 230 235 240 

Ala Arg Xaa Leu Trp Xaa Trp Xaa Xaa Xaa Xaa Arg Xaa Ser Ser Ser 
245 250 255 

Xaa Val Xaa Cys Xaa Xaa Pro Xaa Xaa Xaa Xaa Gly Xaa Asp Leu Xaa 
260 265 270 

Xaa Leu Xaa Xaa Xaa Asp Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Pro 
275 280 285 

Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
290 295 300 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
305 310 315 320 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
325 330 335 

Xaa Xaa Xaa Xaa Xaa Pro Pro Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Xaa 
340 345 350 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
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355 360 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
370 375 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
385 390 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
405 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
420 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu 
435 440 



365 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
380 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg 
395 400 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
410 415 

Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa 
425 430 



<210> 7 

<211> 66 

<212> PRT 

<213> Homo sapiens 

<400> 7 

Arg lie Tyr Lys Gly Val lie Gin Ala lie Gin Lys Ser Asp Glu Gly 
1 5 10 15 

His Pro Phe Arg Ala Tyr Leu Glu Ser Glu Val Ala He Ser Glu Glu 
20 25 30 

Leu Val Gin Lys Tyr Ser Asn Ser Ala Leu Gly His Val Asn Cys Thr 
35 ~ 40 45 

He Lys Glu Leu Arg Arg Leu Phe Leu Val Asp Asp Leu Val Asp Ser 
50 " 55 60 

Leu Lys 
65 



<210> 8 

<211> 390 

<212> PRT 

<213> Homo sapeins 

<400> 8 

Cys Pro Met Leu Cys 
1 5 

Gin Ala Asn Asn Phe 
20 

Gin Arg Leu Phe Leu 

35 

Thr Phe Gly Ser Asn 
50 

Ser Thr He Tyr Pro 
65 

Leu Asp Leu Gly Asp 
85 



Thr Cys Tyr Ser Ser Pro 
10 



Ser Ser Val Pro Leu Ser 
25 

Gin Asn Asn Leu lie Arg 
40 

Leu Leu Thr Leu Trp Leu 
55 

Gly Thr Phe Arg His Leu 
70 75 

Asn Arg His Leu Arg Ser 

90 



Pro Thr Val Ser Cys 
15 



Leu Pro Pro Ser Thr 
30 



Thr Leu Arg Pro Gly 
45 

Phe Ser Asn Asn Leu 
60 

Gin Ala Leu Glu Glu 
80 

Leu Glu Pro Asp Thr 
95 
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Phe Gin Gly Leu Glu Arg Leu Gin Ser Leu His Leu Tyr Arg Cys Gin 
100 105 110 

Leu Ser Ser Leu Pro Gly Asn lie Phe Arg Gly Leu Val Ser Leu Gin 
115 120 125 

Tyr Leu Tyr Leu Gin Glu Asn Ser Leu Leu His Leu Gin Asp Asp Leu 
130 135 140 

Phe Ala Asp Leu Ala Asn Leu Ser His Leu Phe Leu His dly Asn Arg 
145 150 155 160 

Leu Arg Leu Leu Thr Glu His Val Phe Arg Gly Leu Gly Ser Leu Asp 
165 170 175 

Arg Leu Leu Leu His Gly Asn Arg Leu Gin Gly Val His Arg Ala Ala 
180 " 185 190 

Phe Arg Gly Leu Ser Arg Leu Thr lie Leu Tyr Leu Phe Asn Asn Ser 
195 200 205 

Leu Ala Ser Leu Pro Gly Glu Ala Leu Ala Asp Leu Pro Ser Leu Glu 
210 "* 215 220 

Phe Leu Arg Leu Asn Ala Asn Pro Trp Ala Cys Asp Cys Arg Ala Arg 
225 230 235 240 

Pro Leu Trp Ala Trp Phe Gin Arg Ala Arg Val" Ser Ser Ser Asp Val 
245 250 255 

Thr Cys Ala Thr Pro Pro Glu Arg Gin Gly Arg Asp Leu Arg Ala Leu 
260 265 270 

Arg Glu Ala Asp Phe Gin Ala Cys Pro Pro Ala Ala Pro Thr Arg Pro 
275 280 285 

Gly Ser Arg Ala Arg Gly Asn Ser Ser Ser Asn His Leu Tyr Gly Val 
290 295 300 

Ala Glu Ala Gly Ala Pro Pro Ala Asp Pro Ser Thr Leu Tyr Arg Asp 
305 " 310 ~ 315 320 

Leu Pro Ala Glu Asp Ser Arg Gly Arg Gin Gly Gly Asp Ala Pro Thr 
325 330 335 

Glu Asp Asp Tyr Trp Gly Gly Tyr Gly Gly Glu Asp Gin Arg Gly Glu 
340 345 350 

Gin Met Cys Pro Gly Ala Ala Cys Gin Ala Pro Pro Asp Ser Arg Gly 
355 360 365 

Pro Ala Leu Ser Ala Gly Leu Pro Ser Pro Leu Leu Cys Leu Leu Leu 
370 375 380 

Leu Val Pro His His Leu 
385 390 



<210> 9 
<211> 421 
<212> PRT 
<213> Mus sp. 

<400> 9 
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Cys Pro Arg Asp Cys Val Cys Tyr Pro Ala Pro Met Thr Val Ser Cys 
1 5 10 15 

Gin Ala His Asn Phe Ala Ala lie Pro Glu Gly lie Pro Glu Asp Ser 
20 25 30 

Glu Arg lie Phe Leu Gin Asn Asn Arg lie Thr Phe Leu Gin Gin Gly 
35 40 45 

His Phe Ser Pro Ala Met Val Thr Leu Trp lie Tyr Ser Asn Asn lie 
50 55 60 

Thr Phe lie Ala Pro Asn Thr Phe Glu Gly Phe Val His Leu Glu Glu 
65 70 75 80 

Leu Asp Leu Gly Asp Asn Arg Gin Leu Arg Thr Leu Ala Pro Glu Thr 
85 90 95 

Phe Gin Gly Leu Val Lys Leu His Ala Leu Tyr Leu Tyr Lys Cys Gly 
100 105 110 

Leu Ser Ala Leu Pro Ala Gly lie Phe Gly Gly Leu His Ser Leu Gin 
115 120 125 

Tyr Leu Tyr Leu Gin Asp Asn His lie Glu Tyr Leu Gin Asp Asp lie 
130 135 140 

Phe Val Asp Leu Val Asn Leu Ser His Leu Phe Leu His Gly Asn Lys 
145 150 155' 160 

Leu Trp Ser Leu Gly Gin Gly lie Phe Arg Gly Leu Val Asn Leu Asp 
165 "* 17 0 17 5 

Arg Leu Leu Leu His Glu Asn Gin Leu Gin Trp Val His His Lys Ala 
180 185 190 

Phe His Asp Leu His Arg Leu Thr Thr Leu Phe Leu Phe Asn Asn Ser 
195 200 205 

Leu Thr Glu Leu Gin Gly Asp Cys Leu Ala Pro Leu Val Ala Leu Glu 
210 215 220 

Phe Leu Arg Leu Asn Gly Asn Ala Trp Asp Cys Gly Cys Arg Ala Arg 
225 ~ 230 235 240 

Ser Leu Trp Glu Trp Leu Arg Arg Phe Arg Gly Ser Ser Ser Ala Val 
245 250 255 

Pro Cys Ala Thr Pro Glu Leu Arg Gin Gly Gin Asp Leu Lys Leu Leu 
260 265 270 

Arg Val Glu Asp Phe Arg Asn Cys Thr Gly Pro Val Ser Pro His Gin 
275 280 285 

lie Lys Ser His Thr Leu Thr Thr Ser Asp Arg Ala Ala Arg Lys Glu 
290 295 300 

His His Pro Ser His Gly Ala Ser Arg Asp Lys Gly His Pro His Gly 
305 310 315 320 

His Pro Pro Gly Ser Arg Ser Gly Tyr Lys Lys Ala Gly Lys Asn Cys 
325 330 335 

Thr Ser His Arg Asn Arg Asn Gin lie Ser Lys Val Ser Ser Gly Lys 
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340 345 350 

Glu Leu Thr Glu Leu Gin Asp Tyr Ala Pro Asp Tyr Gin His Lys Phe 
355 360 365 

Ser Phe Asp He Met Pro Thr Ala Arg Pro Lys Arg Lys Gly Lys Cys 
370 375 380 

Ala Arg Arg Thr Pro He Arg Ala Pro Ser Gly Val Gin Gin Ala Ser 
385 390 395 400 

Ser Gly Thr Ala Leu Gly Ala Pro Leu Leu Ala Trp He Leu Gly Leu 
405 ~ 410 ~ 415 

Ala Val Thr Leu Arg 
420 



<210> 10 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 

<222> (3) . . (4) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (6) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (9) . . (10) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (12) . . (13) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (15) 

<223> Variable amino acid 
<400> 10 

Cys Pro Xaa Xaa Cys Xaa Cys Tyr Xaa Xaa Pro Xaa Xaa Thr Xaa Ser 
1 5 10 15 

Cys 



<210> 11 

<211> 50 

<212> PRT 

<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 
<222> (2) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (4) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (6) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (11) 

<223> Variable amino acid 
<220> 

<221> M0D_RES 
<222> (14) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (16) . . (19) 

<223> Variable amino acid 

<220> 

<221> MOD__RES 
<222> (21) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (25) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (27) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (29) . . (30) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (32) . . (35) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (37) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 

<222> (40) . . (41) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (43) . . (45) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (47) . . (49) 

<223> Variable amino acid 

<400> 11 

Asn Xaa Trp Xaa Cys Xaa Cys Arg Ala Arg Xaa Leu Trp Xaa Trp Xaa 
15 10 15 

Xaa Xaa Xaa Arg Xaa Ser Ser Ser Xaa Val Xaa Cys Xaa Xaa Pro Xaa 
20 25 30 

Xaa Xaa Xaa Gly Xaa Asp Leu Xaa Xaa Leu Xaa Xaa Xaa Asp Xaa Xaa 
35 40 45 

Xaa Cys 
50 



<210> 12 
<211> 196 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: 
sequence 



Consensus 



<220> 

<221> MOD_RES 
<222> (2) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (5) . . (6) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (8) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (10) . . (16) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (18) . . (25) 

<223> Variable amino acid 

<220> 
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<221> MOD_RES 

<222> (28).. (29) 

<223> Variable amino acid 



<220> 

<221> MOD__RES 

<222> (32) . . (35) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (37) . . (40) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (42) . . (46) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (49) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (53) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (56) . . (57) 

<22 3> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (60) . . (62) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (64) 

<223> Variable amino acid 



<220> 

<221> MOD__RES 
<222> (67) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (70) . . (71) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (73) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (75) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 

<222> (77) . . (78) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (80) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (82) . . (83) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (85) . . (88) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (90) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (93) . . (94) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (102) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (104) . . (107) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (109) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (112) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (114) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (117) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (120) 

<223> Variable amino acid 
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<220> 

<221> M0D_RES 

<222> (128) . . (136) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (141) . . (142) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (150) 

<22 3> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (152).. (155) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (158) . . (159) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (162) . . (163) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (165) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (168) . . (169) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (171) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (174) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (176) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (178) . . (179) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (181) . . (184) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 
<222> (187) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (189).. (190) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (192) . . (193) 

<223> Variable amino acid 

<400> 12 

Arg Xaa Phe Leu Xaa Xaa Asn Xaa He Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
15 10 15 

Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Trp Xaa Xaa Ser Asn Xaa 
20 25 30 

Xaa Xaa Xaa He Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Leu Glu 
35 40 45 

Xaa Leu Asp Leu Xaa Asp Asn Xaa Xaa Leu Arg Xaa Xaa Xaa Pro Xaa 
50 55 60 

Thr Phe Xaa Gly Leu Xaa Xaa Leu Xaa Leu Xaa Leu Xaa Xaa Cys Xaa 
65 " 70 75 80 

Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Phe Xaa Gly Leu Xaa Xaa Leu Gin 
85 90 95 

Tyr Leu Tyr Leu Gin Xaa Asn Xaa Xaa Xaa Xaa Leu Xaa Asp Asp Xaa 
100 105 HO 

Phe Xaa Asp Leu Xaa Asn Leu Xaa His Leu Phe Leu His Gly Asn Xaa 
115 120 125 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Arg Gly Leu Xaa Xaa Leu Asp 
130 135 140 

Arg Leu Leu Leu His Xaa Asn Xaa Xaa Xaa Xaa Val His Xaa Xaa Ala 
145 150 155 160 

Phe Xaa Xaa Leu Xaa Arg Leu Xaa Xaa Leu Xaa Leu Phe Xaa Asn Xaa 
165 170 175 

Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Leu Ala Xaa Leu Xaa Xaa Leu Xaa 
180 185 190 

Xaa Leu Arg Leu 
195 



<210> 13 

<211> 1176 

<212> DNA 

<213> Homo sapiens 

<400> 13 

gagggcatcc ccgtggacag cgagcgcgtc ttcctgcaga acaaccgcat cggcctcctc 60 
cagcccggcc acttcagccc cgccatggtc accctgtgga tctactcgaa caacatcacc 120 
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tacatccacc 
aaccggcagc 
ctctacctct 
agcctgcagt 
gtggacctgg 
ccgggcacct 
cagtgggtcc 
aacaacagcc 
ctccgcctca 
ctgcagaggt 
ggccaggacc 
ccgcaccaga 
cactcacccc 
aggaagccgg 
gccgggaaac 
tttgacatca 
atccgtgccc 
ctggcctgga 



ccagcacctt 
tgcggacgct 
acaagtgtgg 
acctctacct 
tcaacctcag 
tccggggcct 
accacaaggc 
tctcggagct 
acggcaaccc 
tccggggctc 
tgaagctgct 
tcaagtcaca 
acggccccac 
ggaagaactg 
aggcccccga 
tgcctacggc 
ccagcggggt 
cactggggct 



cgagggcttc 
ggcacccgag 
gctcagcgcc 
gcaggacaac 
ccacctgttt 
ggtgaacctg 
attccacgac 
gcagggtgag 
ctgggactgt 
cagctccgct 
gagggccgag 
cacgctcacc 
caggagcaag 
caccaacccc 
gctgccagac 
ccggcccaag 
gcagcaggcc 
ggcggtcact 



gtgcacctgg 
accttccagg 
ttgccggccg 
cacatcgagt 
ctccacggca 
gaccgtcttt 
ctccgcaggc 
tgcctggccc 
ggttgtcgcg 
gtcccctgtg 
gacttccgga 
accaccgaca 
ggccacccgc 
aggaaccgca 
tatgccccag 
aggaagggca 
tcctcggcca 
ctccgc 



aggagctgga 
gcctggtgaa 
gcgtctttgg 
acctccagga 
acaagctgtg 
tgctgcacga 
tgaccaccct 
cgctgggggc 
cgcgctccct 
tgtcccctgg 
actgcacggg 
gggccgcccg 
acggcccccg 
atcagatctc 
actaccagca 
agtgtgcccg 
gttccctggg 



cctcggcgac 
gcttcacgcc 
cggcctgcac 
cgacatcttc 
gagtctgggc 
gaaccagctg 
cttcctcttc 
cctggagttc 
gtgggaatgg 
gctgcggcac 
accagcgtcc 
caaggaacac 
gcccggccac 
taaggcgggc 
caagttcagt 
caggaccccc 
ggcctccctc 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1176 



<210> 14 

<211> 392 

<212> PRT 

<213> Homo sapiens 

<400> 14 

Glu Gly lie Pro Val Asp Ser Glu Arg Val Phe Leu Gin Asn Asn Arg 
1 5 10 15 

lie Gly Leu Leu Gin Pro Gly His Phe Ser Pro Ala Met Val Thr Leu 
20 25 30 

Trp lie Tyr Ser Asn Asn lie Thr Tyr lie His Pro Ser Thr Phe Glu 
35 4 0 45 

Gly Phe Val His Leu Glu Glu Leu Asp Leu Gly Asp Asn Arg Gin Leu 
50 55 60 

Arg Thr Leu Ala Pro Glu Thr Phe Gin Gly Leu Val Lys Leu His Ala 
65 70 75 80 

Leu Tyr Leu Tyr Lys Cys Gly Leu Ser Ala Leu Pro Ala Gly Val Phe 
85 ~ 90 95 

Gly Gly Leu His Ser Leu Gin Tyr Leu Tyr Leu Gin Asp Asn His lie 
100 105 110 

Glu Tyr Leu Gin Asp Asp lie Phe Val Asp Leu Val Asn Leu Ser His 
115 120 125 

Leu Phe Leu His Gly Asn Lys Leu Trp Ser Leu Gly Pro Gly Thr Phe 
130 135 140 

Arg Gly Leu Val Asn Leu Asp Arg Leu Leu Leu His Glu Asn Gin Leu 
145 " 150 155 160 

Gin Trp Val His His Lys Ala Phe His Asp Leu Arg Arg Leu Thr Thr 
165 170 175 

Leu Phe Leu Phe Asn Asn Ser Leu Ser Glu Leu Gin Gly Glu Cys Leu 
180 185 190 

Ala Pro Leu Gly Ala Leu Glu Phe Leu Arg Leu Asn Gly Asn Pro Trp 
195 "* 200 205 
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Asp Cys Gly Cys Arg Ala Arg Ser Leu Trp Glu Trp Leu Gin Arg Phe 
210 ~ " 215 220 

Arg Gly Ser Ser Ser Ala Val Pro Cys Val Ser Pro Gly Leu Arg His 
225 230 235 240 

Gly Gin Asp Leu Lys Leu Leu Arg Ala Glu Asp Phe Arg Asn Cys Thr 
245 250 255 

Gly Pro Ala Ser Pro His Gin He Lys Ser His Thr Leu thr Thr Thr 
260 265 270 

Asp Arg Ala Ala Arg Lys Glu His His Ser Pro His Gly Pro Thr Arg 
275 280 285 

Ser Lys Gly His Pro His Gly Pro Arg Pro Gly His Arg Lys Pro Gly 
290 ^ 295 300 

Lys Asn Cys Thr Asn Pro Arg Asn Arg Asn Gin He Ser Lys Ala Gly 
305 310 315 320 

Ala Gly Lys Gin Ala Pro Glu Leu Pro Asp Tyr Ala Pro Asp Tyr Gin 
325 330 335 

His Lys Phe Ser Phe Asp He Met Pro Thr Ala Arg Pro Lys Arg Lys 
340 345 350 

Gly Lys Cys Ala Arg Arg Thr Pro He Arg Ala Pro Ser Gly Val Gin 
355 360 365 

Gin Ala Ser Ser Ala Ser Ser Leu Gly Ala Ser Leu Leu Ala Trp Thr 
370 375 380 

Leu Gly Leu Ala Val Thr Leu Arg 
385 390 



<210> 15 

<211> 143899 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modified__base 

<222> (2044) . . (2144) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (6609) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (6625) (6724) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (14153) . . (14252) 

<223> a, t, c, g, other or unknown • 

<220> 

<221> modified base 



25 



WO 02/29059 



r 

PCT/US01/31488 



<222> (19512) . . (19611) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (22595) . . (22694) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied__base 

<222> (27825) (27924) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified__base 

<222> (34953) . . (35052) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (40783) . . (40882) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (49000) . . (49099) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (62884) . . (62983) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (75528) - . (75627) 

<223> a, t, o r g, other or unknown 

<220> 

<221> modified_base 

<222> (87944) - • (88043) 

<223> sl, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (111030) (111129) 

<223> a, t f c, g, other or unknown 



<400> 15 

aagcacatac aggtgacatt acagaactga cagttatgcc aggcactgta cttagcccct 60 

ataccatcct caaacagctg tatgatgtag attgggtatt aaccccatta ataacaaaag 120 

tacagggaac aaagtgactt tccaaaggtc atgccattca aaggagggtg aatcttaggt 180 

tggacgcagg ctgtctgact ctggagtctg aggtgttaat gctgcctcct ccatgggaac 240 

agcccaagtg aaaaacagct gatccactct tcatttactt ggcatctgtg ctaagctggt 300 

ccctgagcca agctctgagc aacagaaaca gaagctctgc attaggagct tgtgagcatg 360 

tcaatgccgg gtaaaggagt gctggaaacc gctgggatgg ccgccgagca ctaggccgtt 420 

gaaggtgggc tctgtgtgac tggttcctct acactctggc ctggctgcct gcaggaagaa 480 

gatcaagctg agtgggctgg ccctggacca caaggtgaca ggtgacctct tctacaccca 54 0 

tgtgaccacc atgggccaga ggctcagcca gaaggccccc agcctggagg acggttcgga 600 

tgccttcatg tcaccccagg atgttcgggg cacctcagaa aaccttcctg agagtgagtg 660 

tctggtcaag gtgccggcct tgggggatag tgatggtggg tcctcatatt cagtgagcac 720 

tcatggttga gtatttattc gcacccctct tcagtcctta caacacccca tgatgtaggt 780 

ggggcatgct cctcatttac agatgggcac atcaaagctc agctaacgct gggaagttca 840 

gattcagggt taccctgctg gattcctggg attggggagg gaggagcttc caaaatgggg 900 
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acaaggtctc tgggcctgtc gggtagctgg tttcctcagg gccccttgca acctctgagc 960 

ttattgcatc aggtgcagcc aggcccgtga gcctcctggc aggggtcctc cacacctggc 1020 

tgtcttttgc cccctgctgg tcacaggagg agctgcagca cctgcctggg ctgcttctca 1080 

ggagggtaca tgaagatccc aggaccgcca gctccatgat aagtggaagg agctccttgg 1140 

agtcaggagc gggagttgag gagtttgagt cctgctctcc agttataggc tatgtgactt 1200 

gtgtagatca cctaaccttg ctcttgattt ccttacctct taaactagca ctaaaagcac 1260 

cccacaaact gtaaagttag ttgtgatgat tgaatgacac catgggtgtg gaagctcttt 1320 

gtaaagtgca aaacggtgtg cagtttgagg gtggttaccc ccagtgccga ttctcagagg 1380 

gcaacatggc taagggcacg agctggagtt aggctgacct gctgcttcca gccctgtgag 1440 

cttgagcaag tcatttaact tcctgagctg cagtttcctc atcagtaaaa tgtgataagg 1500 

atagggttgt tgtaagattt tattaaatgg ggtaataaat gtcaagtatg tagcccatag 1560 

tgagtgcttc agagtttttt tcttttgttt ctttcccccc cgccccgaga tggagcctta 1620 

ctctgttgcc caggctggag tgcagtggca tgatcttggc tcactgcaac ctccgcctcc 1680 

cgggttcaag caattctcct gcctcagcct cccaaatagc tgggactaca ggcgtgcacc 17 40 

accatgctcg gctaattttt gtatctttag tagagacggg gtttcaccat gttggccagg 18 00 

ctggtctcga actcctgacc tcatgatgct cctgcctcag cccccgaaag ttttgggatt 1860 

acaagtgtga gcccccgtgc cctgccaggt tttttttttt tttttttttt tgtaaaacac 1920 

ccacagggta ttgctgttgc ctgggctgga gtgcggtagt gcaatcatag ttcactgcag 1980 

ccttgacctc ctgggctcaa gtgatcctcc tgcctcagcc tcctgagtag ctgggaatac 2040 

aggnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2100 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnttttgta ttttaagtag 2160 

agacagggtt ttcccaatgt tggccaaggc tggtctaaaa ctcccaacct caggtgatcc 2220 

acccacctca gcctcccaaa gtactgggat tacaggcgtg agacaccgtg cccagccagg 2280 

aggcttattt tcttgataaa ttacccagtc tcaggtattt ctctacagcg atgcaagaac 2340 

agcctaatac atccaggctc agcatcagtg gacccaggtg ggagagctta agatgtcaag 2400 

gtctgaatgc cgcttccaca cacctttggg acctagggac tccctctctt tttctttttt 2460 

cagtagaaga tgttatcttc tcctttctct gaccagtagt- tggtgatggt ttcagagata 2520 

gtttttcagt caagatatat ttcagtggct tcactgagcc caagttccct cgcctctcta 2580 

ggactttatt tccttgtttc tagaagaggg ataacacata ttttctaagg tggttgtgag 2640 

attaagggag ctggtaccgg gtggtgcata aggacaggat agagcaatgg tgagaccact 27 00 

caaaaagcga aaagttgacc tgcgagggtg acacttatca aatcagcaca cagtgggagt 2760 

ggaaggaatg tccctcatca gttacaatat ttggagagtg caagttatag aaaacccagc 282 0 

cctggccggg cgcggtgggt catgcctata atcccagcac tttgggaggc tgaggcaggt 2880 

ggatcacgag gtcaggagtt caagaccagc ctgaccaacg tggtgaaacc ccacctctac 2 940 

taaaaataca aaattagctg ggcgtggtgg tgtgtgcctg taatctgagc tactcaggag 3000 

gctgaggcac gagaatcact tgaaaccggg aggtggagtt tgcagtgagc cgagatcgca 3060 

ccactgcact ccagcctggg caacagagcg agactccatc tcaaacgaaa aaaaaaaaag 3120 

aaagaaaacc cagctctaac tggcttaaac agtaagaaga tctattatat tatccatctc 3180 

aggcagcagc aagcccagag gtaggggact ccaaggttgg ttgatccagg gcttaacgat 3240 

gtcatcaaag acccaggttc tttctgtctc ggcacctctg tctgcagggc cagcttcatc 3300 

ctaagccaga ttgttcttgt cttgattaca agttggctgc tgggccagca gacgctgcct 3360 

gcctccctgt tcatcttcag aagtagaaag tggcccttcc ccagtcatgg aatgaaagag 3420 

tttcctttct gtctgggatt gcttaggtcc acccacctga agccaatgac tgtcaccagg 3480 

aaggtaatat acactgattg tcttaagtca gggttcctga gccagtcttg ggcaaggagt 3540 

gtgatactgt catgattgtc ttgggctcat cagggcagct ctgcagatga gatcaaactc 3600 

caagctacat tattctgaac agtgggaagt aggaaagaga cattttggga gatacaaaac 3660 

acaatgtcta tcccatatcc ctaggtccag gtcacagtgt cttggttgga catcaaatgt 3720 

agaaaaagaa agactgtcca tccatttatc tacctattca tctggttttt gatttttttt 378 0 

aaattttatt ttaagacatt ctcactctgt cacccagact ggagtgcagt ggtttgatca 3840 

tggctcatgg cagcctcaac ctcccaggct caagtgaccc tcccatgctc aagtgatcct 3900 

cctacctcag cctcccaagt agctagaact aaaggtgcat gccaccacgc tcagttaatt 3960 

tttgcatttt ttgtagagat ggggtttcgt catgatgccc atgctagtct ggaattcctg 4020 

aactcaagca atatgcctgc ctttgcctcc caaaatgctg ggattgtagg catgagccac 4 080 

tgctcctggc tcatctgttt aataatttat gaaacaacta ctgggtgctg agcacggggc 414 0 

caggggctgg agatctagca gggaccaggc agatctctgc caagtcgttg gtttcttaaa 4200 

ggttttgctc ataattcccc ttttcttttc tctttcgttt tttttctttt ctttctttct 4260 

ttctttcttt tttttttttt gagacagagt ctcactctgt tacccaggct ggagtgcagt 4320 

ggtgcgatct cagctcactg caacctctgc ctcctgggtt caagcgattc tcctgcctca 4380 

gcctcccgag tagctgggac tacaggcgcc tgccaccatg cccggctaat ttttgtgttt 4440 

ttagtagaga ctgggtttca ccatattggc caggctggtc ttgaactcct gaccttgtga 4500 

tccgcccgct tcggcctccc acagtgctgg gattacaggc gtgagccacg gcgcccagcc 4560 

agtttccctt ttcaatgagg cctccctgac ctccatactc tactcctcca cctggcccac 4620 

tcagctctac tttttcttcc ccatagcact caagacctcc taacatacta cgtaagttat 4 680 

ttatttacta ggcttactgt gtattgtctg tcttcctcta ctagaatgta aactccatga 4740 
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gaatagaaat ttttgccttt ttatttagtg tggtgtctgc agcccctggc ttagtccctg 4 8 00 

gcatacaaca gtcactccac ccacagttgc tgaataagtg actaaaggtc cctgccctca 4860 

tattgttatg agggagtgtg catgttgtta gagaaaaatc tgaggcacaa taaaatttta 4 920 

tagagtttaa gttttctttt ttaagcaatc cacgaattgg ggtagtttca gaggtagttt 4 980 

ttcagtcatg acgtatttca atggcttcac tgagcccaag ttctttcacc tctctaggac 5040 

tttatttcct tatttctaga acggggataa cacatagttc ataaggcagt tatgagagta 5100 

agggagctgg tatggggtga tgcataagga caggatagag cagtggtgag accgctcaga 5160 

tgacaaagcg tcagagacca gtatttacga cggaaatgtg gaagcatgat aaagaaatta 5220 

tttgggctgg gcacaatgac tcacaactaa taaaactttg ggaggccaag gtgggaggat 5280 

cacttgactt gcagaaggtc aaggctgcag tgagctgtga ttttgccact gcactccagc 5340 

ctggtcaaca gagtgagacc ctggctcgaa acgttatttg attggttaca gttatacagt 5400 

tgccttattt ggtctattcc atttgaaagt tcctagttct ataattttaa gtttgttggc 5460 

tgtttctgat tggttaagct taagttttgt tttcctttaa tacagttaag tgccccataa 552 0 

tgacattttg gtcaaggaca gaccacatat acagtggtgg tcccataaga ttataatgga 5580 

gctgaaacat tcctattgtc tatggcgtag tggtcctgat gttgtagcgc aatgcattag 564 0 

ttatatgttt gtggcaatgc tggtgtaaac acacctactg cactgccagt gatataaaag 57 00 

aatagcacat acagttatat atagtacata atatctgata atgataatac ataactatat 57 60 

tactggttta tatatttact atattattta tctttatttt atttttgaga cagagtctca 5820 

ttctgtcacc caggctggag tgcagtggcg cgatcttggc tcaccgcaac ctccgcttcc 58 8 0 

tgggttcaag tgattctcct gcctcagtct cctgagtagc tgggattaca ggtgtgcacc 594 0 

atgacaccct gctaatatgt tttgtatttt tagtagagat ggggtttcac catgttggcc 6000 

aggctggtct tgaactactg acctcaagtg atcaccccgc ctcggcttcc caaagtgctg 6060 

ggattacagg cgtgagccac cacgcatggc ctatttataa ttattttaga gtgtacgcct 612 0 

tatacttata aaaaaaagct aactgtcaaa cagcctcggg caggtccttc aacagatatt 6180 

ccagaagaca ttgttatcat aggagatgac agctccgtgc atattattgt ccctgaaaac 6240 

cttctagtgt ggaagtggaa gacagtgata ttgatgatag gacccagtgt aggcctaggc 6300 

taatgtgtgt gtttgtgtct ttgcttttaa caagaaagtt taaaaagtta aaataaaata 6360 

caaaaatttt taaatagaaa aaagctgccc aggaacaatg gctcacacct gtaatcccac 6420 

cattcgggga ggccaaggtg ggtggattgc ttgagctcag gagttcaaga ccagcctggg 6480 

caacatggtg aaaccccatc tctacaaaaa atacaaaaat tagccgggtg tggtggcatg 654 0 

cggctatagt tccagctaat cgaggggctg aggtgggagg atcactgggg gggaggtggt 6600 

tgaggctgna gtgagctgtg attgnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6660 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 672 0 

nnnnatattc ttaaaaaaat ttttttttat ttttgagaca gaatttctct cttgttgccc 6780 

aggctggagt gcaatggcgc tatctcagct cagggcaacc tccacctcct gggttcaagc 6840 

gattctcctg ccttagcctc ccaggtacag gcgcccgcca ccatgctcgg ctaatttttg 6900 

tatttttagt agagatgggg tttcaccatg ttgtccaggc tggtcttgaa atcctgcctc 6960 

aggtgatcca cccccctcgg cctcccaaag tgctggaatt tacaggcgtg agccactgtg 702 0 

cctggcctcc tttacatttt tttaaattta attttaattt tttaattttt aatttctcat 708 0 

atatatatat ttttaagact agccaagtga agcagtggga gtggaaaagg aactggtttt 714 0 

gatcaatagg tgtaaacacc actgcactgg gaccagccta ttttacattc ctgttagcag 7200 

tgatgagggt tcactttctt tgtagcctca acaatatgtg tcgttgccca tctttttttt 7260 

tttttttttt tttttttttg agatggagtc tcactctgtt gcctaggctg gaatgcaatg 7320 

gcatgatctc agctcactgc aacctccgcc tcccaggttc aagtgattct tgtgtctcag 738 0 

cctcctgagt agatgggatt acaggcgtcc accaccacgc ccggctaatt ttttgtattt 744 0 

tcagtagaga tggggtttca ccatgttggc caggttggtt tcgaactcct gacctcaagt 7500 

gatccgccca cctcggcctc ccaaagtgct gggattacag gcatgagcca ccgcgcccgg 7560 

cctgcccatc ttttttttgt tatagccatc ctagtggatg taaagttttt ttgtgatttt 7620 

gatttgtgtt tccctactga tcaatgatgt tgagcatctt ttcctgtgct tattggcttt 7680 

tggtatatct ttggagaaag gtctattcag gtcctttgcc cactttaaaa ttaggttatc 774 0 

tttctattac tgagatgtaa gagttcttta tgttctagat ataagtctcc tacatatgat 7800 

ttgtaaaaat tttccttcca ttattgggtt gtctttcact ttcttttggt gtcctttagt 7860 

gcacaacagt ttttaatatt gaagtccaat tttctatttt tctcttttgc cacttgtatc 7920 

ttggtgtcat gtttaaggaa ctattgccta atctcaggtc acaaagattt acacctgtgt 7980 

ttccttcttt ccttccttcc ttccttcctt ccttctttcc ctccctccct ctctccctcc - 8040 

ctccctctct ccctccctcc ctccttccct tcctccctcc ctccctcctt ccttccttcc 8100 

ttccttcctt ccttccttcc ttccttcctt ccttcctttg tccttctgac ggaatcttgc 8160 

tctgtcaccc aggctggagt gtagtggcac gatcttggct cactgcaacc tctgcctcct 8220 

gggttcaagc aattctcctg cctcagcctc ctgagtagct gggactacag gcacacacca 8280 

ccatgcccag ctaatttttg tatttttagt agagacgggg tttcaccaca ttggccagga 8340 

tggtttcgat ctcctgacct cgtgatccac ccgccttggc ctcccaaagt gctgggattg 8400 

caggtgtgag ccaccatgcc cggcctgtgt tttcttagag ttttgtagtt ttagctctta 84 60 

tagttagatc cttgatccat tttgagttga ttttgtatat agtgtgagat atccacctgg 8520 

tgttgtaaat tgcccagaag tgggtatgct tctaaatctg gctgttaggg attactagag 8580 
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gtgaccaaag tgaatttttt ctttgtttct tttttttttt ggagacagag tctccgtcac 8640 

ccaggctgga gtgcaatggc ttcatcttgg ctcagtgcaa cctctgcctt ctggtttcaa 8700 

gcagttctcc tgcctcagac tcctgagtag ctggtattac aggcgtgtac caccatgctt 87 60 

ggctaatttt tgtattttta gtaaagatgc agtttcacct gttggccagg cttttctgga 8820 

actcccggcc tcaagtgatc catctgcctc tacctcccaa agtgctggga ttacaggtgg 888 0 

gagccaccgt gcccagtcct tttctcagaa tttatttgtt tttttttgtt ttgtttcatt 8940 

tttgagatag ggtctcactc tgtcagctag gcaggagttc agtggtgtga tcattgctgc 9000 

agccttgaac ttctggactc acgtgatctt cccacctcag cctcctgagt agctaggatt 9060 

acaggcatgt gcttccacac ctggctaatt ttttaatttt ctaggactta tttgtccatt 9120 

cttgcaa^gc agggtacaac atgcctatct ctacctacct ctcttccctt caagggactc 9180 

cagccaaaat ccttgaggct ctcgggctga ctgtgggtgc tgttgcctga tctgcctcag 9240 

tcatgctgca tgatcaaaag tgtccgtttt ctgcttcttg gaactttatt cactttgggt 9300 

gtcagtcttc ctctgcagtg tcccaagaac acagaattag accaggaatc tgtgttgcca 9360 

tagtgtgtgg aaagaggcag acttccaact ccgctatgtg ctgttgggtg attgaagctt 9420 

aattttcttt ctatctttct ttcttttctt ttcttttttt ttttttggag atggaatctc 94 8 0 

gctctgttgc ccaggctgga gtgcagtggt gcgatctcac ctcactgcaa cctccgcctc 954 0 

ccaggttcaa gcgattctcc tgcctcagcc tcctgagtag ctgggattac aggtgcatgc 9600 

caccatgccc ggctaatttg tgtaatttta gtagaaacag tgtttcacca tattggtcag 9660 

gctggtctcg acctcctcac ctcaggtgat ccacccgcct tggcctccca aagtgtcggg 972 0 

attacaggcg tgagccaccg tgcctggcac ttaattttct taatacctca attaccccat 9780 

atggtaaaat gggactagta atccatacct tatagcgctg ttgtgaaaat gaaatgaggg 984 0 

taagcagata aaatttcaga ctacggatgg gattgttact acattctgaa cctggctttg 9900 

ctgttatttg ctatgtgacc ttatcttctc tggatctcca ttctttccaa gtctataaaa 9960 

caaagtggac aattgtcaac ctttcttcca aagagcaatg atttaaggat caaatgatgt 10020 

catttaacaa aaatatgaag agctcaacaa atgaggaact cattattatt attacaatta 1008 0 

ttattatttt agaaataggg tcttgttctc ttgcctaggc tggagtccag tggtataaac 1014 0 

acagctcaat gcatcttcag cctcctggat acaagtgatc ctcatgtctc atccccctaa 10200 

gtagctggga ccacaggcat gtaccaccac gcacggctaa ttttttattt tttattttta 10260 

ttttttgaga cagtcttgct ttgtcgccca gactggagtg cagcagcgca atcaccgctc 10320 

actgcaacct ccgcctcctg ggttcaagtg attctgctgc ctcaacctcc caagtagctg 1038 0 

ggattacagg cctgtgccac catgcccggc taattttttt gtatttttgg taaagacggg 1044 0 

gtttcaccat gttgcccagg ctgatctaga acccctggcc tcaagtgatc cccctttctt 10500 

ggcctcctaa agtgctagga ttacaggcgt gagcctctgc acctggcctc ggctaatttt 10560 

ttattttttg tagagacagg ttctcactat gttgccaggg ctggtcttga actcctgggc 10620 

tcaagtgatc ttcccacctc agcctcccaa agtgctgaga ttacagatgt gagccactgt 1068 0 

gcctggcctg gaactcatta ttgaagcatt cactagtatc aactttgggg ttacctggcc 1074 0 

acatcctctg acctacctat aagggtatca cagctaacgg agcctctgtt tctcagaatt 10800 

taggcagaag cagttcaatt tatcacaaac tactctatat ccagcataag tgcccaaata 108 60 

aaacaattgc taaagttctt taggcattta ctgtttgtta gttagatatt tagtcctcac 10920 

tacaaatctg tgatacaggt attattttta ttaaccccat tttatagaag agaaacctga 1098 0 

agctcagaga tgctaagtaa cttgtgcaag gtcacacagc tagtaaataa agggcagagt 1104 0 

aaagatttag tttcacattg gactccagaa cctttctact gggactcatg ggaatagtgt 11100 

ggatgtccct gaccttcagt ggcccagggc tctcctgggg gaatccagcc atagacaaga 11160 

caccagcgag agcccaatcc taagattttg tttgtttgtt tttgagacaa ggtctcactc 11220 

tgtcaccaga ctggagtgca gtggcatgat caatgctcac tgcaaccttg atctcccagg 1128 0 

ctcaagcaat cctcccacct cagcctcctg agtagcttgg actacaggtg cacaccacca 1134 0 

cacctgacta attttaaaat tttatttaat taattactta ctattatttt ttgagacagg 11400 

gtatcacttt gtcacccaag ctggactgca atggtgtggt ctcagctcat tgcgtcctcc 11460 

acctcccagg ttcaagtgat cctcccacct cagcctctgg agttgcaggg actgcaggtg 11520 

tgcgccacta tgctcagcta atgtttttat tttttgtata gatggggtct cactatgttg 1158 0 

ccagggctag tctcaaactc ttggactcaa gcgatcctcc tgtcttggcc tcccaaagtg 1164 0 

ccgggattac aggcataaac caccacaccc aacccctaag gtgtttttgc tgaatgtgac 11700 

catgtcagag gcaggaaagg gaagcatcat ggggttagga aaggaacact gagcagggag 117 60 

acaaagaaaa tgggatcatt ttgtgagtgt tcgctgtgtg tgtatgtgtg acaattctca 11820 

gagccagcct ctcaggtggt tgagaccaca gtccccattt cccagatgag ataatggagc 11880 

ctcagagagt ttctgcagca cagctagtgg aattagaatt tgaacccggc tcttccagac 1194 0 

tccaggtgct tcacaaccat cccaaaccta gtcatttgca gtttaccttc atgattttac 12000 

catttccctt tgccatagct agtgttattt acttaataat tccttttgaa tcagtctgct 12060 

taaaaaaaaa tagcttcatt ctaaagtgta atattcttgg aatatcgggt ttgctgttac 12120 

ccacccccac acgttataca tatacatgta tgtttctaat acatatatat gtacgtatat 1218 0 

acgtgtatcg ttttttgtta ttttttttgt tgttgttagt tttttttaga tggagtctct 1224 0 

ctctgtagcc caggctggag tgcagtggtg tgatttcggc tcactggaac ctctgcctcc 12300 

tgggttcaag cgattctcct gcctcagcct ctggagtagc tgggattaca ggcacccacc 12360 

actacacccg gctaatgttt gtatttttag tagagacagg gtttcaccat gttggccagg 12420 
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tgggtcttga 

ttataggtgc 

aataactatg 

acatgctgtg 

aatcactgtc 

atgccaaaga 

ttgggctggc 

aagccttcct 

gaatgaatgt 

gtggaagagc 

tggcccatgt 

cttgagtttg 

ggacccttac 

attaaatggg 

tgggattgtt 

gtctaattca 

ggtaaagcag 

gtatgacctt 

tgcctagatc 

tcacccaggc 

tcacgccatt 

gcccggctaa 

atggtctcga 

acaggcttga 

tggattttta 

ataagggtca 

gtgcagcctc 

acagtgctgt 

ataaacctgt 

nnnnnnnnnn 

nnnnnnnnnn 

gggtagatca 

ctctacaaaa 

cgggaggctg 

atcatgccat 

aaaaaaaaag 

attggttcct 

cctggcttgg 

tccaccacta 

tggccagtcc 

atagtgatca 

catttgtggg 

gggctgcctg 

ctgtactttt 

tgccgtggcg 

acctcagtct 

gtatttttag 

tcaggtgatc 

gcccggtcct 

ggcatagttc 

agctcttggc 

gtgcagcaaa 

atttcaattt 

agtgtcccct 

cagaccctgg 

ggagaggatg 

ttgccagccc 

cctcaactgg 

atcatctgcg 

cattaaactt 

catattaata 

gtagattttt 

ggctgtggca 

agcactgcac 



actcctgatc 

gagctactgc 

aaacctgaaa 

cataccacat 

ccatcctatg 

ccttttctgt 

tctcctttta 

tatgaatgaa 

atgaattgac 

acatgtcctt 

aacacagagg 

aatctgagcc 

catttataaa 

attctctata 

tgtgagggaa 

gggactgcaa 

ggcaggtgta 

gagaaggcaa 

ttctaacttt 

tggagtgcag 

ctcctgcctc 

tttttttttt 

tctcctgact 

gccaccgcac 

tgtgataact 

acattgccag 

tggtctaaaa 

tgtatagtaa 

agagaggccg 

nnnnnnnnnn 

nnnnnnnnnn 

cctgaggtca 

ataaaaaaat 

aggccggaga 

tgcactccag 

aaaaaagaaa 

gagagtagcc 

gccaatgaat 

gggaggtctc 

tcctacagtc 

aggccgttac 

tcttttggaa 

tccctgctcc 

tttttttttt 

tgatctctgc 

cctgagtagc 

tagagacagg 

cacccgcaca 

accaatctgc 

tgagaagcat 

agagacatct 

ttttgtctag 

ttaagaaatg 

taatttctta 

ggtgacagaa 

atctacagag 

tgaagctatg 

ggccattttt 

ggggtgggga 

tctacaacgc 

gcacaaagtt 

cacacattaa 

ggaggattac 

tccagcctgg 



tcaagtgatc 

ggctggccaa 

aactgctcca 

attgggaaac 

cttcgcttta 

tgtcattaac 

gttcagaatt 

ctcgttggtc 

actaagtcat 

tctggcactg 

ccctttggtg 

tgcctgtaag 

atggggacca 

gaaaatattg 

attgtcattt 

atggaaatgc 

gaatagcaat 

gccctggcaa 

ttatttttat 

tggcgcgatc 

agcctcccaa 

gtatttttag 

ttgtgattta 

ctggccgatc 

cctgatttta 

cagagcaaaa 

tcatctttgt 

atatccagta 

ggcacagttg 

nnnnnnnnnn 

nnnnnnnnnn 

ggagttcaag 

tatctgggca 

attgcttgaa 

cctaggtgac 

agaaaaagaa 

aaccttattg 

gaggaagaat 

aagcttgaaa 

aagtagataa 

tctgcaatca 

gagcagagaa 

ctgagccatg 

ttttttgaga 

tcacagcagc 

tgggattaca 

gtttcactgt 

tgcctcccaa 

acattttaat 

cccacaagga 

ggcaaattca 

agcttggtca 

ttgcattaaa 

gtgtgtctga 

gactggtggg 

tgctggagag 

ccttgaggtt 

ctccccagag 

gaggggctac 

ccaggacagc 

gaggaacctt 

agatgagaaa 

ttgagcccag 

gtgacagagc 



cacctgcttt 
tgtatgtttt 
tgttacttcc 
actgctttcc 
tttgtctata 
atggacacag 
tggattttta 
aaactcataa 
tagcatttca 
atgtgtgctt 
cagtggaaaa 
aagctggcta 
ttgtactcac 
gcacaaagta 
gattatcaaa 
cagctgaggc 
gtctgggaat 
aaggggatgg 
ttttattttt 
tcggctcact 
gtagctggga 
tagagatggg 
ccctccttgg 
ttctaacttt 
aactggcacc 
catgggtggg 
atttcttctt- 
agtgggtgta 
ctcatgtctg 
nnnnnnnnnn 
nntagtccca 
accagcctgg 
tgattgcagg 
cctgggaggc 
ggagcaagat 
acaatgaatg 
gtcatcccag 
aatggcaaat 
tccagtgacc 
gtggagggtt 
gactcagaaa 
gaggaaggaa 
gagggctggg 
cagagtctcg 
atctgcctcc 
ggtgcccacc 
gttgggcagg 
agtgctggaa 
tgacaagggt 
tgcctctgag 
aggcttgaga 
gttaaaattt 
atgttattta 
gttgagagcc 
agatgggagg 
actctgtatg 
gggcaaggtg 
gacatttgga 
tgacatctgg 
ccctccacaa 
gatctagatc 
gcttgtgcct 
gaattcaggg 
aagattttgt 



ggcttcccaa 

taatacacat 

tgaacccatc 

ctggcttcca 

aatgttgggg 

ctctaagagg 

tccaactact 

aaggctgatc 

cgggaatgga 

gggaaactta 

ctgttgactt 

actgaattgc 

cctttagggt 

ggtgtaaatt 

gacttaggag 

caggcatttg 

taaaagagag 

cctccactca 

tgagacggag 

gcaagctccg 

ctacgggcgc 

gtttcaccgt 

cctcccaaag 

ttaaagagaa 

caattataat 

ggcaactgct 

gctttacgca 

gaatgaataa 

taatctcagc 

nnnnnnnnnn 

gcactttggg 

ccaatatagt 

tgcctctaat 

ggaggttgta 

tctgtctcaa 

agtgtgaggc 

ccacgaagtg 

gggggtctat 

aggtttttag 

tggtccgttg 

tggcctctca 

tttaggtctt 

gtggaatatg 

ctccgtcgcc 

cgggttcaag 

accacgcccg 

ctggtctcaa 

ttacaggcat 

caccctccac 

ttcaccctga 

catgctggcc 

tgatgttttg 

tcttgaatag 

tcccctgcct 

tgaggagggg 

gagcttttca 

gcatatccta 

aacatgtgga 

tgagtagaga 

taaagagtta 

cacagcacag 

gtaatccctg 

ttacagtgaa 

ctcttaaaaa 



agtgctggga 

tcaaataacg 

ttgagtgctc 

agcccagctt 

ttgggggttg 

tcttggcatc 

cagagtgatc 

gataaaacag 

ttctccgtta 

ctgagctaac 

tggagattat 

tttgcttctt 

tattgcatgg 

tgcacgctag 

caggaacagt 

ctaataattg 

gtgaggacgt 

gctacagtca 

tcttgctctg 

cctcccgggt 

ccaccaccat 

gttagccagg 

tgctgggatt 

gcaagacatc 

ttacaacact 

ggtcaccggt 

ttgtcccagc 

accaatgcag 

acnnnnnnnn 

nnnnnnnnnn 

aggccaaggt 

gaaaccccgt 

cccagctact 

gtgagccgag 

aaaaaaaaaa 

tcatggtagt 

aaatggtacc 

gcctccaccc 

gtcctggacc 

ggctacggag 

gttacttctc 

ctcaccctct 

gggaataaat 

caggctggag 

ttattcttcc 

gctaattttt 

atacctgacc 

gagccaccgt 

tcatgtgcca 

caagtccact 

tctctttaaa 

ttttgcatta 

taaatttctt 

gattctagtc 

agtgttggtt 

tgctgcctgt 

gatcagagat 

gacatttttg 

ccagagggac 

tttgacctca 

aagaaaggat 

tgactcagga 

ctatcatcgc 

aaaaaaagat 



12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 

15600 
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gaggacaggc 
gatcacgagg 
aaaaatacaa 
gggaggtgga 
gcaagactcc 
agagagagag 
cttccacttc 
tgcctctgtg 
gtgttgcaca 
cctcttgttt 
tgggattcca 
ttttaggggg 
gaagcaaacc 
gcagtcccag 
tcctttgtcc 
acccagtata 
gactcttgga 
agaagcagca 
agcagacctg 
cggagactgg 
tagactctcc 
taatcaaaca 
acaaagacag 
caatgcctct 
aagtctctcc 
ccccccgctt 
cagctcatct 
tttgtttgat 
tgttttgaga 
tcaccacaaa 
tgggactaca 
gtttcaccat 
cctcccaaag 
ctttcagaag 
ctcacacgtg 
ttcttgtcca 
ctccagtctt 
ccacggtgct 
aatgatgccc 
agttcaaagg 
gaggaggtca 
gacccagctg 
cagggaagtc 
gcagaagctc 
cattcagtct 
agaatcacct 
atttaggagg 
atgatattgt 
gcctgtgtta 
cctttcaaag 
agttgtggtg 
gtaataaact 
attgtcctga 
ttttaacctg 
agagggaatt 
nnnnnnnnnn 
gttagcacat 
tggaggcaat 
tcaggaaagg 
gatgaggctt 
agtttggatt 
tgacaatgac 
cagaagatgg 
ggggacgatg 



acagtggctc 
tcaggagttc 
aaaattagcc 
gcttgcagtg 
gtctcaaaaa 
gaaagaaaga 
ccatggaacc 
aagtaggggt 
ctatgcccct 
cttcccccca 
ccagcaggag 
cataaataat 
agggtgtggg 
cagagagcag 
cccactctgt 
aaggctgctt 
aggtgagttc 
cctcataggg 
tccccgcagg 
aagacaaatg 
agctcagggt 
attaaacaaa 
ccccaagaag 
tgctcctgcc 
cttccctcct 
ttatttgcac 
ccggtaagat 
ttttttttgt 
cggagtctcg 
ctccgcctcc 
ggcacatgcc 
atttgccagg 
tgctgggatt 
gaaaaaccac 
ctttctccca 
tttgtgggtg 
gtcttcaggt 
ggaactagga 
cccatctgga 
agcaggagag 
aggcttctgc 
ccttagacaa 
caaggtgcaa 
aggtaagtag 
tctgttccct 
ggagttgttg 
gctgaggtga 
tggtctgggg 
aattctggtt 
ctttagcacc 
agcattaaat 
ctcaataagc 
cagctaatta 
aaaagggatg 
cnnnnnnnnn 
nnnnnnnnnn 
agccagaaca 
ggcacttgca 
agagcttcag 
ggaggctggg 
ttctcctact 
acgtttgcgt 
aagataaagc 
tgagtgacag 



atgcctgtaa 
aagaccagcc 
agctacttgg 
agccaaaatc 
gaaaaaaaaa 
gagaaggttt 
ttggcaagtt 
ggtacctggt 
aggatgggtg 
ttgtctccca 
gctactcctg 
aattatgcct 
gtacccaaac 
ggcacagggt 
cctggagcaa 
ctggcccctg 
ttttctggct 
caaacacgta 
ggcaggagtc 
aaatttggta 
ctgttcatgg 
tattccctct 
gacggtggag 
ctttctcctg 
catgcgtctt 
tcatgaggcc 
atcacttgga 
tgttgttgtt 
ctctgttgcc 
cgggttcaag 
accacgcctg 
ctggtctcaa 
acaggcgtga 
ccacctgtta 
ccttcaaaac 
tgacccgaga 
ggaattccta 
agctcttgac 
gagcctagag 
agacttggct 
ctgtcaagac 
ggacttccag 
gatctgccgc 
tagggaggct 
attcatttag 
ttaaaacaca 
agcctgagaa 
agaaccaccg 
ctgcctttta 
ttcatttgta 
gagataatac 
ggtggttata 
agaagcaaaa 
ccgggctctt 
nnnnnnnnnn 
nnnnnnnnnn 
tctagaaggg 
aaggctttgt 
cttcagtgtg 
cagattttgc 
aaggacaaca 
tttggtggct 
actaaagggt 
taaatcattt 



tcccagcatt 

tggccagcat 

gaggctgagg 

ttgccattgc 

aaagatgaga 

tggagtcaaa 

gccttctctc 

ggagatgatg 

agagcttggg 

ccagctccct- 

gagttaaggc 

ggacaatgga 

aaggcagtag 

tgagaagact 

tcattctggc 

aagccagagg 

gcccaggcag 

ggaggcctgt 

tgggctaagg 

cctgagttat 

caagaggaga 

atactatgtg 

tagtgtcctc 

ctgggaacag 

tccctttttt 

aggaccacag 

ctcagaactg • 

tttgtttttt 

caggctggag 

caattcttct 

gctaatcttt 

actcctaacc 

gccaccgcac 

agatgaaggg 

acacactcgt 

gatctctcct 

ggaaagctcg 

ctgatggcag 

acacaggcag 

ctgacggagg 

aactttgagg 

tgccccaggg 

tacctattgg 

actgcggagg 

gctagtggtt 

gctttctggg 

tttgcattta 

atttaaacaa 

ttagctgtgt 

aaacgaagat 

atgaaaagtg 

attattatga 

gataggatta 

gcctggagac 

nnnnnnnnnn 

nnnnnnnnnn 

gtggtaggag 

tgaagtggcg 

gctggagtgc 

tccaaaagag 

gcaaactatt 

cactcgagct 

gatgaggcag 

gttgggttgc 



ttgggaggcc 
agtgaaaccc 
caggagaagc 
actccagcct 
aagaggaagg 
aagacttaga 
tttctctgaa 
cggagatgag 
agcactgaac 
gggatctcca 
gtgttgttca 
cataacatct 
gccccaggag 
gagcaaactt 
ctcttcccac 
cactgagagt 
gaccagtgta 
ccttaggaac 
gtgatactgg 
ccctcccacc 
aagcaatctt 
ccaggggcta 
gctaaaagac 
tttctgctct 
cctttttcct 
ccttccctct 
taacctggaa 
tttttgtttg 
tgcagtggcg 
gcctcagcct 
gtatttttag 
ttgtgattcg 
ccggcaaact 
ctggtgactg 
ggtgtcggcc 
tacctaacac 
agcgacgttg 
gttacctctt 
acctaggcca 
aggtgattca 
atgaggaagc 
aagaagacat 
tgcggactcc 
acctggggga 
ctcaaagcct 
cctcacctgc 
caacaaatcc 
aaggctttgg 
tacctgggca 
atatagcacc 
tttggaatag 
gtattatcat 
agggaggcaa 
tcagaaactt 
nnnnnnnnnn 
nnnnnnnnnn 
tggggattag 
taagtgtgga 
tgggtgtgaa 
cttggtgaac 
gaagagttta 
gccagccagg 
gaagccagtg 
tattgtgtgc 
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gctaaattct tcacgtgtat tatttcagct aatccatcta acaactctgt aaggcaggta 20160 

caatcgttcc cagctgaaga agctgaggct ctcaaaagct agtaacttgc ctaagttcat 20220 

gcagcatgca agttgtccag ccaggattct aacttagaca ccagaggcca cttttaacca 20280 

ctgctctagg actgggggaa atggtcccta gtgagatatg tgtcgagttt catatttcat 20340 

tcaacaatat tgttggcctg ctacatgtga agagctgtgg aaagcgccca aagtgagtta 20400 

gatccctatg agcaagtggg atgggggtgg agtggacagt aggagggctg gaacacacat 204 60 

aaaagggtat aagaaataac aattaggccg gccaggggtg gtggctcacg cttttaatcc 20520 

cagcactttg ggaggccgag gagggtggat cacttgaggc caggagtttg agaccagccc 20580 

ggccaacatg gtaaaacccc atctctacta aaaatgcaaa aattagctgg gctggtggtg 20640 

cacgcctgta atcccagcta cttgggaggc tgaggcacga gaatcacttg aacccaggag 207 00 

gcagaggtta cagtgaactg agattgcacc actctactcc agcctgggag acagagtgtg 207 60 

accctgtctc aaaaaaagaa aacaaaacaa gtaggtactt tctgccatag ggaggattca 20820 

taaactgcta gtcctcaggt gcatttttgc ttatcagttt taaaaatcag agaatgtctc 20880 

aaagaattag gatgtcagct tcttttgaaa atttgggcca gaagcggtgg ctcacgcctg 2094 0 

taatcccagc actttgggag gctgaggtgg gtagatcacc cgaggtcagg agttggagac 21000 

cagcctgacc aacatggcga aaccccgtat ctactaaaaa tacaaaaatt agctgggctg 21060 

gtggtgcatg cctttagttc cagctactca ggatgctgag gcatgagaat cacttgaacc 21120 

cgggaggcag gggttacagt gaaatgagat tgcaccactg cactctagcc tgggagacag 21180 

agcaagaccc tgcctcgaaa aaaagaaaaa gaaaatttgg aagatctgac aacagttgac 21240 

ctgcattcct gctcggcaac agcctgatgg tggatgggca gaggctcagt tgtctgccaa 21300 

acctcccatc actgatgtct tccctcgctg tcatcatctg cttgacatgt aggcatttgg 21360 

tgtgtgcctt ctgctctggg tgcccagatg aattggatgc tatatgagaa aacattctgt 21420 

aaatgtcttg tggtaggcaa cctcaaagat cactggggcc tccaatgatc cctccttcct 214 80 

ggtattcatg cctgtgtata atcctctccc ttgagtgtgt actacacctg gatacttgct 21540 

tctaataaac agaacacagc aagggtaatg ggatgctact tctaaggtta aattacaaga 21600 

gtgtaaagtc tgtcttgttt gtttccctct cttgatcttc ctctcattct ctctctctcc 21660 

ctctctctca ctttcttact gtcttgtcct tccctttgtt tactctgatg aagcaagcta 21720 

gcaagcatcc atgttgtgag ctgacctatg aagaggccca tgtggtggta aggaactgag 21780 

ggcagcctct acccagcaag gaactgagtc actcatcata tgggtgagct tggagacaaa 21840 

tccttcccca cttgagcttt cagatgacgg cagccctggc tgatgctttg caggcttgtg 21900 

agagaccctg agacagaaca ctcagctaag ctatacccta tctcctgaga tagagtataa 21960 

tacatgtagt tttaagctac tatgttttgg gataatttgt tactcagcaa tagataacca 22020 

atacatatac catgtacata actgtttcag ttgtctgaga ctatatttag tcattttaca 22080 

cctacatcaa gaatgtgtca ggcaccattc caggtacttg gaatacatca attaacagaa 22140 

taggtaaaga ggccaggcat agggctcaca tctataatcc cagcactttg ggaggcccag 22200 

gtgggaggac tgcttgagcc caggagttga gaccagcctg ggtaaaatag tgagacactg 22260 

tctcaactaa aaaaaaaaaa aattagttgg gcacagtggc acatgcctgt ggtgccagct 22320 

gctcaggagg ctgaggtggg aagatcgctt gagcccagga gtttgaagct ccagtgagcc 2238 0 

acggtcacaa aactgcactc tagcctgagc aacagaaaaa gaccctgtct caattaaaaa 22440 

aaaaaaaaaa aaaaggaaag aaagaaaaaa ataggtaaag atccttgatt cttgccctct 22500 

tggaacttct attctagagg gggatggttt ttcacagtag aagtctgtgt tgacagcgct 22560 

gtttaaagct ccttcagcat ctggggaaaa ggttnnnnnn nnnnnnnnnn nnnnnnnnnn 22620 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2268 0 

nnnnnnnnnn nnnnattttt tagagatagg gtcttgctat gttgcccacc aggctggtct 22740 

tgaactcctg ggctcaagca atcctcctgc ctcagcctcc tgagtagctg ggaatacagg 22800 

tgtgcaccac catgcctggc ttatttcata tatatatatt tttatatata tgtatattta 22860 

tatatataaa tatatatata atttctgtat ataaataaat aaatatatat atatatatat 22920 

ttttagagat agggtcttgc tatgttgacc accaggtctt gaactcctgg gctcaagtga 22980 

tcctcctacc tctgcctttc aaagtgttgg gattacaggc gtgagccatg gcacctaact 2304 0 

gagttatttt taccacacga agcataggac atacatccaa aaatgttctg agctgagcaa 23100 

gagcctggag gcaagtgaat ctgaactttc ccgtctttga agaaaccagt ctctctccaa 23160 

agtcacatag ttagtgtcac tccccccaag aactgcatga gctgggacaa tcagagggca 23220 

gtggaaggtc tggggctcag gggcgccccc tgctgtctcc ccagggtctg tccccttacg 23280 

caagagcctc tgctccccca ctttcctgtg gagcctcctc accatgggca tgacccagct 2334 0 

gcggatcatc ttctacatgg ctgctgtgaa caagatgctg gagtaccttg tgactggtgg 23400 

ccaggagcat ggtgaggcac cgctgaggcc cctgggggtt gggggcacag gcgggtcacc 23460 

ctggctgagc tcccctcacc atacgtttcc ctacccacag agacaaatga acagcaacaa 23520 

aaggtggcag agacaggtag ggctatgaaa gcagggccct ggctcacgcc caccccactg 23580 

caacccgctt ctcagggggc gggactcctc taggcctggg cccacccagg taaccctttt 2364 0 

gtgggatgta agagtctggg ttcagaggaa ggctattttg gtgctctctg gcctccgctg 23700 

gaaggggtga tagtgtccac tgagtgccag ttcctgaccc cactgccctt cccatcctgc 23760 

ccagttgggt tctactcctc cgtcttcggg gccatgcagc tgttgtgcct tctcacctgc 23820 

cccctcattg gctacatcat ggactggcgg atcaaggact gcgtggacgc cccaactcag 23880 

ggcactgtcc tcggagatgc caggtgacct gcctgtacag ggatggtgac agcaagtggt 23940 
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caggcagtgc 
agaaccctag 
agggacctgg 
caccttgcct 
ttgcagttga 
cagctttcag 
gggctagaat 
aggcttggca 
atcagggaga 
gctgtcagcc 
ttcaattacc 
gagcaagagg 
ataccatcac 
ggagactgga 
ctggagacca 
cccaggctgg 
agcgattctc 
gattctgtga 
aggatgggag 
agcctggccc 
ctagcagggc 
ctctctcctg 
tcaggccaag 
ctgaggactt 
cttggttctc 
tgcagatttg 
catgttaaga 
gaaacatgga 
tgttttctgc 
cagggagtcc 
ggttgctacc 
cagtgccttc 
caacttacac 
tccaaaggga 
ctggatctat 
tgggaattga 
ctctcctaac 
cattgttcga 
ttgggctgaa 
gggagggccc 
tcacttcctt 
tagaggaagc 
ggcccccaga 
ttgccttata 
cccctgctgg 
acaataatga 
aggatccttg 
gctctctttc 
tcacactcca 
tctgaccctt 
cctggattat 
tatttttaac 
tggtgtcctg 
ccctgcaagc 
ctctgccaca 
gcatctttgc 
gcccaatctt 
gcctcccttg 
tttttctcag 
ctgctccatg 
tatctccagc 
acagaataca 
ggggctcagg 
gctcagtggc 



ttttcatttt 
gagatcaggg 
cttgcttaat 
tgtttagtcc 
ggaaaccgag 
tggcagcctc 
aggtagagtt 
ttttaaaaca 
gactcagccg 
agggctgcag 
cccatggcct 
gctaagcgac 
ttctgttacg 
caaagcaggg 
ccattgaggg 
agtgcagtgg 
ctgcctcagc 
ctgcttttgc 
aaagagactg 
tggcactgag 
ccttctgtcc 
ggctggcctc 
ctgctgtccc 
ctgatgagag 
aaaccttcaa 
cagatatccc 
aagcacccgc 
gctagagaac 
tgccctgaca 
tgggaacgta 
aaatccatca 
accctgacca 
ctccaggtac 
tggtccatcc 
tttccgggtt 
ggcccagttg 
ctcttcactc 
ggtttcttcc 
atgccttcct 
caggcatccc 
aaaggcagcc 
tgaaatacct 
atggcctccc 
attttcctct 
gcgctgctga 
tgacaaaatg 
accatcactc 
cttcatctgc 
tgttcactcc 
cttgtcacct 
tgtggcagcc 
acaggggctg 
tctcatctgt 
tctggcccca 
gccacagtgc 
atgtactagt 
accttctctc 
gctgtcccat 
cgtttatcac 
ggaatgccac 
acctagagca 
gtggacagtc 
gagtttgagc 
cctgaagttg 



ctctgtgcgt 
gtacctcccc 
aataagaata 
tacaacacct 
gtacagggtt 
cagaacctgt 
ggtatattag 
acaaacactt 
gctgattctg 
tcatctgaag 
ctccataggg 
agagaccccg 
ttctattata 
gaataccagg 
tttttttttt 
cacgatctca 
ctcccgagta 
tctttttgtg 
ggagagaagg 
ccttcctcct 
aggacacatg 
agtgcccttg 
caggagggca 
acggtgtgag 
gtgtgcatga 
cgctgagatt 
tttgatgcgt 
gtgggtttct 
tcgaaggcac 
gaggcagcag 
gaccacgcta 
acctgctgct 
ccaccttcat 
agcacctgcc 
ctccaggaag 
gcctcttcgc 
tctctttttg 
actcagcctg 
gagctttgca 
ctaatgcacc 
tatcctgctg 
ggcttgatga 
ctccatgccc 
tcatgttgga 
gggactctcc 
acaaccatta 
cctgagttca 
tcaagccaga 
atccfccagta 
ccactgctgc 
tttagtctcc 
cagtcacctg 
ctcagagtaa 
gcacttcact 
ttcctagtgc 
ccctgtttct 
cttcttcagg 
gtacaattgc 
taacactctg 
ctcctcgatg 
gtgcctggca 
ctgcgaggtc 
cacttgccaa 
gagcatttgc 



ttacatccag 
attttacaga 
gctaatgcag 
ctttgaggta 
gcacagttag 
gttataaggg 
atatttattg 
ctcatctcat 
agtcacagtc 
ggctgactgg 
ctgctcagga 
tatcttctct 
ggcacagagc 
aggcaggatc 
tttttttatt 
gctcactgca 
gctgggattc 
ttcatccaaa 
aaatccagtg 
ctaccctctg 
ggccgcctgt 
attgtgccgg 
tctgcttatg 
agcttcccac 
gaatcactta 
ctgagggccg 
gtgtcattgg 
atgggtttcc 
atctgaaggg 
ccagtgactt 
ctgcaagatc 
tgtgggtttt 
ccttcccctc 
ttccaggaag 
tgtttctagt 
cctacccctc 
gttttagttt 
tgggagtctc 
accgtgatca 
tctctctgag 
taattgactc 
cactttggtt 
accctgaaga 
cagtccccac 
cctgtgtatg 
tcaagtgctc 
gcctcactgc 
accctggagt 
cagccagcag 
ctctccagtc 
cacatctgcc 
tcaggacata 
aagccaaagg 
cccctctagc 
tccggaagtc 
agaacattct 
tctttccctg 
aacctcccta 
cctatctctt 
gcaggaatct 
catagtaggt 
aatgccatcc 
tatcacacat 
cctcaaggct 



cagcttgttg 
tgaggaaact 
agtgctgact 
gatgcgttaa 
gtcattcacc 
tacacgctaa 
ctgtataaca 
acagtttctg 
tctcatgaag 
ggttggagaa 
cacagcacct 
cacataatct 
aaccctgata 
cttgagggct 
gagacagtct 
acctctgcct 
accatggagt 
cagtccctgt 
gcctccctcc 
ctcctaatgg 
cctcacccca 
agagaggaag 
tctagggcag 
ttcccacctt 
gtgggggata 
agatgaggcc 
gtaggggagc 
catagaaaca 
ggaggggcca 
cccgtactcc 
caaaagctca 
ggcatcacct 
tccctgcctc 
cgcagttctg 
agattgggtt 
cttcctccag 
gtgacctttg 
tatgctgcag 
gagaacccca 
accctctgat 
cccctgttgg 
ctatgtctgc 
aatttcccaa 
ttgcatctct 
tgatggagta 
cgttggtgca 
agcggtctcg 
ttccttgatg 
cagcttctac 
ctagccacca 
ctggccccgc 
agtctcttca 
ctttactatg 
tccccctcct 
tcaagtgtgt 
tctccagata 
actgtcctct 
ctgcccgctt 
gcttattgtc 
gttgacttgc 
tctcagctaa 
ctgttattag 
acaggaggtg 
ggggaccaaa 



ctttctccca 
gaggctagga 
gtgcacttgg 
tatcttcatt 
caagatcaca 
agtcttgtta 
aatcacccca 
acagtcagaa 
acatagtcag 
tctatgtcag 
gctttccctt 
cagacgtagc 
tactgtggaa 
gtcttgttgg 
tgctctgtcg 
cccaggttca 
cttgaaccca 
ttatcctaag 
ctgctagcag 
tgagggtccc 
gcctactgac 
cgctggacag 
ggacaccttc 
ccttcccatc 
tttgtccaaa 
tgtgaatctg 
aacactttga 
tggatttctg 
ggccaagaac 
tcagggacgg 
ccaatgccat 
gtctcatcaa 
ccgaggctcc 
gtcttctgat 
ggcgaggggg 
cctccacaca 
tcctgcacac 
tgtgagtctg 
gggaagggtt 
ggcagggagc 
agtcttccct 
tgtttgaaac 
gggcagccat 
ctcctggttt 
acaggacatt 
ggcagcaggc 
gcagagggca 
tttctctccc 
acaccccaaa 
acatctctag 
tgtctcagtc 
catcactctg 
gcctaaaaag 
ccattgttca 
tccctgcttg 
tctgcaaggt 
tctcagtgag 
ctctgcttgg 
tgaccgccac 
ttgatcgtgg 
atgtttgttg 
tggaggaagt 
tgagaaccca 
gagcccatgc 
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aaagagcccg 
nnnnnnnnnn 
nnnnnnnnnn 
accacaatta 
cctgcaatcc 
agaccagcct 
ggtgtggtgg 
aaatcatgcc 
aaataaataa 
taaaaacacc 
gggcaagcgg 
agccctgtct 
ccagctcctc 
gagccaggac 
caaacaaaca 
aaagcagagc 
tgaatccaaa 
ctctgcccac 
tgggccagtt 
tgatgaccac 
gtgggggaga 
aaagaagttc 
agaagtttag 
cctagagagc 
atctctgaaa 
cattcatgtt 
tgctctaggt 
gggaaaaggc 
cacgtgcaac 
tcaaccccag 
tcttatggaa 
gataattaat 
gaccttggtg 
atgtgtggag 
gcccaccctc 
caaactcctt 
aacggtatgc 
ggtcctttga 
ggtcccctgg 
cactttgtga 
ccaaaggctg 
tgattctgta 
catctgagat 
cagggagacc 
ggatgatgac 
ctcactcggc 
ccttgattcc 
acgtggagcg 
ggtggggaca 
gagacgcagc 
ctctgccccc 
cctcagagag 
cacctctgaa 
ggctcatgcc 
ggagttcaag 
ttagctaggc 
aatcgcttga 
gcctgggtga 
gaaacagtga 
tgtactcaag 
aacccaaggc 
gttgacgtcc 
tggaaataag 
cagagcaggg 



aacgcttaag 
nnnnnnnnnn 
nnnnnnnnnn 
cttttgcacc 
cagcactttg 
ggccaacatg 
cgcgtgcctg 
actgcactcc 
ataaatattt 
catgggaggc 
ggaggatccc 
ctacaaaaaa 
gggaggcaga 
caagacacta 
aacaaaagcg 
ttggagggct 
gttaattccg 
tccagccact 
gagggtgcag 
gaaggccaca 
ctgcctggcc 
tcttctcccc 
agtcagactc 
tatgaaatag 
tttccattat 
catgttcgca 
ttaccgggcc 
aaacccagac 
aactgggggg 
ggcaggtaag 
gaagctgatg 
gtctgctctc 
ccttttcctc 
gtaagatctg 
catgcctgcc 
tggagacact 
agtttggtac 
tatttgacag 
cttcgaggta 
ccttggacca 
aataaagcca 
ggtgaagcta 
ctttctggaa 
agctgagaaa 
agagaaagga 
cccacccggt 
tgggaacgta 
caggttccca 
ccgtgaccct 
agaaaagccc 
tgcctacaca 
acattttgtt 
aaacacatgg 
tgtaatccca 
accagcctgg 
atggtggcat 
acctgggagg 
cagagcgaga 
tcttgtccaa 
ttcacagagc 
tgtgaccatg 
cagcccagtt 
ggagggaact 
tcagaaccca 



caccaccctg 
nnnnnnnnnn 
nnnncccact 
cacataaata 
ggaggctgag 
gtgaaacccc 
taatcccagc 
agcctgggcg 
accatgtttt 
tgggcacagt 
ttaaggccag 
taaaaaaatt 
gggagaggtt 
cactccagcc 
acctgtgggt 
caagcgattt 
tctatcatat 
cacgtgtaat 
ttcagaaaga 
ccggcatcgt 
ccagacccca 
ttgtccttgt 
agaagaccca 
tctccacaaa 
cttaaaaaaa 
caaaaatgtc 
taatcgtgtt 
tgctcatgct 
gttggggaga 
ttctcaattg 
caccatctgt 
ccctcaggct 
cctaggggca 
gctcttcaaa 
caccagaaca 
gagcaaaagt 
tgcccaccgt 
aggaaattga 
caccaggttg 
aaccacccat 
gagagtcagg 
aaccactgaa 
gtcacgtgag 
gcttgctgtg 
gagcggagaa 
tcttgtgtca 
atggcagaag 
aacatgttgg 
tgcccagcat 
caaggtaaaa 
gtcacagtgc 
aattttgttt 
aataggtgct 
gcactttggg 
acaacatggt 
gcgcctgtaa 
tggaggttgc 
ctatgtcccc 
cccatttgag 
acattaatgg 
gctgtgtccc 
tctgcctaat 
gagccagggc 
caccaggaga 



cctggccagc 
nnnnnnnnnn 
gcgcctggcc 
gttaccatgg 
ccaggcggat 
gtctccacta 
tattcaggag 
acagaatgag 
gaccacctgt 
ggctcacgcc 
gagttcaaaa 
aactgggcat 
cgcttgagcc 
tgagtgacag 
aggtaggaac 
gccaagcaga 
caccacagcc 
tgacttcctc 
taaggcaggc 
agatcagctg 
ccaaggtaga 
gcccaatttg 
tcatfcccttt 
gcctgaagtt 
tacatacatt 
catttcatag 
tatcctgccc 
cagccttttc 
gggaggtgca 
aatgagattc 
cctcttgtcc 
cctgctcaaa 
ggacagagct 
cactacttca 
actgatggcc 
accttcttta 
ggaggttgag 
ggcccccatc 
acctgtccag 
cctctctgga 
gaccttgaac 
ggtttttcag 
tgacagagta 
gcccaggaga 
gtcagacccg 
agacaaaaag 
aggcgtaaga 
gcagctgagg 
tcccagccct 
caaacagaaa 
cctttagctt 
aattccaggt 
taagaaacac 
aagccgaagc 
gaaaccccat 
tcccacctac 
agtgagccga 
accccccaaa 
atgagacaat 
ctttctcccc 
ggtaataggc 
caggacaaat 
agacagtctc 
gaatttaatt 



ggggnnnnnn 
nnnnnnnnnn 
cattactttt 
gctgagcatg 
cacttgaggc 
aaaaatacaa 
gcagaggttg 
actctgtctc 
tatgtgccaa 
tgtaatcgga 
ccagcctagg 
ggtggtgtgt 
cagcagtttt 
agcaagacac 
aggctcatag 
ggtccaagcc 
ctctctgccc 
aggggcagga 
caggccagac 
tgagaggagg 
tcccaggcct 
cagatgagga 
ttctttttca 
gctggccact 
tttgcctatg 
tacgtacaaa 
cttcctggca 
tcacctttcc 
agtgctctgc 
tgtgcaaatg 
tccccatacc 
cctttttctc 
tcaaagggcc 
gttgaaaaga 
cccccaccca 
gtactctttg 
gagcatggca 
ttgcactgag 
gatccagcct 
aggtgtggaa 
gcatgtgaag 
cagtgtgtga 
cagagaaaaa 
gagggggaag 
tgggtcagca 
aaaacccagg 
gccaatcatg 
agggaaagca 
gctccataca 
aatcaatgtg 
caaaaaggct 
ttcccaagtt 
tgatcttggc 
tggtgggaag 
ctccaccaaa 
tccagaggct 
gatcgcacca 
aaaaaagaaa 
tgagacccag 
attgtcgttg 
agtgcctctt 
cacatcctgg 
cagaggaggt 
gatcatgtgt 



nnnnnnnnnn 
nnnnnnnnnn 
aatggcaaaa 
gtggctcagg 
caggagttca 
aaattagctg 
cagttcactg 
aaaaataaat 
ctgtattact 
cactttggaa 
taacacagta 
gcctgtaacc 
aggttgcagt 
tgcctctaaa 
tacagatgag 
gaggtctctc 
cagggagagt 
aaggcttcga 
caggtgaaca 
gggcaggccc 
cagaggcctt 
aaccaagacc 
cttgaggccc 
ggctcaaaat 
actccacaaa 
ggaaacttag 
cattccccag 
caggtcctcc 
ccaagggctc 
tgtcagccct 
atctgaccag 
tgcagtcttg 
acacccccaa 
agggagaact 
tgcgctctct 
taaagtgcaa 
tggctcaaag 
ctaaaacttt 
ggcataaact 
aaatgtggcc 
gggctggact 
gccagttccc 
gaatcagagg 
gcctgcattg 
ctagctgctg 
tggcctcata 
agggccatta 
gcacccatga 
atagctccag 
ggaaactgta 
cccagacacc 
tgttacgtaa 
tgggcgcagt 
cttgaggtca 
aatacaaaaa 
gaggcaggag 
ctgcacrttta 
agaaaagaaa 
ggaggaaaag 
tcccagccct 
aaccctctcg 
gaggtgaggg 
ggctctgacg 
tccactcacc 



27840 
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27960 

28020 

28080 
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tgcctcagcc 
ctcctctggc 
gatccgtaca 
ctcaacagct 
cagcactttg 
gagcaacacg 
cgtgcacctg 
aggcagaggt 
agactccgtc 
ccttccccct 
ttttcctgca 
gcctggagag 
taccgtgggt 
ggcaacagca 
agagcagcgg 
gccagcgccg 
agccgctgga 
agggatttag 
gggagaaagg 
ggcggctgct 
gagataaaca 
taacactcct 
cacacataca 
aatttcatat 
agggacgggg 
gctgctccct 
cgactgacat 
aatgtaacct 
ctagaacaga 
acaccccgac 
cctgcacagt 
cagcccactc 
tgctgccagg 
ccaaagccat 
gcagcagatg 
tccgccttca 
tccacgtccc 
agacacaggg 
acggtccctg 
ggtcgggggc 
aaggaaccaa 
ctgcgctgct 
gctcagcaca 
cctgctttca 
atggggctgc 
cacagcctga 
tgtaatatgc 
catatctaga 
attaaaatgg 
caggcagatc 
tctctaaata 
taaatgatag 
cactgttata 
aatgagaacc 
taacggtaaa 
acaaagggct 
nnnnnnnnnn 
nnnnnnnnnn 
ttccttcctt 
gcacttggtg 
ggctgtcaaa 
ataaaataag 
tctgctgaat 
ttatgaagca 



aagccctcag 
cccaccatcc 
aagcctaaac 
ctggtaaaaa 
ggaggccgag 
gtgaaacccc 
tagtcccagc 
tgcagtgagc 
tcaacaacaa 
tcccctgaag 
ccctgcatcc 
gggcagggcc 
aaggcgaggc 
gctgggcccg 
cccagggggc 
gaccctgcca 
caagccccac 
ggccctgggc 
gatccgcttc 
ggggagaggt 
gtccgagcca 
aagcctttaa 
ccactcacca 
ggttcaacct 
tcacacaccc 
ccccctcaag 
aatttattag 
ccagcaggga 
gctccatgga 
acacagcctc 
ttctcacaca 
atactgccct 
ccaggcctgt 
tcagggacag 
agggcatttg 
tcccacctat 
gggaggccag 
aagggcgagg 
tcctcccgtt 
caaagcggtc 
tggtgaagcc 
catcctgggt 
tgacaccagg 
aatcccatgc 
tccagaaatg 
ggcttcaact 
actcagtgtc 
ggcaaaggca 
cccggcgtgg 
gcttaagcct 
aaaaaaagaa 
tctgataaaa 
agtcagttaa 
aggcactgcc 
ctgccaactt 
gcttcgcttt 
nnnnnnnnnn 
nntccctccc 
ccctccatct 
aaagtttcaa 
ggagacttac 
ggccttttgc 
cagatagcta 
ctccaaagtt 



ggcaggggaa 
tccccaagtc 
acactccaca 
aggcgtttag 
gcgggtggat 
gtctccacta 
tactcgggag 
cgagatcgcg 
aaaaaaaaaa 
tggctggctc 
gtgaggcacc 
ccctcccctc 
cggccggcta 
caagactcag 
ggggccggcc 
ttggctggag 
ccggccgcca 
caagttgcac 
cttcctttag 
gaaattcagc 
gcccggccag 
cgcgtttaca 
catgtaatag 
agtacactca 
actctcccac 
atcatgttac 
tttacttgtt 
ggatgactcg 
cattcatggg 
atccacacac 
ctcacttgat 
caccccactc 
gacactcacc 
cgccaggaat 
gggagctgtg 
tcttttcttg 
ggccatgagg 
ggttggcctg 
ccccatagcc 
catgaggatc 
caggttgagc 
ggtgttggtg 
aacagctggg 
caagtgcctt 
gcagccatta 
agctcaaatg 
caagcttagg 
aaggcagtga 
tagttcatgc 
acaagtttga 
atttagaaag 
taatagctaa 
taaagtatcc 
ctccagtccc 
ggagttgtga 
actcaatgct 
nnnnnnnnnn 
tccctccctc 
tctccacacc 
ttctcctgtg 
ccaatctttc 
atctgattta 
agggggccct 
tgagacacgc 



ggcaaagtca 
actagatccc 
gagggggaaa 
gccgggcgca 
tgcctgagct 
aaatacgaaa 
gctgaggcag 
ccactgcact 
atggtgttta 
aggaaaaacc 
actgccaagg 
caaggggcca 
accccgggct 
cacgggacgt 
agagggagcg 
attacaggag 
gggagggtct 
agcagggaga 
agctgtgaaa 
caaaaccacc 
gaaccttccc 
cactcacata 
gtcaagccat 
caaacacacc 
gacatggcaa 
ccagttttat 
tattgggtta 
gtcagtcctg 
ctctgtacac 
acacagcctc 
ctagtgatct 
actctgccct 
gtgagtgaag 
atcaacggag 
ggaagccaag 
tggggccaca 
gtgcaggacg 
tgagcacccc 
cagccacctc 
cccagtggca 
atctcgtcct 
ctgctctcag 
cacaggagac 
tgggggtacc 
gtacctgacc 
aaatactgga 
gggttgtgga 
gtggtactct 
ctgtaatccc 
gaccagcctg 
aacactaaaa 
tacttattga 
cgttccctag 
ctctagaagt 
caagttaagg 
cannnnnnnn 
nnnnnnnnnn 
cctcccttcc 
tggtatcatc 
tggagaggtg 
cagcccacca 
agtaggaagc 
ggaatctgca 
ctcaaaggtg 



ggatgccctt 
acagctgaga 
ctgagactct 
gtggctcaca 
caggagttcg 
aaattagccg 
gagaattgct 
ccagcctggg 
aacacatata 
tctacccact 
acgccaggga 
caaacgctgt 
ggcggccttg 
cctcgtccaa 
gggagaggct 
gcggggacat 
gaggtcaaga 
aggggctgcg 
tgtccccggt 
cagtcaggca 
ctccaacctc 
aataaacaca 
gtgcacgacg 
taccaactca 
gcgtgcacac 
tttcttccca 
tctgtgcccc 
attgtgctgt 
acaaacacac 
acacctgctc 
gcgtccacag 
caccccactc 
tgaacgttag 
acagagctgg 
ggcgggagct 
agaggacaga 
cagtgaagca 
ccctcccctc 
acctgccaac 

gggtggtggc 

gctggtcaca 
ctgaggaggg 
agcagcccac 
ctagagtcac 
ctgggagagt 
cataaaagta 
cccccaacaa 
aatggctata 
accactgtgg 
ggcaacatgg 
cttagaggaa 
gcacttaact 
gtgatgaagc 
ccacttggag 
agaaaagcta 
nnnnnnnnnn 
nnnnnnnnnn 
ttccttcctt 
atacagaagc 
agcactgagg 
atcccttgcc 
tgattcctga 
ttttagcaag 
gagtggttct 



cgcacaca cc 
aggaccttag 
gaagggaggc 
cctgtaatcc 
cgaccagcct 
ggcgtggagg 
tgaacctagg 
cgacactgcg 
actaaattat 
caggcagagg 
aggctgccag 
ctgcgcccag 
cagcgtgcgt 
gtctgggcca 
gaggggcggt 
agcagggagg 
gccggagaga 
cagaggggcg 
tggaattaaa 
gcccttctca 
cctaagcctt 
ctttgagcaa 
aggtgtcgac 
tggctttcac 
gctatctcaa 
gcacctatga 
tcacccccaa 
agtccaggac 
acattaacat 
tttgcagcca 
gcccctcccc 
gggggaactc 
gcagatgcca 
aaaggggaaa 
ggggtaaaca 
caactcacct 
ggcactgtgg 
cccctgcagc 
cagccgcacg 
gctgagcacg 
gcctggccac 
ggaagggagg 
agtcaggcgg 
atctcctctg 
cttgtgcaca 
tttactaagt 
gaagtgcccc 
acaagaattc 
gaggctgaga 
taaaacccca 
gctttcccga 
atgctccagg 
tgaggcacag 
gacttgtcct 
gtgataggag 
nnnnnnnnnn 
nnnnnnnnnn 
ccttccttcc 
agagaggact 
aaggggtggg 
cagtgtttct 
gcccctcaga 
cggaggtggg 
gtggggggca 
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gaaaggaaaa 
tggaaagggt 
caagccagac 
cgaagcagac 
cgtggagacc 
ctcatccccg 
gtccatcact 
ttctacccca 
cctttgctat 
ctcgcccctg 
gaccccgcca 
ggattgcgcg 
cgcatatatg 
aggcttcgaa 
cgcaccctct 
ccagctcgct 
cagacgccct 
ggccagaggc 
ggctgcagat 
cgggcagagc 
accctcggag 
cgcacgtggc 
actattcgca 
agccagccga 
aaaaagtcag 
ctttgccaag 
tctctcttat 
aatgctgtct 
actcttggcc 
gctatccttt 
cccttgggtg 
caacaccaac 
atcctaaacc 
tgtcccagag 
ctcttgtcct 
ttatttattt 
aaggaagctc 
actttcggca 
tctaaagcat 
aaatcctggg 
gttgggaaca 
ctagttaatg 
caccaaccgc 
taagacttgc 
gacccccaac 
agggctgtgt 
gcacatggga 
accgctagaa 
aacccctcta 
gaaggttttt 
taaatctacg 
cttttggttg 
gcggtcacct 
tgctggagcc 
aatagcagga 
agtcatctgc 
agccagaata 
ttaatgttca 
agggcctggt 
cactgtagga 
tctgaccttt 
agacacagtt 
gacacgcggg 
gcaaataatg 



tgcaaagggg 
gatgccaggt 
cggggaggag 
acttgcaggg 
ctggagagcc 
ggcaccagcg 
cccccggcca 
cagtccccag 
ttgcgatccc 
ggagggcggc 
ccccgtcccg 
gcagggtctg 
cccccctttc 
attcctcgtt 
cccctcccca 
tccgcccctg 
tctcctagga 
aagcttcgag 
aacgcttgca 
aagccccggg 
ccagccgcag 
gcttaccccc 
aattgaggat 
acccgctggt 
tgttggaagt 
tctccagacc 
taacttctca 
tacagagcca 
ccaggagcag 
ccacagactg 
tgtgccttaa 
ctcttaggct 
aatgttagtt 
gcacaagcag 
cactgatggt 
actttaaagc 
aaaatttgca 
ggccctaggg 
tctgtcctaa 
aagtaggggg 
gggaggtgag 
cccagctgag 
caaagacagg 
aagcagcagc 
ccctaaccca 
ctctctcaca 
cctcaggggt 
ggagataaga 
ggatagggac 
tttcctcctt 
gccatggctc 
ctcaggcctt 
cagagccgct 
gggcacggta 
ggcccagatt 
ccttagcctc 
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agttaaaaaa 
ttatttcatc 
gaaagatcag 
gcagggctgc 
aagggtcccc 
ccaatggtgc 
gtttactaaa 
actctgggag 
aacatggcaa 
gtagccatca 
caggaatcag 
aacaagacac 
atgtaatccc 
ccatcctggc 
ttagctgggc 
aatggcatga 
gcctgggcaa 
aaaataacaa 
tcaggggact 
cttggaaacc 
gcaccacata 
aaccctcgct 
tggctgctga 
aaaaggggtt 
tcagcccagg 
nnnnnnnnnn 



aagtatgcat 
acaattcttt 
cgtcagggag 
ctcgtgcagg 
tttcaggggt 
ctgaacacag 
aacagactcc 
gccaaggtga 
gaccctgttt 
cctgtggttg 
aggctgcagt 
cttctctaca 
agcactttgg 
tagcacggtg 
gtggtggtgg 
acccgggagg 
cagaatgaga 
taaattaaaa 
ggggaatctg 
tgcagattag 
caaggcccta 
ctcacccaga 
agcaaggcga 
ggatgggaac 
caagaacttt 
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aagaaaaaaa 
tcaatgtttg 
aaaagtttat 
aaaatatctt 
ttttattttt 
tcttggctca 
aagtagctgg 
ttgaaatgga 
tgcaacctcc 
attacaggca 
tcaactaaag 
cccacctcgg 
tacagtgttt 
actactcact 
agtgccctat 
tctatgatta 
agtactatgt 
atactgtcta 
atttctctga 
ctcaaatttt 
ggcgcaatct 
gcctcctgag 
ttttattaag 
ggtgatctgc 
cagccattac 
tgtcatcaag 
gctcaaggga 
atgctcagct 
ctcaggtttg 
ctgggattac 
ccacttatgt 
ttgcttaagg 
tatgtgtgtg 
cacaaacatg 
gagtacttgt 
atgccctgaa 
ttcctgcaca 



aatcttaaag 
tatttcaagc 
aaagtaaaaa 
tgtataaact 
ttgaaatgga 
cggcaagctc 
gattataggc 
attttgctct 
acctcccagg 
tgcaccacca 
agaaccatgt 
cctcccaaag 
ataaagcctc 
cactcactca 
ataggtgaac 
gctacacaaa 
atgggtttgt 
gtttgtacac 
gtgtattcct 
tttttctttt 
cggcgcactg 
tagttaggac 
agatggggtt 
ccgccttggc 
gtttttttgg 
gctggagtgc 
ccctcctgcc 
aattattttt 
tctcaaactc 
aggtgtgagc 
tttcaagtta 
gatctgcagg 
tttatgtacg 
taattataag 
gccagtctgg 
gtacggtagg 
agctcagctg 



catgcttagc 
gggaggaggg 
gcagatgagt 
agtgggtgcc 
gctggagaca 
gcccaagagt 
taggaggtca 
gaagatcatt 
atctgtacaa 
cagctactca 
gagctgtgat 
aaaaaaaaaa 
gaggctgagg 
aaaccccgtc 
gcgcctgtgg 
cggagcttgc 
ctccgtctca 
acaaaaacag 
tatttttaac 
agacctctga 
cccctgcccc 
agggctctcc 
acacagcact 
ctgtccccaa 
ttcttttcct 
nnnnnnnnnn 
nncactccag 
aataaggata 
caagtgttat 
agctaagtaa 
tagtgtagtc 
gtttcactct 
tgtctcctgg 
acccgccacc 
ttgacccagg 
ttcaagagat 
cactcggcta 
tggccaggct 
tgctgggatt 
cagtagtgta 
cccagagcaa 
cattttttaa 
tgcttaccat 
agcctaggct 
tctatcatgt 
gttgttaagc 
gatacagggt 
cgacatctac 
tacaggcacg 
ttgccatgtt 
ctcccaacat 
ttgtttaatt 
aatggcacaa 
tcagccttcc 
tatcttttat 
ctgggctcaa 
ctgaaaacct 
agcaaggttt 
cctgacctcg 
tgcatgatgg 
cagggataaa 
tgatcaggaa 
agttggttag 
ggcactgagg 



acatagtgag 
tttacgtgtg 
tcaatgtaag 
ctaccactga 
ggatcccaca 
caggactggc 
ggcactgtgg 
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gaaggctggg 
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aacaataggg 
agggcagatc 
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tcccagctac 
agtgagccga 
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actcctacgg 
actccgtgag 
ggtgcctctg 
caggagagtg 
tttcaggggt 
ggtgagggac 
aacgggagat 
tcccaacatg 
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cttgggtgac 
taaagaaaga 
tacaaaacag 
gctagggtta 
taagtgtaca 
tgttgcccag 
gttcaagcga 
atgcatggct 
ctggagtgca 
tctcctgcct 
atttttgtat 
ggtctagaat 
gcaggcatga 
cagcaatgtc 
ctgccagtcc 
tattttatac 
tgtgttacaa 
ataccatgtt 
ttgcataaag 
aacacatgta 
cttgctttgt 
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ggctaggctg 
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aggtcactta 
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ttctatgatg 
caatgtcctt 
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ctagaccttc 
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gcctacgtgt 
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taaacattta 
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caatcgattc 
cctggctaat 
gtctcgaact 
caggcatgag 
taagagacag 
ctgcagcctc 
gactacaggt 
gggggggtct 
tgctttggcc 
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tgaagcctga 
gttttaacag 
cagaaatcaa 
aagaagtatg 
tgggaagtga 
gtaagaccca 
ctctgctaca 
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agtagctggg 
gacagggttc 
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ccagccagta 
acattcactt 
atgcatgata 
ctgcaccttt 
taatcagtac 
gtagtcgtct 
ctaatgacac 
caagaaatag 
ggagtgcagt 
tccggcctta 
ttttttgtat 
cctgacctca 
ccaccatgcc 
attctcactc 
caactcctgg 
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ttctatgttg 
tcccaaaggg 
taggcccagg 
ctagttttgt 
tgtctatgtg 
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gagcagaaag 
agaggctgat 
gcaagactcc 
gcagtctcag 
gcagtgcgcc 
caaaactgaa 
ggctgaggtg 
aaacccgata 
gaggctgagg 
gcaccactgc 
aataacaatt 
ggtcacttta 
atacgacagt 
ctgccctgcc 
ttctcagtga 
atggaatgca 
atgttcatgg 
tggggctcag 
ggaggtttat 
gcaagcctca 
gtctgtggtc 
catgaagcgt 
accaatactg 
ttgagaaatc 
tggctcacgc 
tgagtttgag 
agccaggcgt 
tcacatgaac 
ctcttggcaa 
gtagactttg 
acacctccag 
gtgtactttt 
tcagtccctc 
cactcaatac 
gagcctcatt 
tctccgagtt 
tgggtggctt 
aatatggctt 
gaggtgcaca 
gggaggcctg 
gctgcagccc 
tctgtctctg 
ctcttccaaa 
ctgcgccact 
gcctcttagg 
ctagttactc 
aggccaagcc 
gtgggggtgg 
gcaaactcta 
acaggaatca 
gaggggtata 
ggcacatctg 
tcttgctctg 
cctcctgggt 
acaccaccac 
ccaggcttgt 
tggtattata 
gggataatgt 
ataagttttc 
caagattcag 
actaggagag 
gctccaattc 
ttcaagttct 
ctgtttgatg 



gctgaggaag 
gggggaaaat 
atccctgtaa 
ctactcaaga 
acgattgtgc 
tgaataggct 
ggtggatcac 
caaaaattaa 
catgagaatg 
actccagcct 
aaaaaaaaat 
gtaaagggag 
catggaaatg 
ttggccactt 
gtagctctca 
taaaagatgg 
agccctggga 
aggaggaagg 
ggagaaaggg 
actctgtaga 
ttggggacat 
atctcttcgc 
aagatcttta 
aggacaagtg 
ctgtaatccc 
actagcctgg 
ggtggtgcat 
ccaggaggcg 
cagagccaga 
tgtttttctc 
tcttgcctca 
tgattgtcct 
cccatcccga 
atcttgcatc 
atagccgttc 
tcgcttgtca 
gctttatctg 
aatggtggac 
agttagaggg 
tccagctgct 
aggtgttctg 
acttgtcaac 
agccttgggc 
cacctgctct 
tatcacttcc 
cttggaggta 
cagcctaaag 
caatggagtt 
gagaaccgtg 
gcatcccaca 
gggaggagcc 
tgttttcttt 
tcgcccaggc 
tcaagtgatt 
acccagctga 
cttgaaatcc 
ggtgtgagcc 
gcattttgtg 
ccccagggag 
cctctcaaaa 
ggcaagtgag 
ctagtgatag 
gagattgtta 
acgtcccaca 



gctgggtgtg 
cggtagagct 
aaagctttta 
ggctggggta 
cactgtactc 
gtgtgcggtg 
ctgtgattgg 
ctgggcatgg 
tcttgaaccc 
gggagacagc 
taaaaggcca 
aacaatggct 
ctagggccca 
cctggccctc 
tttagtgcca 
ggctgtgata 
gggagctcag 
acttgttcaa 
tgaccatcca 
cttggtgggg 
tgtcgctccc 
cgtcccccat 
gtaaagttct 
agtcagggcc 
agcactttgg 
ccaacatggt 
gcctgtaatc 
gaggttgcag 
ctacctctca 
aaaagcactg 
gggtcagttt 
aaatccagag 
atcccaggga 
ctcgctggac 
gttcctgttg 
gcatttctcc 
aaattttcag 
cctgtcaggg 
ttagacaagt 
ccacggttga 
gctgccctag 
taatggcact 
ctctgactgc 
ggcgctaggc 
cctgatccca 
tctgcttctc 
gacgcttcct 
ggggggggct 
taaataggag 
gcccatgttc 
agcagggtct 
tattttattt 
tggagtgcag 
ctcctgcctc 
tttttgcaat 
tgacctcaag 
actgcgctcg 
gaagcttggg 
gacactgttc 
tggagacagc 
tggtgtttgg 
gaaaccatta 
acacacctta 
ctgtgggtac 
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aaaattagct 
aggattgctt 
caacctagga 
gctcactcct 
gagtttgaga 
tggctcacat 
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gggagcactg 
cctcccagga 
ggcagaccat 
tgcatgcccc 
gggctctcgg 
gcccagagct 
tgcaagttca 
agacacacag 
aggcttggac 
gccaggccct 
cttcctgctg 
ccttgctgag 
cttttttttc 
aggacaaaaa 
gaggccaagg 
gaaacctcgt 
ccagctattc 
cgagctgaaa 
aaacaaaaac 
tcaagccagt 
ccagcctccc 
tctgtggcct 
gccgcaggtg 
ccaatccatt 
acctttccag 
aatcccatca 
atttgacttc 
gtagagaaaa 
ccagccacaa 
gggtggagca 
ctgcctggct 
atgagattgc 
aacatggagt 
gtgtgcctaa 
aatacttacc 
accggggact 
acatgacttc 
cagggagggg 
tgattattct 
agctatgaag 
tgagttcata 
tgaatttaat 
tggcgcggtc 
agcctcccga 
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gcagcctgtc 
cctctgggat 
ctcagggaaa 
agggtctcag 
gcttacatcc 
aggggtttga 
tttctctttt 
ggagtgtttc 
aaagatcatg 
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atgctctgct 
agaggatggg 
attttctgaa 
acagtgtggg 
tggcggatca 
ctctacaaaa 
gggaggctga 
ttgggccact 
aaaaacaaac 
gcccgcagca 
tggacacttc 
gacctggttt 
tgtgcagaag 
ggcttggtga 
atcaatctgc 
tgtactttgg 
aggtctctcc 
tattgaggag 
ccagcccaag 
tacaggaagg 
ttggtagaag 
acataattaa 
ctgggtatca 
tcacttaatt 
aggtgtggga 
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aggcttgcgg 
atgtggaagt 
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ttagtgccct 
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acagggtttc 
gcctcggcct 
tgttttaaat 
ctaggactct 
gtcaggaccc 
ctgggctggg 
agtattatga 
catcttaaaa 
gccgttatta 
aggaatttgc 



cagggctttg 
gagcaacata 
gtatgcatct 
ggtggacgct 
gatcctgtct 
cacttttgga 
caatatggtg 
agctactcgg 
agctgagatc 
aaataataat 
caaggtttca 
ctcagcattg 
acaagtggct 
caccaagctg 
tacgatgacg 
atctcatgag 
ttggttgaga 
agtgtgggac 
acttcgacca 
cctgacaggt 
gtccctctcc 
ttctcttctg 
agtccctctc 
acgagtgtgg 
cttgaggtca 
tacaaaaatt 
ggcaggagaa 
gcactctggc 
gacaaacagt 

gtgggcct^s 

ccccaggtat 
gtcacagctc 
aggcacacca 
tgtacagact 
cagcttggct 
acctctttgt 
tttgtccctt 
ccctgacttt 
ctgcagtgta 
cttccttctt 
aaagaaaggc 
cctgggtctg 
ctccccatcc 
tctctgtgct 
tgacacctga 
acgaaaagca 
gggctggagc 
gctttgcttt 
tttctttcca 
gaggctccgg 
ttcctccata 
tttggcagag 
gcaatctccg 
ttacaggtgt 
acagtgttgg 
cccaaagtgc 
gagaggaaag 
tatgatcttc 
ccagtcctta 
ttctgttcac 
aaacagagat 
ccaagagcgg 
ggaagaagct 
tttttcaaat 
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cccagagaag 
gaattcaaga 
gttcagagct 
ggtagagctc 
tggctcatgc 
gagtttgaga 
agccaggcat 
ttgcttgaac 
ctgggcaaca 
acatactgtt 
cagctatata 
cctcctcagc 
tctaactttt 
aggcatgatg 
ttgagcccag 
aaaataaatt 
gagaggagaa 
gcactgcagc 
cgcggtggct 
gtcaggagat 
aaaaaagaaa 
ttgggaggct 
gatcgcacca 
ggaaaaagaa 
tatcacatct 
atgcagtatt 
catttgggct 
cactacttct 
cctgatcata 
cttttttgtg 
ggtggaaggg 
cacctggttg 
tatgtggtta 
gacaacaaaa 
nnnnnnnnnn 
nnnnnnnnnn 
ggacatttgg 
actagtatct 
ggcctctgta 
agtaggcgct 
ctggtgtgat 
agaggtggag 
gatccaggtg 
tggctggatg 
aggggtgaag 
ctgagaaagt 
gtgtgtgttg 
ggagaagcat 
ggagatgaag 
atgacataac 
atccccaata 
aatggccaca 
atcactactg 
ggggtagtca 
acaagactgc 
ccacttgctg 
taacctccaa 
atggttggcc 
gtctgttgag 
actttaagtg 
acttctgagt 
ttatttatca 
ataaatgaat 
gaggctcaga 



taagattaaa 
atgactgacc 
ctcccaatag 
aggaatctaa 
ctgtaatccc 
ccagcctggc 
ggtggcagac 
ctgggaggca 
gagggagact 
acaggcacag 
actaccacct 
agaggtaaca 
ctatcaatta 
actcatgcct 
gagttggaga 
agctggtcat 
tctcttgagc 
ctaggtgaca 
caagcctgta 
ggagaccatc 
gaaaaaaaaa 
gaggcaagag 
ttgcactcca 
aaaatatata 
gcgaaattaa 
ccgtcgtgtg 
gttaccaggt 
ggtgagcgta 
aggtaggtgt 
tatttctggt 
aagcaggagt 
gcatggagct 
gctccatgca 
gctgatgtgg 
nnnnnnnnnn 
nnnnnnnnnc 
taatgtctgc 
ttttggtaga 
acaaaaaagt 
aaaacataag 
tagctctggg 
gagggtggta 
aaaaacggtc 
agatggtgaa 
gatgaccaga 
gacca caaaa 
agttcagtct 
cttgggtgac 
acttgggaat 
ccgtggtggg 
tttaaccatc 
aaggctactg 
cagaggtcaa 
gtgggcacct 
tatggtaaga 
gaggagcttg 
gaaacacaaa 
cctagaaatg 
cccaggctaa 
atcctgacaa 
atctcctcaa 
gctaccttaa 
gaatcaatcg 
aaatacctgt 



gttggctgtt 
atacagaatg 
tcacccctga 
aatccatttt 
agcactttgg 
cacatggtga 
acctgtagtc 
gaggttgcag 
ctgtctcaaa 
accttaagtg 
atatcaagac 
gacccacacc 
gttttgccca 
gtaatctcag 
ctactctgga 
ggtggtgcgt 
ctgggaggtt 
gagtgagatt 
atcccagcac 
ctggctaaca 
ttagccaggc 
aatggtgtga 
gcctgggcga 
tacattgtgt 
tctacactgt 
actacgggac 
tctggctgtt 
tgcactcatt 
gttggctttg 
tagagcggaa 
cggtttctgg 
ggcttctcct 
atgaacccgg 
atttaaaggc 
nnnnnnnnnn 
cagcagtggt 
agacattttt 
ggctagagag 
atccagtcaa 
gagactgtgc 
ttttagaaag 
agactggagg 
agcaggtgac 
gaaagcacta 
gtcctgcctt 
agtgaagcag 
gagatgtgtt 
catatgtgtg 
gatctgcgta 
tgtgcagaat 
tggaagaata 
agaagggaag 
gtaggataag 
gggcaaaagc 
9gaggaagag 
gctttggtgc 
atcatccata 
ccatcccact 
gcgctacctc 
cactgaaaat 
tatattgcct 
acctccctgc 
atgatccaga 
agaatcgaaa 



ctccatcctt 
gggagcaaac 
actgcacccg 
aaaattaaag 
gaggccgagg 
aaccccgtct 
ccagctattc 
taagccaaga 
aaaaaaaaaa 
tacagcccaa 
acattccagg 
tctcctgctc 
ttcttgagct 
cactttggga 
caacatagtg 
gcctgtagtc 
gaggctgaag 
ctgcctcaaa 
tttgggaggc 
cggtgaaacc 
atggtggcgg 
acccgggagg 
cagagtaaga 
actttttggc 
gtgtatgaaa 
aatttgctta 
atgaataaag 
tcgcttatgt 
taatgtgctg 
catgagggtg 
ctcacacatg 
ttggcgttgc 
gcttctgcaa 
ttcagttcan 
nnnnnnnnnn 
tctcaactga 
tgattgtcac 
gctgctaaac 
aaatgtccac 
ctgagagcaa 
ctcattttgg 
cagggaaagt 
taggaaagtg 
taactaacta 
gcaggtctag 
gtttgtgcgt 
ggactcacaa 
agtctgcagc 
tatatttggt 
taggagagac 
agaggagcct 
cagttcttaa 
aactgaagaa 
agttttggtg 
ggtgttgagg 
aaagcagaga 
tcctggctca 
tctcctctgc 
ctcaagcaag 
gtgtgtctct 
tgttttacta 
aactagagat 
gcctggtaga 
taaatgcatg 



gaaaaatttg 
ttgggaagaa 
gaccatcagt 
tatatcgggg 
tgggaggatc 
ctactaacaa 
ggaaggctga 
ttgtgccact 
aaaaaattaa 
tgaaatttta 
aactcagact 
cggtggtaat 
tcacacagat 
ggccgagacg 
agacccccga 
ttagctattt 
tgagccgtga 
aaagaaaaaa 
caaggcgggc 
ctgtctctac 
gctcttgtag 
cagagcttgc 
ctctgtctca 
atctggttta 
ggttggttct 
tccgtattcc 
ttgctatgga 
aaatatcttg 
acttggttat 
tctcttcagg 
ttgtgactga 
ctactgttgg 
aatacattaa 
nnnnnnnnnn 
nnnnnnnnnn 
gcatagtttt 
agcccagccg 
atctaacaat 
agtgttgaga 
gaaggagtaa 
ctgcttgtag 
aatttgggag 
gcagaggcaa 
atgtgtggat 
ttggaaggtg 
gtgtgtgtgt 
tgtccatggg 
tcagaaacag 
agcttgagcc 
gtgcaccaag 
gccaacagaa 
gaagggggaa 
tgtctgttgg 
gagcaatagg 
aagtggccag 
agccagctca 
aattccagca 
ttatcctatc 
ccttctctgc 
tccattcatg 
atatgctcgt 
tctctttaag 
ggcttgtgtc 
tgtgctctga 



gttttagggt 
agaaggcaca 
tatctctgtg 
ct ggg c 9cgg 
acgaggtcag 
tacaaaaatt 
gtcagaagaa 
gcactccagc 
agtatgtcat 
cacatctata 
ccatcatacc 
taaccactat 
atacattgtc 
ggagtatcac 
ctctacaaaa 
gagacgctga 
ttgcaccact 
tatggccggg 
ggatcacgag 
taaaaataca 
tcccagttac 
agtgagccga 
aaaaaaaaaa 
ttttgctcaa 
ttttgttgtg 
tatcggtggg 
tattcttgta 
ggtggaatta 
gctgaattcc 
gaatctggag 
actgctggta 
ggcaggtgtg 
caacgacaga 
nnnnnnnnnn 
nnnnnnnnnn 
gcctcagagg 
agaaggtact 
gcacaggaca 
ggtttaggta 
ttggaaagtg 
acagtgcatc 
ccactgaaat 
tggggatggg 
gatgggcagg 
atggtttctc 
gtgtgtgtgt 
acatccaagt 
gcctggggct 
acaagagtag 
aagccaggtg 
attgggaggg 
gtgaagaggt 
gtttggcaat 
gataacagaa 
cgagtctaca 
ctcattgact 
ctaccaggag 
ctatctgtca 
ctgccgtcac 
ttagttctac 
tctgtttgcc 
tatttgttga 
catggtggat 
tctaaactca 
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gctaaacttt 

tatgtaatgg 

acaaggtcct 

cagtaagaaa 

tgtgtaacaa 

tcggagaagg 

agtggagctg 

gacaagagcc 

actatgtggg 

aaatatggtc 

ggaaggtaga 

actgagtgtt 

ttcctgagga 

ggggccttcc 

aatggggaac 

gccattcacc 

gttggccaag 

gtcacttttc 

ctggcttcct 

tttctcagtg 

cccgttcctg 

agggagatgg 

tttagcagct 

cccagcatgg 

ggctgtgtgc 

tctgttccct 

tgctctcagg 

tgaagccagt 

cttcaggaag 

ggcagttcct 

aatccatcat 

tgcctgtcaa 

gttcttccag 

tcaaagaatc 

taattgcagg 

ctgcagggtg 

ctggagctgt 

aggggcatcc 

tggtgcacac 

cagtaccagg 

cctaaagttg 

ctgtcaggtc 

ctccaagctc 

cagtgatgag 

gggtatcgtt 

ttgaaattca 

agagcctggg 

caaaagtaga 

ggtagcattc 

gtcagggctc 

caggactgaa 

tggagccaaa 

gaccttgtgt 

agtgcctctg 

gaggccttct 

tgcttcttct 

tattaggaac 

tggaatccaa 

acatgtgaat 

acttcataaa 

gcagctatta 

agccaccctc 

gctaagctgt 

tgagggtccc 



ctccaggggg 

aaaaactcct 

tgttgtcatg 

gcaagattat 

aagtaccaaa 

cttatctgag 

ggaccaggat 

tgatcatgaa 

aagactttga 

gcaggggcta 

agatggggca 

catggtctct 

agcgggaaga 

agatctcaca 

tgttgtccag 

cttgcagtgg 

cccagttaac 

tgagggtggg 

atggtctgag 

tccatcttac 

tgggaccctt 

gccttgaggg 

tcaaacaact 

ctaggttttc 

tcatctggag 

ggagttgcag 

tcctcgaggc 

gatggagaat 

agagccctgt 

tttccttaaa 

atttacagtc 

aagggccaag 

aaaaaaatca 

tcatggaggg 

gttcatggtc 

tggggaggga 

ggtggttttt 

caagagctcc 

atgtttttct 

tctgaccctg 

ggatgagccc 

ccctttggaa 

tgtagtaact 

tcaacatttc 

ctcttgctca 

agtcctgagc 

caagtgggcc 

catcagtctc 

atttactaag 

ttaatggcag 

aggaaacgga 

gtcaggctgc 

ctggcaccca 

tgggtactgc 

tggcctccat 

ctttgataga 

ttttgtcttc 

gaaaccacag 

tatctcattc 

tgatgatatg 

agtgctagac 

tcagacactg 

ggatttcaga 

tgaacagtca 



taaagttcaa 

tctatgacct 

aagtttgctt 

atcattttgg 

taggagagtg 

gaggtggcct 

gacaagggtc 

agactttgca 

atgggaagca 

gtgaggtggc 

ggggaggcaa 

gggagcagag 

acccatctga 

tgccaatcgt 

gccgatttca 

cgttagctct 

aagcagttga 

gagagaatcc 

ggagagaatg 

ctctctgctg 

ctctgccaat 

ctgtattagt 

catgtttatt 

tgctcagggt 

tttagggttc 

ggctgaggtc 

tgcccacatt 

ttcccttgga 

ctcttttatg 

gtcaactgtg 

ccagggatta 

caggacttta 

ctcagtaaag 

tttggacctt 

ccttgaagcc 

tgcaggtgga 

ttagtcttta 

ttggtattga 

cctgttgcaa 

gacagcttgt 

cccgagaatt 

gtgctggtat 

ttagggctag 

ttggggaagg 

aaggcactgg 

acaccacagt 

ccttgtgagt 

ttgaaagttt 

caattactaa 

gcagcagaaa 

gtagctcatg 

agaacaggtc 

ggatgttgta 

aactccagga 

ctttttggtc 

gcccagtcat 

aaaaatgaga 

gaaggggtga 

atttctcaca 

aggtacagaa 

ccagtcaatt 

ctgtatactt 

tcctggattg 

actggtttcc 



gttgattagt 

gggcagagtt 

tctgaaggag 

taaatgttct 

gggtgggtgg 

tttaaccagt 

acttgtcatg 

gatggtggta 

tggggacaat 

agcagagata 

ttactgcaaa 

gttcctggag 

gacgtgggga 

cttggtctat 

cagaacaacc 

ttcaccagct 

tcagccccag 

tggagcagaa 

gtgggatctc 

taccttcgtt 

ccctacaccc 

cttctattct 

agctcatcgt 

ctcacaaggc 

tcttccaggc 

ctgttttctt 

gcttgccacg- 

ttgaatcacc 

ggatcacctg 

gcatgtaaca 

tgcacagtgc 

ttggtgaaga 

ttagaggttc 

caccctagaa 

tattcatagt 

ggtgagggct 

agctgtcatg 

caccatctcc 

cagcccactc 

accaggagct 

agatcagaaa 

ggagaggatt 

aaaggaggat 

aggcagggct 

accccaaggc 

tgtgatgcag 

accgttcaac 

gattaatgct 

ataccagttt 

ctctccttgg 

aaattgcagg 

tagggaggat 

gggctcagac 

actcaatctt 

actagctcca 

atgcgttagc 

ggcctgtcct 

ggtgactggg 

ctacccagtg 

agtttaagga 

tgagtctgac 

ccagtgaatg 

tatgacctaa 

aagaatgggc 



caattgatta 

ataggcagtg 

agagataata 

tgtggaaata 

gcttctttta 

acaaatgctt 

ccagtgagtt 

atgggtttgg 

ggcctgtgat 

gggagaagta 

gacatattcc 

gggaaagagg 

ctgtgctggt 

gtcaattgtt 

gcccagtcca 

tttatctgcc 

agatcaggtc 

catgtaacta 

tggcctgaat 

attttccagc 

actgtaaatt 

gcataacaaa 

gagttcatca 

taaaatcaag 

tcacgtggtt 

gctgactgtc 

tgcgtggtct 

cacatggttg 

attagatcat 

tcacacaacc 

accaggggac 

acagtggaat 

tcttgccttt 

acatcacacc 

ttccaggttg 

gaatagtgtg 

tgttgggggt 

aaggtgatct 

ttgtagaaga 

acagcacact 

agattaaatg 

gactgagtct 

gcctaagatt 

gaggattaaa 

ctccagctct 

ggaaagaatg 

ctcatttatg 

ggtcacactc 

ctgtgctaaa 

ctgatctaag 

aagggccgga 

cccactgctg 

cctggatcaa 

gtcaacgcca 

gattcaaaat 

tgcaaaggaa 

ccaccaagat 

cagctcacag 

aggtaggtat 

acttgcccag 

ttggactgtc 

ttgatgaaat 

aagagagact 

tccctctcat 



attaattcat 

aacaagacag 

aacaagaaac 

aatgtgatga 

gaaagagttc 

tagcttggcc 

tgagcttgta 

gttaattgct 

acatgttatc 

gacggactgg 

ttctaagctc 

ataatgtcac 

tcgtttctaa 

ggggcatcca 

tatctcccga 

ccgtggggat 

cctggagtct 

gaagggccac 

caaacctccc 

agctcctcag 

tcaccgtggg 

ttgcctcaaa 

gcagtgtggg 

atgttgtctg 

gtggcagaat 

agatgagggc 

tttccatcct 

gactctctga 

acccatagag 

acaggagtaa 

aactgaattc 

gtcattcttg 

tgggaagtca 

atgttttcta 

aaaagctctg 

agctgcatat 

tgggcatggg 

ctgctctgcc 

gcagacccct 

cccccacaag 

cagaggtgat 

gtttaggaac 

caggatcctg 

cggagatgat 

tcgctcccat 

tgcttatcag 

tcattggcac 

aaagaccctg 

tgctgcatca 

tagaaaaatc 

aaaccagaca 

ctgagaccta 

tgtatcctgc 

ccgccagaga 

cttgaataga 

gctgaaaatc 

ccataggaaa 

catgcatgct 

tgtcatccct 

gacacgacac 

tgactccaga 

tttcagggtt 

tccctaggag 

caccttatga 
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cagtaatcct 
cgcagagccc 
ttctgtcttt 
aaatgcatcg 
atgaaggaaa 
tcaagctgtt 
ccaggtagga 
gacttccttc 
tttgacctca 
tgcttcactc 
atgggagggg 
ttacgtgagt 
atgtgccggg 
caccatcacg 
aaatgactaa 
atccaataag 
ttggaaaatg 
taatacttgc 
tctcgcccaa 
gttcaagtga 
atgcctgact 
gtctcaacct 
caggcatgag 
atgttaccat 
aagttagaca 
cccatcttcc 
cgctagacct 
ctttaataag 
aatgcaaagt 
cccctgcttc 
catcaatggt 
ttgtaaaact 
tcacagaaac 
taacattctg 
attgttttca 
tataggcgca 
ttttttattt 
accagtaaaa 
gcccaggata 
gtatcgggag 
ggtccttatc 
gttcctccaa 
tttgggaaga 
gtggagctgg 
agggctggca 
tcatcctctg 
ggagatggtg 
taccttctaa 
ttatgtcata 
tttacttccg 
gtttagaaag 
gcttttctgg 
gtcccctaat 
ctgtgttctc 
tttggggcag 
ctgttttttc 
tcgtcttcta 
tgaaaggatt 
aattttgtcc 
ctctcttagc 
ggtggatgat 
accaccttga 
ccttcccagg 
ctggggagga 



ctgtccaaca 
agctcaaagc 
ttaaatcaat 
ttcctcgttt 
ttacagtgtc 
ccttctactt 
atcagctcct 
tgaattagca 
atttactcat 
tgattttatc 
agagaagttt 
acgttactta 
cactgtacta 
aaggaagaca 
ctgataagga 
ggcagtgttg 
gggtatcaac 
catgaacagt 
gctggagtgc 
ttctcctgcc 
aatttttgta 
cctgacctca 
ccactgcatc 
aatgaaggct 
tacgtgaaca 
atcacagtca 
attagccaag 
gcagcctcaa 
tataaaaaga 
tggaggcaac 
cacataagga 
attcacatac 
atctcttcag 
tagaatagag 
tttctacaaa 
tttatgccta 
taataggtat 
caggggagat 
agaaccattg 
agaaatcttc 
agccttgcta 
ggcctctata 
tctaaagacc 
tgacagttta 
ggcgtctacc 
ccttgcacat 
gagacatgga 
gtcctatgtg 
gactctgagg 
ggcctgaagg 
aggtccttgc 
cccatcctga 
ggtgtgattg 
cgttgcagga 
tttctgctct 
cctttctatg 
tttctgaatt 
tctcaaggcg 
tttctccatt 
atcgccatga 
ggaaccagta 
ccttgtcagg 
cagagttgag 
acaaggggag 



gccaaagagg 
tcctgactag 
ggagacaggg 
ttctcattcc 
ctgtgcatat 
gaaatgcctg 
tggttcatgg 
gaggatgttt 
cttacaaatt 
tccacctgga 
ggtgccagaa 
accattaatt 
agcaccaagg 
gatgttaatc 
caagcccctg 
cggggagatc 
cacctacctc 
attttgttga 
agtggtgcaa 
tcagactccc 
tttttagtag 
aaagatccac 
cagccacttg 
ggtaggttgg 
tttcctgtac 
gctgggaagc 
cagaagtttc 
aattacagat 
agataaacaa 
aaggtcaact 
tgctcacaga 
taaatacttt 
gtttatagat 
aggacacatt 
tagtcaagga 
aaggatagtc 
ggacagattt 
atgtattttg 
cccagagctg 
actgaatgaa 
tctccctctg 
taagaagtct 
caggaaggtc 
ttctcccagg 
tgtggctttc 
tccttctgcg 
ggagggtgtg 
ttcctaggaa 
gacccattaa 
aaagccagcc 
gtcctggatc 
tttctattca 
tgatgtgtgt 
taaagccaag 
tcatctaagt 
gtagatgggg 
tctgtgtgct 
tctgttatgt 
atgcttactt 
agtaggtggt 
cccaagccca 
cagcttgggt 
cactctgatg 
atgtgaagga 



tcctgtgggg 
gctcttgttg 
gagggttatc 
ttcccaatgt 
accaaggctg 
tttccttccc 
agccttttct 
cctagcttgg 
aggttgtaag 
tagcatcatt 
agaactggat 
acttcaatgt 
atacaatggt 
cattaaccaa 
gagctacaag 
aggagccaca 
actaggtttt 
ttgatgattg 
tctcagctca 
aagtagctgg 
agacaaggct 
ccacctcagc 
ccatgcatgg 
ccaactgagt 
ttgaagatcc 
ttttaaaaaa 
tgggcagggc 
tcagcacgca 
caatctgtct 
atttggtgtg 
taagcaccta 
cctcagtatc 
acaggtccag 
ttactcagtg 
ataacataac 
tcaaaagagt 
gttctcaaaa 
gaaaagcacc 
ttccagatgg 
tgagtgggct 
acagaggcaa 
ttgtgagagg 
tctgggtggg 
aggtccctgg 
aggttatgag 
ggatgtgaaa 
ggtggcttct 
agccaataat 
gtacaaacaa 
tcagccaccc 
ctggagcatc 
ggccttcttt 
gcatgtgtgt 
atgaaactcc 
aagtgttttt 
tcagagttac 
taaagggatg 
ctgtggtctt 
tgagcttact 
attgtatacc 
tgactgcccg 
tcccctcata 
cccagggcaa 
aggacactta 



agggcttgca 
agtattcctt 
tccatcctcg 
gtgtactgtt 
tccaacctcc 
ttctaattgc 
gctctgtttt 
tcttaaccct 
ctaattgaat 
tttgacacac 
ttgaattcta 
atatttatta 
gagtaaagag 
gtaatctcac 
agggtgtata 
cagagcctgg 
taaaatcagg 
attgaaacgg 
ctgcaacctc 
gattacaggc 
ttgcaatgtt 
ctcccaaagt 
catttaaaaa 
ggtctgattc 
tcaggacagt 
atgcagacat 
atctgcatat 
tttaccataa 
cctgctttct 
attcctttta 
tgcgggggtt 
ttgccttttt 
ctcttctttt 
tccgattgat 
tctgtaaaag 
gaaactgatc 
tgtttgtggc 
caaggcgatt 
cccctgggtt 
ccccagggaa 
actctctctc 
aagcaaagaa 
tgagtgcttt 
ctgtggctga 
gatgtcagca 
gtgctccttg 
tgaactctga 
cattgcttct 
ataagcgtaa 
ctcagggttt 
aggagctggg 
ttctccacct 
ctgtgtgtgt 
ccttacttct 
tgccttcagt 
acacccaccc 
gggactctat 
ggttctactg 
gagtgccttc 
catttcacag 
actctaagtc 
gagactgggt 
ggtgtgagct 
gctatcctcc 



gatgggagtg 
tgattcctgc 
gctcaagatg 
aactttagtt 
acacctttgc 
atctttccat 
actatgcatg 
tctccttttg 
acaggatcta 
aagcaggcat 
accctgttgt 
agtacctact 
atgcagcctt 
aagaaaagta 
cagggcatcg 
gttgtctcac 
ttaaatgagg 
agtctcactc 
tacttcctgg 
agccacccct 
gaccaggctg 
gctgggatca 
tgttcagtaa 
agaaggaaag 
gactcctaga 
ctgaccttca 
ttttaaaaat 
ccactgaaga 
tccctctcct 
gcattccctc 
ttttttttcc 
tcacttcatg 
catagccata 
ggatatcaat 
ttttattact 
aaatgtgcat 
agttcaaaac 
ctgaagtgta 
cctgaagtgg 
gtgatgaaat 
cctgggggaa 
ggacctgggc 
ctctgctgtg 
cagtttctgg 
ggggcagcct 
gctggggaaa 
ggaggggaca 
cccgcctttt 
tagtcccttc 
gctgcgttct 
cttggcatga 
cactcccacg 
caatgacaaa 
ggctcttcta 
ctttctttct 
ccttctttga 
ggccaggagt 
tgacattccc 
tctcctttaa 
aaatacagct 
catgctctta 
tccaggttcc 
gtctgtggtt 
ctgccagggt 



54720 
54780 
54840 
54900 
54960 
55020 
55080 
55140 
55200 
55260 
55320 
55380 
55440 
55500 
55560 
55620 
55680 
55740 
55800 
55860 
55920 
55980 
56040 
56100 
56160 
56220 
56280 
56340 
56400 
56460 
56520 
56580 
56640 
56700 
56760 
56820 
56880 
56940 
57000 
57060 
57120 
57180 
57240 
57300 
57360 
57420 
57480 
57540 
57600 
57660 
57720 
57780 
57840 
57900 
57.960 
58020 
58080 
58140 
58200 
58260 
58320 
58380 
58440 
58500 
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ctgagacttc cacctttgag acccctttgg gtgctaagac gctgcctgag gatgaggaga 58560 

caccagagca ggagatggag gagacccctt gcagggagct ggaggaagag gaggagtggg 58620 

gctctggaag tgaagatgcc tccaagaaag atggggctgt tgagtctatc tcagtgccag 58680 

atatggtgga caaaaacctt acgtgtcctg aggaagagga cacagtaaaa gtggtgggca 58740 

tccctgggtg ccagacctgc cgctacctcc tggtgagaag tcttcagacg tttagtcaag 58800 

cttgggtgag tggcctatgg ctgaggctga ggtgggagca tggaacgggt gtgggatatg 58860 

cccccagcat tgctatcact ggctcttttt cccattgagg gccctggggg tgtcagtaga 58920 

acctgagcct cagagaggtg ttggggtaag aggggagggc cacctacaaa cagaagttgc 58980 

attttggtct ccaaccttca aatggttgtg gcaggggagg gagggaatga attgtgggga 59040 

ctcaagaccc atgtgaattc atgtaggaag gatgctccat tctttgtctt ttatcctgcc 59100 

ctgtagttta cttgccggag gtgctacagg ggcaacctgg tttccatcca caacttcaat 59160 

attaattatc gaatccagtg ttctgtcagc gcgctcaacc agggtcaagt ctggattgga 59220 

ggcaggatca caggctcggt aagagaagtg tgaacactaa atggggtgca cctgctgatc 59280 

tcagccagca ctcagcttgc atcagatttg tctgtttttc tcctgtataa tctccagaag 5934 0 

aaccagggat agatggacac ccacagacaa cactgagggg gctgcctggg cattcaggga 59400 

agagctaagg atttagaatc aggaggtttg ggtccaagtt cctttccatc tctcactatc 594 60 

tatgtaactt aagttagctg ggcatggtgg tgcatgtctg taatcctagc tacttgggag 59520 

gctgaggcag gagagtcact ggaacctggg agacagaggt tgcggtgagc cgagatggag 59580 

ccattgcact ccagcctggg caacaagagc gaaactccgc ctcaaaaata aataaataaa 5964 0 

taaataaaat aaaaaaaaaa ttaaaacaag accatgagtt tgtttcctca tctctaggat 59700 

gagttggcaa cccttgttct accttttgtt agggctggaa ggacaagcct gtcactggga 597 60 

tgcatagaat ctgatggtga taattgccgt ggatcagcat ttcagatgac taggacagtt 59820 

cccatcatgg tccagcaggg aagggcccat tgcccggtgg gcagcagaaa gagctggcag 598 8 0 

atacggggcc aggtctgctt ctctgccttc cctctgcccc atcccttctt cccctcttgc 5994 0 

tttctccagg gtcgctgcag acgctttcag tgggttgacg gcagccgctg gaactttgca 60000 

tactgggctg ctcaccagcc ctggtcccgc ggtggtcact gcgtggccct gtgtacccga 60060 

ggtgaggtgg ggctggggat gaacgatgga aaggtctggg agatgggaag tgccccaagg 60120 

aggagatgct acaaagagcc tgaccctttg tgggagaggc ttcctgggtc ttttatatac 6018 0 

tctgactcca cagcagtgtg tgggtgggaa aagaggccct' cctgtgggtt gagttgggat 60240 

ggacaagagg ctgaaagtcc ctttctgttc tgccttcaca ggaggccact ggcgtcgagc 60300 

ccactgcctc agaagacttc ctttcatctg ttcctactga gctggtccca gccagcagtt 60360 

cagagctgcc ctctcctggg cagctgcctc ccctcctctg cttgccatcc ctccctccac 6042 0 

ctccctgcaa taaaatgggt tttactgaaa tggatttatt ttctcctctg atcgcggatc 60480 

cactctgctt agccctcatt gaaacttctt ccttatcatc tctccccaca ccacaacttt 6054 0 

catagaagtg tcagaagcta ctactccttg aggaggagga tggagggtgg agttgggtct 60600 

atggagcctt ttggagatgg aggaatgggc tcagctagtt ctcttcatag aacacctgat 60660 

tactgggcac ctgcatagtg ctgccaggac ctttcaaggt tgtaggtaga ctcccaatgg 60720 

cccagtttgc atctctgtaa ccaaaggcct tttctctctc tctctccaac cccagaactg 60780 

tggttggttt tatatgtaag gaagttaaca tgtccctggg aacagtccac aacattcagg 6084 0 

aatgaatgta taagtaccgc aatccccggc ccctcaagtg gaataaatct aacatgtatt 60900 

gggcaccatt tcccagtggc ctgctgtggt agttggcctt attccatgca tttttatggg 60960 

ctgccttccc ttcctcaact gcattctctg ctccttccta ctctctgcaa ctcccaaata 61020 

aacacttgta cgcaactccc tctctcagga tctccttctg gggaaacctg atataagaca 61080 

gcttgccatg cgtcagactc tgaatgaggc ctgggaatac aagacatagt cctctggcac 6114 0 

ttgggatata tggttatttg taacataggc acaaaaacat ctactagttg ttatcgctta 61200 

ttgagcaccc acaacatacc ccctgctgtg gcaggcacct tgcctagatg acctcatgtg 61260 

atcaataatt atgagcccta ttttacagaa ccaggctcag agaagttagg atctgtcaaa 61320 

agacttgccc aagactgaac ctctaaatgc aactcatatt gaaattcaac tctgctccaa 61380 

agcatgttac tttaaccctt gtgcttttac agctggctac tctcccctta tggtcacacg 61440 

gggatgaagc acggggggag gaaagccaga ctgtctcact cttgggttca tcttgggaca 61500 

caggacacca gcccagctgg aggtgaggga gctttaatca gaggggaggg aggaaggcat 61560 

tctcaacccc ttctgtacta gggaggtcag cagaagaaaa taattcaatg ttctaaagcc 61620 

atttttttct ccagcattcc tccaattcat agatcttcat atgggattag gggctcagag 61680 

aggggtgaaa caagaactct atttttttgg agtgtggtat agagaaggga tgctacttct 61740 

ctaaggtcac atagtaagtt gagaaagaga gagaaatcaa actcaggttc atttcaacta 61800 

ttgttccaca agaatctgtt gatttcaaag atggtggact atgggttcat ccctgtggtg 61860 

agtgctgtga ggatgcagct gaggtggaac tttcactcct tgccctcttg gactttatat 61920 

tctggtgtgg aaaggcattg cttcccttat ttcaatatta acaacaaagg gtaataatat 61980 

ttcccattta ttaagcattt actaggtgtc aggtactgtg ctaaatgtta ggtgaacttt 62040 

gtcttgttcc tcataaatct ctgccgctgt gggtgtgtac tttgacagaa gtttgacttc 62100 

cagtccacag agatcttctt tgggggagta atatcaagaa ggggcacgaa ggaagctgca 62160 

gggctcctag tcccatcctg tatctcgacc taggcatgtt tacattggtg cattcactgt 62220 

gaagtttccc tgagcagtcc actctatagt gtgctttata ggagcacatt gtacatccat 62280 

tgaaaaattt ttcttggccg ggcacggtgg ctcatgtctg taatcccagc actttgggag 62340 
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gccgagacag 
aaccccgtct 
tcccagctac 
agtgagccga 
aaaagaaaga 
tgggctaata 
ctaagctctg 
cactgtgcag 
caagggccac 
gatnnnnnnn 
nnnnnnnnnn 
agagttatgt 
gttttcttcc 
ttttcccaaa 
gtttcttgta 
attgtgtagg 
gagactggga 
cagaatcatg 
gaagaatcaa 
tcacaagaat 
cccacccaca 
cacagccaaa 
acactattct 
gcagggacct 
gttgccacag 
cagtagtgtt 
cctgtcagga 
gagtcagtgg 
agcattattg 
gtttctgttg 
cttcgctgac 
ggttcctcca 
ctgctgtgcc 
accttcatcc 
acccttgcgt 
aagctggagg 
actgggtttg 
ttcatgcact 
ctcgtattca 
tgtcttccct 
tgtctggcat 
ggtcctgtct 
gtctgagcag 
caggggagct 
gggtccctga 
tttggcaagc 
cccatcttca 
aggatgggga 
gtgatggcct 
ttggggcaaa 
ctcacgaaac 
caggcccacc 
caggacctct 
ctctaggagg 
ttacgtgttc 
tactgcacag 
gtttaaatcc 
cacgatcttg 
ctcgatctcc 
ggcatgtgcc 
gctatgttgc 
tccaaaatgc 
atctgaggcg 
gctggtgcct 



gcggatcacc 
ctactaaaaa 
tcaggaggtt 
gatcgtgcca 
aagagatttt 
ttccaaaata 
tgaaactgtt 
agcacagagt 
cctcactcaa 
nnnnnnnnnn 
nnnnnnnnnn 
tcattgattt 
tattattatt 
ggtaaaactg 
ccccttttcc 
tgtatataca 
agaaaaagag 
gcaggaggtg 
aagtggaaac 
agcgtgggaa 
atacgtggga 
ccatatcaat 
gcagagtttt 
ctcatgcctt 
cattcctgta 
acaaagagga 
gggtattgca 
ggttggagag 
atgagtttga 
ggacagtgga 
ttctgttggg 
ctttggcggt 
tgggcttcgc 
tgcaagtgat 
aagtggcctt 
taaagaaatt 
ggggcaggtg 
tgagccttta 
tcctctgagg 
tctttgtgcc 
gccatccatt 
gtttttccag 
agctgaagag 
ggggacaagc 
agctcactgg 
tctttgggct 
ccctcatcaa 
tccctggcag 
gggtatgtag 
agtatttgtg 
ttcaagcaaa 
ctggcctctc 
gtgctttcac 
tggtcgtgag 
agaggcacca 
ctctatttgc 
atcacttgga 
ctttgtcacc 
tggactcaag 
accatgccaa 
ctgggctggt 
tgtgattaca 
aatgacttga 
ctaccagctt 



tgaggtcggg 
tacaaaaaaa 
gaggctggag 
ttgcactcca 
ttctttttct 
tgggtttgtg 
tatctatggt 
aggggctcaa 
ctcatgtgcc 
nnnnnnnnnn 
nnnnnnnnnn 
cattttattt 
ttattttatt 
tgaaacaaga 
tcttctttag 
tgtattagtc 
gtttaattgg 
aaaggcactt 
ccctgataaa 
agactggccc 
attctgggag 
acatttccct 
tttttttttt 
ttaaccactg 
ctgatgacta 
tctggctaca 
gtaatccatg 
aagtgggcac 
ggtggggcat 
tggatagtgg 
ttggcctaca 
ggccctctgc 
cctctgtgcc 
cagccgctcc 
ggggcgggct 
gggaggcaag 
tggagggggt 
catctgcggt 
cccacttctg 
cttagtactc 
tacatgcctg 
ggccccgcat 
ggaaaggcca 
ggccgacagc 
aacccctctg 
ggtgatggcc 
aggctccctt 
gaggcctggg 
catggtgcag 
tgtgtgggga 
atcctgagac 
ccctaggccc 
ctctgtgtcc 
tctcctgctc 
gctaaggagc 
ctcatttttt 
gaaaagaaag 
caggctggag 
cgatcctctc 
gcttatttta 
tttgaattcc 
ggcgtgaacc 
ttgtgacatc 
cccatgtgac 



agtttgagac 
ttagccgggt 
aat^-gcttga 
gcctgggcaa 
taaaaagtaa 
tgtgaatttt 
tctttatcat 
tcaacagtgc 
aggcatagtt 
nnnnnnnnnn 
nnnnnnnnnn 
tgtttctgat 
tgtaaaacat 
tatattcaaa 
gtaaccattt 
tgttttcatg 
acttacagtt 
cttacacggt 
cccatcagat 
ccatgattca 
atataattca 
ctcttttaga 
gatgtaactc 
cctgggtctc 
tttggattgt 
tgttcagggt 
actgagttaa 
atttgaatga 
gtggggaaag 
ctcctcccct 
gagagcttct 
tcgacggtgc 
tcagtcccca 
ttcctctatg 
ctgtggagac 
gcggggcctg 
gagaccagag 
gccctctccc 
tttcagttct 
tagtcttact 
tcttttcttt 
gtgccttctt 
gacaggtgtg 
ccccagaggc 
gtttctctcc 
ttgtcggctg 
cagaatgacc 
ccttaggcct 
cttcccaaag 
aacagacatt 
ctcaaaggtg 
acacagggtt 
ttacacctgg 
tttgcagaaa 
aaaagtcaac 
atttttaaag 
actaaatgtt 
tgcagtggca 
atctcagcct 
ttttattttt 
tggcctcaag 
actgtgcatg 
aggtgaccta 
cttgaacatg 



ctgcctgacc 
gtggtggcac 
acctgggagg 
caagagcgaa 
aaatcatgaa 
cctctccagt 
tgaatccttg 
actcattgct 
ctgagagctt 
nnnnnnnnnn 
nnnggtcact 
ttttaaagat 
ttacatatca 
gaagtttact 
ttattttttt 
ctgctgataa 
ccacatggct 
ggtggcaaga 
ctcgtgagat 
gttaccctcc 
acgtgagatt 
taaaaggtag 
tatcctgagg 
cattacatgg 
ttccagtctt 
ggggaggggc 
tggtagttta 
tatgtaggag 
gattcgagga 
ttttccaatc 
ttttcctctc 
cttcgctggc 
tcctccctct 
ggagcaacgc 
ggacacactg 
gaggcagtca 
gtggtgggaa 
tctgttttct 
ccagggaaga 
tcctttgcta 
cctggtgcag 
ctgggttctg 
gttggagggc 
caggcttctg 
ccagtttccc 
tggtgtctct 
cattttacgt 
tggctgcccc 
ggtctgtgtt 
ctggagtagg 
tttcctgctt 
tccacagttg 
agggatgctc 
ctgaggctca 
tttgaattct 
cagcaaatct 
ttttgttttt 
caatctcggc 
cctgagtagc 
ttgatagaca 
cgatccaccc 
gccaaaagag 
gtaatcagct 
tcattgaatg 



aacatggaga 
atgcctgtaa 
cgaaggttgc 
actccgtctc 
ataaggggac 
aagatactaa 
gagttcctta 
ttttcataga 
tgcttaagct 
nnnnnnnnnn 
aatttggtat 
tgttttactt 
gacatttaca 
ttccctctct 
aaatataaac 
agacctatct 
ggcaaggcct 
gaaaaatgag 
ttattcacta 
cccactgggt 
tgggtgggga 
tatactgtat 
gtgctctgta 
ctgcagcata 
ttgctattac 
agatgtgtag 
aagctaggat 
gtgaatgatc 
tgactcccag 
ttccttggcc 
tgttcgccca 
cctgacatcc 
ccagtacctc 
ggccttcctc 
gggcaaagag 
ggtgcgggag 
ggatagaaca 
acctggtgaa 
aatggaaaag 
gtgcgtgcat 
cctgcatgag 
tgggtcaaat 
aggcctagga 
cttggaggga 
ttcagagcac 
gctccagttc 
gagtactggg 
aaatctggct 
attcaagtat 
gtggggaatt 
gtggtgagtg 
gccccaggga 
tgaggtcctg 
aagaggttac 
gtgttttgac 
tagaatagga 
gttttggaga 
tcactgcagc 
tgacactaca 
ctggggtttc 
gtctctgcct 
taaacttgaa 
gtgtattcta 
ctcgctaggc 



62400 
62460 
62520 
62580 
62640 
62700 
62760 
62820 
62880 
62940 
63000 
63060 
63120 
63180 
63240 
63300 
63360 
63420 
63480 
63540 
63600 
63660 
63720 
63780 
63840 
63900 
63960 
64020 
64080 
64140 
64200 
64260 
64320 
64380 
64440 
64500 
64560 
64620 
64680 
64740 
64800 
64860 
64920 
64980 
65040 
65100 
65160 
65220 
65280 
65340 
65400 
65460 
65520 
65580 
65640 
65700 
65760 
65820 
65880 
65940 
66000 
66060 
66120 
66180 
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ctctgtttct 

agaatgaaaa 

gaatataaac 

ttcattgagc 

ctgcaggtga 

gtatatcggg 

cctcactttt 

gtcccaaaag 

taaagacaca 

tatcacacag 

ccttgcactt 

tgtcaactat 

cacgggttaa 

tctccctcaa 

acagggttcc 

cctggtctgg 

aaagttcttc 

ccccaagggc 

ttggggatgt 

ctctcgggaa 

atctctctag 

gtatgtttct 

agtgtggcag 

cctgggatgt 

cacatcctta 

gctggatgga 

agccccttaa 

atctctgatg 

cacatgaggg 

gagaagttga 

cacctgctgt 

gggggatggg 

aatgtttatt 

tgcagaaggg 

aggggccagg 

cgtttaaatc 

cccactgtcc 

tgatctcaac 

ctgtggacca 

gctcaccctg 

atcttctcca 

acacccttcc 

gcagcatact 

atccacatgg 

ttttggcctc 

gtgtggtgca 

ctctgtgact 

tctgggctgt 

gatcacagct 

agttggtgaa 

agtactgaca 

ccagacacaa 

tggctcatgc 

aggagttcgt 

agtgtagtgg 

tgagcctggc 

tgacacagtg 

ttccagaata 

gtggaggcca 

gggtggactt 

atgcctggat 

gggaagatgg 

tctgccctga 

cactctgtta 



ttatctgtga 

gtaactgaaa 

tcggtagact 

aacagttctg 

atgtgatgtt 

aatgccgtac 

cagccccgag 

actttgccta 

caaggacgtc 

actcaaagca 

agccttggca 

ttctttaggc 

attttcaggt 

gagtgcagcc 

ctgggcccag 

tctcagaatc 

ctggagcaga 

catgggagcc 

tttcaagaag 

taattttata 

gctttgggcc 

gaggaggccc 

ctggacccag 

gggcccagag 

actaacttta 

atcagaaatt 

ccactctgag 

gcgattgtga 

tttgctcctg 

gggtgcagga 

ctcattccca 

ggtggaaggc 

gagcacctgc 

tctgaaaccc 

agcctgggtc 

taatggatgg 

acaggacaaa 

ctctcccctt 

agattcccag 

cctcagcttg 

aaaatcatta 

tggtgactgg 

ccaggaggca 

taccagctaa 

agaaccaatt 

gtggaaaaaa 

ttggatggac 

tgtaaggatt 

agcattacag 

cggacaaagg 

cctgctacaa 

gaaactgcta 

ctgtaatccc 

gaccagcctg 

catgcacctg 

aggtggaggt 

aggttccgtc 

ggccaatctg 

tgggtagtga 

atggtatgtt 

tccccaggaa 

gcgagaggag 

gagccttcct 

ccctggctgg 



aatgggcttg 

gtccttcctc 

tctggtccct 

ttgttcagaa 

catgcttgcc 

ttggaaagaa 

gatggttttg 

tcccagcaaa 

tgcgcagcaa 

aaggcatgtg 

gacccttgac 

ctgtttggct 

gaagagtgag 

ttggctagag 

tctcagccca 

acttttccca 

tgtcctagaa 

agctgacctg 

ggatataagt 

gaaaaataaa 

ggggctgtgt 

taccctggca 

aggtggggca 

acttcctccc 

gcagaaccct 

cctagtttga 

tcactagttc 

gaatgtaaag 

tgtttactca 

ggagagatgg 

ccttgctagg 

agggggtggc 

cacgtgtcag 

agctcttcct 

tgagggcgga 

atttggtctt 

aatccagatg 

ccctctttac 

tctctgggtc 

tgaaatccat 

ctcaggcttt 

catttgcctc 

aggtctgttc 

ttcttgttga 

agaaggagcc 

acccttctgg 

cacttctttg 

acaaattagg 

atgatatatt 

aaatgtacca 

cacggatgaa 

atgattccat 

agcactttgg 

gccaacatgg 

taatcccagc 

tgcagtgagc 

tcaaaaaaaa 

tagaggcaga 

tggctaatgg 

aattatacct 

gtgtacagta 

agtaagtggg 

cctgcaactt 

agtgcagtgg 



atattcctcc 

cagggcacca 

tcattggtca 

ttcctggatt 

attcttctga 

agtccctctg 

ttcatcttcc 

acacacacac 

gaaaagaatc 

gaacttcttt 

tccaggggag 

ccgaagccta 

gttgtcatgg 

aactcacagc 

gccactgatg 

tctgtaaaat 

ggttttagga 

cttgaccgaa 

tatttacttt 

gcttctgtgt 

gtgtgtgtgt 

tgagagggta 

ggaaccctga 

ccaggaaccc 

actcctcact 

gtttcaattc 

cccacctgca- 

tcattgcaac 

tgacccttgg 

caggtgggtg 

agagggacta 

aggcaggttc 

gccctgtcct 

tcttcctgtg 

caccaaagtt 

gttaccctgc 

ctagcctggc 

ctcctaccaa 

tgtgtgtgca 

ctggtctgct 

ccagcatgtc 

cacatcatga 

tcgcctggct 

atgaatgatc 

agaaaaacac 

aaatctcagc 

tcagtatggt 

tcgagtgctt 

aaagccaaaa 

tatttgggat 

tcttgaaaac 

ttaaatatga 

gatgccaagg 

cgaaaccccg 

tacttggttg 

caagatcgtg 

aaaaaaaaaa 

aagtagattc 

ggaaggggtt 

caataaaact 

gacttctgtg 

agaagtgacc 

tatttattta 

tgtgatctca 



tctaccccaa 
tagtgtctgg 
tggaatggac 
tcacctcact 
cattcttcca 
caattgcata 
accacctttg 
acacacacac 
tcagttgcca 
atttcaaaac 
atgacctggg 
tatgtgcctg 
cctcagctat 
tctgggaaaa 
ctggatgacc 
tgagatgaat 
atagtgacag 
ggatttctga 
gggcatttaa 
ctaaggcaac 
gtgtgtgttt 
gggaatctgg 
ctatgattca 
ctctgcttcc 
acacaccccc 
tgcccctcag 
aagtgcagtt 
tgcctagcac 
ggaggacggg 
ggatgggaga 
ggaaagaaca 
atccatccat 
gggtgctggg 
gatgtcgggg 
ctagtggtgt 
tcaaaagctt 
attcaaggct 
cagcggggca 
ccagttcctc 
gggatcctgc 
tgagtccctg 
ccctcccacc 
ctaattaatc 
gttgaatgag 
atgggggtgg 
tctgtcactt 
gggagaaata 
ggcatgtggt 
aaagatgcct 
attatttggc 
attagactaa 
aatatcgggc 
tgggcagatc 
tctctactaa 
gctgaggcac 
ccactgcact 
ggaaaaagaa 
atgattgggt 
tcttttgggg 
gttatttaaa 
agaatcagaa 
acgtgcgcaa 
tttattttga 
gctcactgca 



ccgatagtgc 

gtgaaaagta 

cagtgcttgc 

tctgctctcc 

cccctttctg 

gttcagaagc 

aggacctcgt 

acacacaaaa 

agcagattga 

agaagtgtct 

ggaggaagtg 

gatcctctgc 

gcttcctggc 

agaggagcag 

ttggcctgac 

tttggtgttg 

agtcaggcca 

cagactatct 

aagaaaattt 

tactgtttcc 

gtgtgtatgt 

ctacacatct 

ccccgctggt 

tcttcctctc 

agctagaagc 

cagctgggca 

aatcatttct 

atggtaggag 

ggcaaagagg 

atctggggca 

gtgggaggca 

tcattcaaca 

gctataaaga 

tgtaatttcc 

ctattagcag 

tcagcagctc 

gtcactagtg 

gagcccaccc 

tgcgtgggtg 

tcaaaatgtc 

gcacttggtc 

ccttgcctgg 

tgtgcttacc 

tggattcttg 

gggaggtgca 

actttgtcag 

gacatgcctc 

gggttgaaca 

aatgtccacc 

aatcaaaaaa 

gtgaaagaag 

cagggtgcag 

acttgaggcc 

aaattagccg 

aagaattggt 

ccagcctgga 

aaaaagaaat 

aggcctgggt 

tgatgaaaat 

ggaagaaaag 

atgatttctg 

ctctcatcgt 

aacaggttct 

gcctcgacct 



66240 

66300 

66360 

66420 

66480 

66540 

66600 

66660 

66720 

66780 

66840 

66900 

66960 

67020 

67080 

67140 

67200 

67260 

67320 

67380 

67440 

67500 

67560 

67 62 0 

67680 

67740 

67800 

67860 

67920 

67980 

68040 

68100 

68160 

68220 

68280 

68340 

68400 

68460 

68520 

68580 

68640 

68700 

68760 

68820 

68880 

68940 

69000 

69060 

69120 

69180 

69240 

69300 

69360 

69420 

69480 

69540 

69600 

69660 

69720 

69780 

69840 

69900 

69960 

70020 
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gccaggctca agcaatcctc ctgtttgagc tcctgagtag ctgggactac aggcgcatgc 70080 

caccacatct ggctaatctt ttatttattt atttatttat ttatagagat tggggagtct 7 0140 

cactctgttg ctcaggctgg tgtcaaacgc ctggactcaa gtgatcctcc caccttggcc 70200 

tcccaaagtg ttgggattat gggtgtgagc cactgtacct ggcacctcct gcaacttctt 70260 

cctcaagtgg aaccaatgag gaagcaagca actcagagct ttcacaagtt ttgatttcaa 70320 

tcagcaacgg gcttccaatg caacccttct ctcctgtaac cagcctcagt agagaggaac 70380 

tggaggtgaa ttggccccca tcacaccccc acagtgccaa gctgggccct tccatcaggg 70440 

ggagaacaca tgccgtgtaa gggacagcca acagcataaa ataggaattg tgtgatgatc 70500 

ccttttaagc ctattcagcc cagggaagtg catatgatca gccccatttc atagatgaag 70560 

aaagtcaggt tcacccatta gcacattgtg gggctggtat ttaaaccagg tctgtctggc 70620 

tcccaaggtc acattcattt agacattacc tttactttac atttcttctt cttttcttct 70680 

tcttcttctt cttcttcttc ttcttcttct tcttcttctt cttcttcttc ttcttcttct 70740 

tcttcttctt cttcttcttc ttcttcttct tcctcttctt cctcttcttc ctcttcttcc 70800 

tcttcttcct cttcttttct tcttcctctt cttcctcttc ttcctcttct tcttcttctt 70860 

cttcttcttc ttcttcctct tcttctttct tcttcttctt cttttttttt tgaggtgggg 70920 

tcttgctcta ttgcccaggt tgaatgcagc atcatcatac ctaaatgcag ccttgaactc 70980 

ctggccttaa gcaatccccc tgcctcggcc tccaaaagtg ccaagatttc aggcatgagc 71040 

caccatgccc agcctgcatt tattctcttg taagaaagat atcatttaaa acagacgaga 71100 

aaataaagag ggacatgaaa aagacgcatc accattaatt ggaccactca gagataatca 71160 

tggttaacat gttggtatgt tccctcccgt catttgactg gatgtatgtg ataatttaaa 7122 0 

tgatctcata agcttttcct tatgtaatca aatagtagcc aaaaacatga ttttaaatgg 71280 

ctgctcacaa ccccatctcg tggttctgcc acgccttgtt tatccccatc caccccctac 71340 

tccctttccc cttccctgcc tgtgtggggg tcctagatga cggtgagcca gagggcagcc 71400 

ttggtcagca gattggagag tgcaaataat aaaaacactc agaaggcgag ctgttgtcaa 71460 

gtgggcttat cacaaaagag caccttggga tattccagag aatgacctca tacccgctaa 71520 

tcactatcca taatctggtg ctaactgtac tttagctgaa ggtgctggca ggtcctgccc 71580 

aggtgctgct aagaacactt ctattctgtg agaatcagag atgatttcta gggaaaatgg 71640 

gcgagaggga gtaagcagga gaaacaaccc acaggcacag ctctcatctt tctgccctga 71700 

gagccttcct cctgccacgt ggttttgttt gtttgtttgt ttgtttgttt cagatagggt 71760 

ctcactctgt cacccaggct ggagtgtagt ggcaagatca tggctcactg aagcctcgac 71820 

ctcccaggct caagcagtcc tccccaaatt caaagcttgg agtgatggtc ccagtggtta 71880 

tgtctaggag ccctttttcc tgccagcccc tcaggggatt gatgactctc aaatgcttca 71940 

ggtgtgacat gggcacagca gtgagtcatt cctctgacat tctttgggaa gaacattttc* 72000 

catccaggct tccaggcata agatccagtc ctctggtgat aaggagttca cagacaggac 72060 

aatgtctgag tgtatcttaa acccaggacc atggcttgtg ttcacaccag accctccagg 72120 

gattttgagg tgttttgttt gtttgtttgt ttgtttgttt gttttttgag acagagtctc 72180 

tctctgtcgc caggctggag tgcagtggca cgatctcagc tcactgcaac cttcgcctcc 7224 0 

cggttcaagc gattctcctg tctcagcctc ctgagtagct gggactacag gtgtgcacca 72300 

ccacacccgg ctaatttttg tatttttaat agagactgtg tttcaccatg ttggacagga 72360 

tggtcttgat ctcttgacct cgtgatcctc ccgcctcggc ctcccaaaat actgggatta 72420 

caggcatgag ccaccgtggc ccgcccaatt ttgagttttt atgttctaat cccaaacatc 72480 

tgctcacagg cccctcagca tattctttcc tgggtccagt gtcacctccc aggcctgcag 72540 

gctggctaga gcagtagggt gtgtgggaaa gctctgggct ttgcaggcac tgatcagctg 72600 

tgtgacctta accaccctga acctcagttt cctcacctgt aatggaaata ggtaccacgg 72660 

cagtttgttg caaggactag agagtaacct tgggaataaa aggtagcagc agcttgggct 72720 

ctggagatgg actgtccaag accaacttcc agttcctccc cacacaagct ctggcactta 7278 0 

gattcctggt acctccgctg cttcatctgt aaaatggagt aacaatagga atactttata 72840 

gagttgtaag gattgagtgg ctggatgaac gtcaagcact tcaaagggga cctggcatgt 72900 

agtgagtgat caatataaac cacctggctt gtagcaggtg tgctgtgtgt ggctgcaggt 72960 

gttattagta acatctgtgt gcccttcaga gcgtgcacca cacttcacac cttgtggagt 73020 

ctggaatgcc actattatag ttcaggatag aaaacctccc tgcaagcact cgctttagct 73080 

tgtctccacc gaacaaaaca acacaagttc tttattactt ggaatgggaa aacttcaaag 7314 0 

gcaaaaaaaa aaaaagactt tcgagttacc ccaaatctta agccaaagtc aatgaaaaat 73200 

atcaatcttc atattcaatt tttgcgatac ttttgtctcc ccagcagtca atggagagaa 73260 

tccaagcaca cagaaatgtc aattaccagg ggcagggcta tgaattcctt tcagagccct 73320 

gggctgggga agagtgcagg cagacagatc tgggtcctgt tatcacgttc ttagattggg 7338 0 

tgtccttgta ggagtcatga agcatcttag tgcctttgtt tgctacctat aatgcctacc 73440 

tcagagagta ataaggataa gtaaggctct acgtgaaaag tgctcggccc tggcacatag 73500 

taggtccttc attaatggca gctactaatt tttattacat acgcaaaatc acattacagg 73560 

tcaagtacgc tacatgacag tgaaacagtt tttttgtttg tttgttttga gacagagtct 73620 

cgctctgtca cccaggctgg agtgcagtgg cacgatcttg gctcaccgca acttctgcct 73680 

tcaagcaatt ctcctgtctc agcctcccga gtagctggga ttacaggcat gtgccaccac 7374 0 

gccagctaat tttttttggt atttttagta gagacggggt ttcaccatat tggccagact 73800 

ggtctcaaac tcctgacctt gtgatctgcc caactcagac tcccaaagtg ctgggattac 73860 
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tggcatgagc 
ctacccaacc 
tttctgtact 
ttgtcacagt 
ccattaggct 
ctgtatcctg 
catgagagag 
aatcccagct 
aatgagcact 
gagttctttt 
ttccaagcct 
gttgggggca 
cctcctcaag 
caggagactg 
agaatggaac 
ttttaacatc 
ttacagggta 
cacaggggaa 
acttacctgg 
gtttaatggc 
atacttagca 
aatttctgcc 
aaattgcttt 
ttccagaact 
ggggaaacag 
cctgattagt 
tgatccttgg 
ggcattctgg 
nnnnnnnnnn 
nnnnnnnnnn 
tgtctcaaaa 
caggctggag 
tgatcctcct 
cagaccatta 
atggataaat 
agtcctgata 
atcacaaaag 
ctaactttta 
ggggcagagg 
ataaagaaac 
actgttaaat 
aagctgtttt 
agaagcagga 
tgtcagacct 
caccagcacc 
gctcatgtct 
gtgttcgaga 
tagccagacg 
aatgcactcc 
cataggggac 
tcatagggga 
actcttatgt 
cctggctgcc 
tgggagagtc 
tgcggcagat 
tccacccctc 
ttgaaacgag 
ctcacagcac 
gctcaatgtt 
tcaaatccaa 
gataaaaaga 
gcacagctct 
gcagtggtaa 
ccaaagagtg 



caccgcacct 
tatagctaac 
gttacctagt 
tgcagtttaa 
ttcatttggg 
agaactttgt 
agagagagag 
ctgagcttct 
tatgcagaag 
ccaagctcca 
tgctttttct 
ccatgtttca 
ccctgcaaca 
gggggatgag 
cagggtcttt 
cgaggaccac 
agttacatta 
gtgagtcacg 
aaaatgactt 
acttgtgtcc 
cctagaaaca 
taagatctca 
tcttcattta 
gtgcaggacc 
caagaagatt 
gtccttgcta 
ctgaaacttc 
gcagatgaaa 
nnnnnnnnnn 
nnnnnnnnnn 
aaattaaaaa 
tatagtggca 
gccttagcct 
ttcataatag 
agaatggtgg 
catgctacaa 
gctacatatt 
tcacaggcaa 
tgggggtgag 
tgttctgaaa 
tatatacttg 
acatatttta 
gtttgaggag 
catcaaaatg 
aggtggagaa 
ataatcccag 
ccagcctggc 
tgctcgcgtg 
agcctgggtg 
agtggcaggt 
cttagatgcc 
gtatcatttc 
aaccccacct 
gccatctctc 
cctggggtgg 
aaggaatcgg 
aagccaggag 
tcccacggaa 
taaggaactc 
gctaaacttc 
gaggggagcc 
ctccttccca 
ctccctgccc 
ttagaattcc 



ggctgtgaaa 
tcctatagtt 
actggttttc 
tgtaccatca 
aactccacgc 
accctactta 
agagaggaga 
agttccctga 
aggctcagaa 
ccgtttgctg 
atccctaaca 
ggcacttagt 
tctgtgaggt 
ggaactgcca 
tattttttta 
attctttgtg 
aacaacgtat 
gacaagagtg 
aaccattccc 
tagagttgtt 
cttcctaaat 
tttcagggag 
ttcacttaca 
agagaggcac 
gctacaatga 
cctttctctg 
gatgaagaaa 
tgacatgtga 
nnnnnnnnnn 
nnnnnnngca 
aaaaaagtcc 
caatcatagc 
tgaaataagc 
ccaaaatgtg 
ttgatccata 
ggatgaacct 
acaagattcc 
agctatgaca 
gcgagtgagt 
tggactctgg 
aaatgggtga 
catatttaaa 
gttctaaata 
tatgatgtaa 
ggcaggaaga 
cactgcggga 
caacatggtg 
aacccagggg 
acagagtgag 
gtcaagggca 
atgtgggtcc 
atgtcaggcc 
cctccagccc 
ggtgccatct 
agcagccgct 
ttttcttcca 
caacgagaag 
tcgatattat 
accgaagtca 
cttccaagct 
caattccatt 
gaataggaag 
caaggcagct 
cactcccagc 



cagttttatt 

ccctcagttc 

atagcaccag 

taggatttta 

aggcagcagc 

gcacagaatg 

gggagagaga 

tggtgaggac 

aatttctcct 

gcatgcaaaa 

taggacaata 

agattattgt 

ggtcatcctt 

aggaggtcca 

tttttttatt 

ctttccaaat 

gtaaatggct 

cagccgctcc 

aggatcagct 

tagaaggttg 

attagttgct 

cccaactcaa 

cagcaaacat 

aggtgaagga 

ggtgggaaga 

ggagagccaa 

aggaaagagc 

caaaatatgg 

nnnnnnnnnn 

ctccagcctg 

agaagaacat 

tcactgcaca 

ttggattaca 

aaaacaaccc 

caatggagta 

cgaaaacatt 

atttaaatga 

ggaaatagat 

actgctaatg 

tgatggttgc 

cttgtgaggc 

aatgcaggtg 

ttgtgtggtg 

tcttcaagaa 

agttacacaa 

ggccgagctg 

aaagcccgtc 

gagaaggttg 

actctgtctc 

ggcagggtct 

ctaatagccc 

tcttcttccc 

ctgtcacccc 

gttgcatctt 

gttcagagca 

tagccacatc 

gacactaaca 

tatccccatt 

ccaactgata 

tactccacaa 

tccacccagc 

atacccatca 

gcagacagcc 

ttcggggcca 



gtgtttctgt 
tcagctcaga 
gtacctctct 
aaaatattca 
tgtatatttt 
gaggctcagt 
gagagagaga 
tgtgatgtgt 
catggccaac 
tttggactat 
ttcagcattg 
accaccacat 
aacaactcac 
gcttatgggc 
tttatttttt 
catcacctgc 
ttgtgctagt 
attcggatcc 
gtttgtctgt 
aataatatgg 
gctgttgtta 
tctttgacaa 
gaattgagcc 
agcaaggctc 
gggctggact 
ggcaggcttc 
gcagtggtta 
gtgatcannn 
nnnnnnnnnn 
ggtcactgag 
ttgggtctca 
ttcaaactcc 
gatgagccac 
aaatctccat 
tttactcagc 
atgctaagtg 
aatgttcaga 
gagtggttgc 
gtacagagtt 
actactctga 
atggaaatta 
gagggatgag 
ggtactgagg 
agttgatttt 
ggggtaggcc 
ggtgggtgac 
tctactaaaa 
cagtgagcga 
aaaaaaaaaa 
ctcctatctc 
tccacttggt 
aacccaccca 
ataattgggg 
tacagataac 
gtgatcaaga 
aggtgctgtg 
tttattaagc 
ctgaagacca 
aaagtgatgg 
cacagaggtt 
tcctgaggcg 
gaggcaagtc 
tatggctgta 
ctcacacaag 



ggaatgtgtc 

tatcccttcc 

ggcatagagc 

gttgtgtctt 

gtattgccta 

aaatactgga 

ttcaacctac 

ctcacacggt 

ggaagactta 

cacttaagtt 

ttgtttgttt 

ttcaattggt 

agatgagcaa 

agcagagcca 

aaccagggtc 

cccatgcaac 

tattcaccac 

tggctctgac 

aatttaggta 

agcacttaac 

tcgttattaa 

gcttaaacaa 

tgtactgtgt 

tggctctact 

agagagaagc 

ctggaagagg 

gggaggaaag 

nnnnnnnnnn 

nnnnnnnnnn 

tgagagaccc 

ctctgtggcc 

tggcctcaag 

cacacccagc 

caactgacaa 

aataaaaaga 

aaagcagcca 

ataggtaaat 

ctagtgcttg 

acttttgggg 

acatactaaa 

tatcttaata 

ccctctaaag 

catataaatt 

aaagaaacac 

aagagtggtg 

ttgaggtcag 

atacaaaaat 

agatcatgcc 

aaaaaagtta 

caggataaac 

tcttgcagcc 

gccatcccag 

ccaggaggca 

catggctgga 

cctccccatc 

caggaaggag 

actgcagact 

ggcaactgaa 

aagctgggat 

ggggaaaggg 

gagcttgtca 

ctagacacca 

gttactgctc 

gtgattgaag 



73920 
73980 
74040 
7410O 
74160 
74220 
74280 
74340 
74400 
74460 
74520 
74580 
74640 
74700 
74760 
74820 
74880 
74940 
75000 
75060 
75120 
75180 
75240 
75300 
75360 
75420 
75480 
75540 
75600 
75660 
75720 
75780 
75840 
75900 
75960 
76020 
76080 
76140 
76200 
76260 
76320 
76380 
76440 
76500 
76560 
76620 
76680 
76740 
76800 
76860 
76920 
76980 
77040 
77100 
77160 
77220 
77280 
77340 
77400 
77460 
77520 
77580 
77640 
77700 



46 



BNSDOCID: <WO_ 



_0229058A2J_> 



WO 02/29059 



PCT/US01/31488 



tggaaaccag 
caaatgagta 
cctttatatt 
aaattcctct 
aggactctgc 
ccactggatt 
gaagaggtcg 
ctaacctttt 
attttcaggc 
cagggagccc 
ggccatccac 
cctgcacaga 
catcatggtg 
catagagccc 
ccagaggtgc 
gtctgcgcgc 
agggccgggc 
gcagagcgcg 
ctagaagctg 
gcccctccaa 
gtatcaggtg 
ggagacccag 
gcagggctcg 
gagggggctc 
gcggatgggc 
cgccgcggcc 
agcgacaact 
agaggggagg 
ggggctttgg 
aagcagagga 
gaaaagaaaa 
gcaagtaacg 
tctcagccga 
ctccggtcac 
tcgcggcgcg 
tctgcgtgca 
cgctgtctcc 
agctgggctt 
ctgcgccccg 
acgcgcacgt 
tacatttcca 
gccctctgta 
cagggccgcc 
agagtggcac 
tgactaccct 
aattcggtta 
aaatgctaag 
tttcagaaga 
ctctcagcct 
taatgagaag 
atgctgagct 
tttacctacg 
aaaatttttt 
cccagaactt 
ctggacaaca 
ggtgtgcacc 
gagatggagg 
aagactctgt 
taaaagggaa 
agacagggca 
atttaaacct 
tagaggtagg 
ccagaaggag 
tggccaaagc 



agactctcca 
atttgaaaac 
catactccac 
cccagcacgt 
ctattagtcc 
agataaatga 
agaagtgagg 
ctggcttcca 
atacctgggc 
cagcccagga 
cagcgcctcc 
aacagagcgc 
gcacccgaga 
tgggcttccc 
acgcggcacg 
tgcagcttcc 
tgctggggag 
ctctgaagca 
gcaggaagaa 
gacctctcct 
acccacgacc 
gacgggctct 
gggcaccagg 
ggacttcggc 
tggcgggcgg 
gctgcagcct 
gggatgagac 
aagtaccagt 
gttcaagcgc 
agtgcagcga 
gggaaggacg 
gtctttcctt 
cgcatagccg 
cggcaaatga 
tccggagtcc 
ggacccggcc 
tcggaagaag 
ggatgaagtg 
aggtgggggc 
gactcgcccg 
ccctggaccc 
tccttgttca 
ttgggtgggg 
gggtttcttt 
gggttaccta 
aaatggaacc 
catagcattt 
cctggtgatc 
ccctgagcct 
taatctcatt 
taatacctag 
taacaaacct 
taaaaaaaga 
tgggaggcca 
tggtgaaacc 
tgcagtctca 
ctgcagtgag 
ctcaaaaaca 
cccctgaaca 
ctttccaaaa 
catagctttt 
atgcagagtc 
aatggcaaga 
acatgtcagg 



caatgccctc 
ctgggggctc 
tttgtgcagc 
ctccttcaaa 
cagggtctgg 
aacagtgggc 
ctgtccaact 
cctgcccctt 
acgtgctgga 
gtacagcaga 
ggtacgcctg 
tgggtgaagg 
cctctcgggc 
agccggtgcc 
gggctcccgc 
tagcaggctg 
aaagtccgca 
ccgcgggccc 
gggcaagttc 
tccctctggg 
ctacagtctc 
cctcggttcc 
ctttgcacct 
aggaagtctg 
gtgtgtttac 
cggcgggagg 
cgaggaaagc 
cacttcttcc 
tccaggaggt 
gcggggaccc 
cgcttggcga 
gggaagcctg 
gagaccctac 
gcagccagca 
cgtgggccca 
gccacggagc 
cgggggaact 
gatctgcgga 
gggggctccc 
ctggtcccgg 
ccatcttctc 
aaacaaagtg 
ttttgaactg 
ggagttagtc 
gggcaagtta 
tacctaactg 
ggcatagtaa 
ggatgcttcc 
cagtatcctc 
acatgtgagt 
gtgatgggtt 
gcacattctg 
gtgatactgg 
aggcaggagg 
ccgtctctac 
gctacccagc 
ccaagatcat 
aaacaagaat 
gatgagccac 
taaaaatact 
cacagaggga 
aggtcggcct 
gcaaaggctt 
ctctgatggt 



ctagagtaaa 
ccagctcctg 
tctcccttgt 
gagctctaga 
atgaccagca 
agagtgccac 
gctgaccctg 
tgccatccct 
atagaagccc 
gaagaagagg 
ttgcagcgtg 
gccccccagt 
cagcccgcga 
aaggagctgg 
tgagccacta 
ccgggttctc 
tctgcccagg 
atgtccggac 
aaggctaccc 
gccgggcgac 
tcgggccaag 
ctcctccccc 
cggaacccgc 
gcggctgctg 
caaagggagg 
agagggaacg 
ggagaggaga- 
agggggactc 
ccgctggaac 
gggaggaaga 
gatgggacac 
gcagtgtcgg 
gcgcgccccc 
gctgcggacg 
gccctgagcc 
ttcagcctga 
gggaacccgc 
gttgatagtt 
ttcttttccc 
acactctccc 
ccccaagccc 
cgattcggct 
tgcagctgga 
agatcgaggt 
cctctctgag 
cccaaaggaa 
gcataatgtt 
agatcaacaa 
atctgtaaaa 
ttaattgtgt 
gataggtgca 
cacatgtacc 
tggccaggtg 
atcg.cttgag 
aaaaaagaaa 
aggctgaagt 
gccactacac 
gactacagaa 
caagccaaga 
gttattaatt 
gaaactgcag 
ggcatttaag 
agctgtggga 
ttaacttctt 



tgaggctatg 
aaaagggaag 
cttatgatag 
cctgaggctg 
ggacacctgg 
ccaatctccc 
ctttctgtcc 
ccccccagcc 
tcgttcttca 
ttctccagca 
ggggccatgc 
ggccccaggg 
ggagcccctc 
ctccgcgcgc 
tcggaaacaa 
tcacccaggc 
tccccagagg 
tctcgcgcca 
tacgacccca 
agcaagccct 
ccaacagctg 
gcgcgcccct 
ttgcccccct 
actttataag 
gaaagagccc 
cgggcagcgc 
agggcaagaa 
ggtattctca 
tctggcaaac 
gaagagtcgg 
tgtgccgcgg 
cgggagccgg 
tccccgccca 
cctccgggag 
gcgccggcgc 
cagcccggtg 
cgggcgccag 
gtatttacac 
ctccccttag 
tctggcacag 
ccagactaac 
gaagccgact 
agcagtgttt 
ctgagtcttg 
cctcagcttc 
tcgcgattgt 
aattgttgct 
ttgattgact 
tagactacta 
gttaagagtg 
ataaaccacc 
ccagaactta 
tggtggttca 
ctcaggagtt 
aaaaaatagc 
gggaggatca 
tccagcctgg 
agctccaaga 
gaggaactaa 
cctcaagaca 
gcttgaagct 
tacggctcct 
atggcacaag 
aaaatgcaat 



taactttgtc 
gatgtggggc 
ccctattaag 
tcagaggctt 
cattcagtga 
cctgaagttt 
acctggccac 
cacccagccc 
gaatgatcaa 
cagccgtgca 
tggccccgag 
aagggtcctg 
atggaggccc 
actagcagtg 
ggaaggtcct 
cagggcgctc 
acagcaaggg 
ggaaagaccc 
tcttccagtt 
ccccctttcc 
ccacgtggag 
cactcactcc 
ccagccccgg 
ggcagcggtg 
cagctccccc 
gggggcgggg 
agacccagag 
tctgtgaaac 
gcgcagctct 
aggggtcaga 
gaccgcgggc 
cctcggtgtc 
cgctgctcac 
cgcaacgctt 
tggggtcttc 
gcctcgcctc 
aggtctgcga 
gcgtccggag 
gtcgagtttc 
ccccagcacc 
atcaggcagc 
gaccgcgatt 
tccgagaggc 
actttttaac 
ctcctcttta 
gatgcaggta 
gctgtcatta 
ccaggtaaat 
tggtgtggag 
ctgctaatgc 
atggcataca 
aaataaaaat 
tgcctgtaat 
cgagaccaac 
caggcatggt 
ctgagctgga 
gtgacagagt 
aggcctcaga 
tggctaccat 
tcatggtccc 
ggagcaaggt 
tccattcctc 
gagttctcgg 
actgcctccc 
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agaacttcca 
tctaagctag 
gtggctcatg 
caggagttca 
ctgagcatgg 
cctgagcctg 
ctgggtgaca 
tactcttgac 
caaggttttc 
acacttgctc 
ggacacaggc 
acctcaccta 
ctcccagtct 
tttttttttt 
ctcagctcac 
agtagctggg 
tcttttttcc 
tcggctcact 
gtagctgaga 
ggcggggttt 
cgttggcctc 
tattttttta 
ctcacgtgat 
cgctcagcct 
cttctctctg 
tactctgctt 
gcccccatcc 
tgatttgtgt 
ttgttgaata 
gtttggaatt 
ctcctggatt 
aggtttttgt 
aggagaccac 
tggctcactc 
aggagtttga 
ttatccaggc 
aatagcttga 
ggtacatgga 
aaacaccgca 
ggggaacatc 
cttagaggac 
aacctgcaca 
agaagaaaca 
ggtgactcag 
ccaggagttc 
aacaaacaaa 
ggctgaggtg 
gccactgcac 
taataaaaaa 
gatttatcaa 
acactactca 
ctagaggctg 
ggctctgcag 
tcttctagtc 
ttatttagcc 
gtaatcccag 
cagcctgacc 
ggtggcatgt 
caggagacag 
agcaagactc 
gtgcagccat 
ctatagccca 
taacacaaaa 
caaacatccc 



gatcaaggtc 
gagagtcatg 
cctgtaatcc 
agacctgctt 
tggcatgtgc 
ggaggtccag 
gagaccctgt 
aaaaatgctt 
atggaccctt 
ttcctcatct 
ctcatttggt 

aggggcactc 

ttgcacctgc 
ttttgacaga 
tgcaacctct 
attacaggtg 
tgagacagag 
gcaagctcca 
ctacaggcgc 
caccatattg 
ccaaggtgct 
gtagagatgg 
ccacccacct 
ccttcttcat 
gaagctctac 
acctctatgg 
cgccccatga 
gcctgctgcc 
aatgaaaagg 
acctctggga 
ttgttttctt 
gtttaattaa 
cataatagtt 
ctgtaatctt 
gaccagcctc 
gtggtggtac 
atctgggagg 
tgaagctgga 
tgttctcact 
acacactagg 
gggacaatag 
ttctgcacat 
aacaaacaaa 
gtctataatc 
aaggccagcc 
caaaatagcc 
ggagaatcac 
tccacctgga 
cagagaagag 
cgttctcctc 
taggttaagt 
cttcacagct 
ctgattattt 
tgttaagtgc 
taacaaatag 
cactttggga 
aacatggtga 
gcctgtaatc 
aggctgcagt 
tgtctcaaaa 
ttcggaaaac 
gcaattccac 
atgtgtatac 
agcagtccat 



aaactcctca 
gatccctttg 
caacattttg 
ggtcaacata 
ctgtagtctc 
gaagctggag 
ctcaaaaaaa 
gtgtgaacat 
tccatgaggc 
caacccacat 
gtgttcattc 
agcacatgtg 
tattccttct 
ttccagctct 
gcctcccagg 
cacaccacca 



tctcactctg 
cctcccgggt 
gcaccaccat 
gacaggctgg 
gagattacag 
ggtttcacca 
tggccaccca 
ttctaatgta 
tctaggctgg 
aagtccccac 
gatgtacatc 
tatcacaggg 
gaatggtggg 
ggccaaggac 
ttggttttgt 
tggagcatat 
cacattttgc 
atcacttcgg 
accaacatgg 
atgcctgtaa 
cagaggttgc 
agccatcatc 
cataagtagg 
gcccgttggg 
gtgcagcaaa 
ggatcctgtt 
aataattccc 
ccaacacttt 
tgggcaacat 
aggagtggtg 
ttaagcctgg 
gcaacagagc 
gaaagacacc 
tacccccaaa 
ataacagcat 
taatggacaa 
gtgtgatttt 
tgaaaataaa 
ctattataaa 
ggccaaggtg 
aaccccgtct 
ccagctactc 
gagccaagat 
aaaaaaaaaa 
agtttggcag 
tcctaaatat 
aagtatttat 
cagctgatga 



gctctacaca 
agaatacaaa 
ggaggctgag 
gtgagacccc 
agttactggg 
tgagccgaca 
aaaaaaaaaa 
ggacacacac 
tctagtggtc 
ttccatggag 
actgctgtgt 
cagtgaagag 
tgtgggaatg 
gttgcccgag 
ttcaagcaat 
cctgtgagct 
ttgcccaggc 
tcaagtgatt 
gcctggctaa 
tctcgaactc 
gtgtgagcca 
tgttggccag 
aagtgttggg 
ctcatccttc 
attcagggcc 
actgatctag 
ttgtgggggc 
cctggcgcct* 
aaaggaagct 
tttcagtctt 
ttgtttttaa 
attggaaagg 
agccataaaa 
gaggttgagg 
agaaacccca 
tcccagctac 
ggtgagccga 
ctcagcaaac 
agctgaacat 
gagtgggggt 
ccaccatgac 
ttgttttaag 
atttaaaaca 
gggaggccaa 
ggcaaaaccc 
gtgcatgcct 
gaggcggagg 
aagattttgt 
tgagatatat 
actaattcct 
tatcacattg 
gagcactgag 
gtccaaatca 
agtatccaat 
tatttaggct 
ggcagatcac 
ctactaaaaa 
aggaggctga 
cgtgccactg 
aatctctgca 
gtcctcaaat 
actacaccca 
agctgtatta 
acggagaaac 



gggggaccta 
agacagtggg 
gcaggaggat 
tatttctaca 
gaggctgaag 
tctcgccact 
gaagaaatat 
actcatcaac 
catggacccc 
ttggactgtc 
atcccagcac 
tcagtcagct 
acagatttcc 
ttggagtaca 
tctcatgcct 
gatatttttt 
tggagtgcag 
ctcctgcctc 
tttttgtatt 
ctgacctcgt 
ctgcactcgg 
gctggtcttg 
attacaggca 
acaactcagc 
ttgtccacat 
aataatcagc 
aggaaccacc 
aataagcttg 
gaaaaggtag 
gcagggtagg 
tgaagggtag 
acagagacct 
aggaatgagg 
caggcggatc 
tctctactaa 
tcaggaggct 
gatcgtgcca 
taacacagga 
tgaaaacaca 
tggggggtaa 
acacgtatac 
aagaaataaa 
ataaaaaata 
cgcgggcaga 
tgtctctaca 
gtcatcccag 
tagcagtgag 
ctctaaataa 
ttccatatct 
tcctaaactc 
gctgtcatgt 
acagggtggg 
ctccatctca 
tcaattcatt 
gggcacagtg 
ctgagtcagg 
tacaaaaatt 
ggcaggagaa 
cactctagcc 
tgaagaatgt 
agttaaacat 
agagaattga 
ttcattacag 
aaaatgtggt 



gagtcaactt 
cgcggtggca 
cacttgagcc 
aaaaattcag 
taggatgatc 
gcactccagc 
gttattgatc 
attcacattt 
catggctgga 
tgctgcatga 
ccagaacagc 
ggtttcacac 
ttcatttctt 
gtggcacgat 
cagcctccca 
tcttttcttt 
tggcgtgatc 
agcctcccaa 
ttttagtaga 
gatccgccca 
ccattttttg 
aactcttggc 
tgaaccactg 
tcaagtttca 
acccaccaaa 
ttagttttct 
acgtggtagg 
cggccaacat 
gctaaaatca 
taacaggaaa 
cgttatcgtc 
taaagcagtt 
ccaggcatgg 
acctgaggtc 
aaatacaaaa 
gaggcaggag 
ttgcattgca 
acagaaaacc 
tggacacaga 
ggggagggaa 
atatgtgaca 
gaaaaaacca 
ggccaggcat 
tctcttgagc 
aaaaatataa 
ctactcaggt 
ctgagatcgt 
ataaataaaa 
gaatcaatag 
tgttctcctg 
gggctcctgg 
tctaaatcct 
tgagcctcac 
atttaatgaa 
gctcacgcct 
agtttgagac 
agctgggtgt 
cgcttgaacc 
tgagcaacag 
acataaaatg 
agagttacca 
gaatatttgt 
ctaaaaagtg 
atacccatac 
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aatgtcatat tatttggcca taaaaaggaa gtactgatac atgctacaac atggatgaac 85440 

cttgataatg ttattctaag tgaaagaaac cagacacaaa agaccacata ttgtatgact 85500 

gcatttatat gaagtgccca gaataggcaa atccacagag acagaaagta gattagtggt 85560 

tgccagagac tggagggagg agataatggg aaatgtggaa tgactgctaa tggtatgggg 85620 

tttcttcttg gggtaatgaa aatgttgtac aattagataa tggtgatcat tgtaaaactt 85680 

tgtgaatata caacatgctg aattttatac tttattatat tttatttttt ttgagacaag 85740 

gtctcgctct gtcacccagg ctggagtgca gtggcacgat ctcagctcac tgcaatctct 85800 

ctgcctccca ggctcaagca atcctcctgc ctcagcctcc tgagtacctg acactacagc 85860 

atgtgctacc atgcctggac aatttttgca tttttagtag agacagggtt tcgctatgtt 85920 

gcccaggctg attttgaact cctggactca agtgctccgc ccacctcagc ctcccaaagt 85980 

gctaggatta caggtgtaag ccaccactcc cggcctaaat tgtattcttt aaaagactga 8604 0 

attgtatggt gtgcgaatta tatctcaatt taaaaaaaac aaaacaaaac aaaaaaaaaa 86100 

cctttgcgtg tgtcaggcac tagggattcg atgctgaata agacacagac cctaccctca 86160 

gagaacacag agcccagcag gagagagtca cagatgaatc aagtgttaca tcatctatag 86220 

gaagcgccat ggaagaaaga catggtgcca tgagaacata cgcttagaga agggaatttc 86280 

atctagactg gggctcaggg aggaatcttt cagggtgatg cttgtgctca gagttttcca 86340 

tgtcagaatc agtagaattt atcaatcctc cagaggagga aacagcaaat gaaaaatctt 86400 

acaacaggag gatgcggaga cattccgaga gctgatcaag ggctggtgtg aacaaagcac 86460 

ataggatgca gagcctgtgg tgtgaggttg cagctggaaa ggtaaaacac taattacatt 86520 

ggatcttctg agacaataaa gagtatgcaa taatctcaaa cgaccgaaac tgaccttcct 86580 

cctccctaac ttgcttgctt ccactgttgc ccgtatcata aaagcaccac cctcttctac 86640 

ccagtggctt aagacacgaa actcaagtca tcccaggctt tctccccacc tcactctcca 86700 

catccagcct atcagcgagc ttgtgggtct taccacgtaa agacttctca tctccagcta 86760 

ctaccatccc ccaagcccag atcaccatca gctcaggcct ggactcctgc aacctttcta 86820 

accgggtctt cccaatccta cccccgcaac atgaccccaa tagcccatca gaatggacta 86880 

atcgagatgt agatttgatc aggccacatc ccttgaaagg cttcctgtga ccctcgggga 86940 

aatgcacaaa ctcccaatga tggcccctga gtcctgtgcc atctgggtct gccctctgcc 87000 

ctctgtgtct ttgccatggt aacctccttc acacccatta atactccatg ctctctccta 87060 

cctcaagttc ttcctgggct ggaacattct ctgcactagc' ctagccaact aaccctttag 87120 

atcttttgtt tgtttgtttg tttgtttgtt tgtttttgag acagtcttgc tctgttgcca 87180 

ggctggagtg caatggtgca atctatctcg gctcactgca acctctgcct gccgggttca 87240 

agcaattctt ctgccttagc gtcctgagta gctgagacta taggcaccta ccatcacgcc 87300 

cggctaattt ttgtattttc agtggaggtg ggttttcacc atgttggcca ggctggtctc 87360 

gaactcctgg cctcaaatga ccaccctcct cggcctccta aagtgctggg attacaagca 87420 

tgagccactg tgcccaggca acacttcaga tcttaatgat catttccttt aagtgcctga 87480 

cctcttgtag taactagcct gactccagca atgaatcctt ttgcaatgta acctatataa 8754 0 

catctgagtt tccctttgat aaaactcatc atatatttgt tcctctgaca gttcagaggg 87600 

caagggcctt tgcccacctt cctcaccact atcctctcac cacttaacac agaactcacc 87660 

acccaccatg cctcctgcct gacaaattcc taaccatcct tcaaatctca ctcacctatt 87720 

accttctggg aggcagtctt ccctgagcac caagacaatg ggacacattc ctttatacac 87780 

cctgctgaac atctcttttt tgaggggcgg gtagagatga gtgtctcact atgctgccca 87840 

ggctgacctc aaactcctgg cctcaagcga tcctcctgcc ttggcctccc aaaatgctgg 87900 

gattacaggc atgagccact gtacctgacc gcaactgggt tagnnnnnnn nnnnnnnnnn 87960 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 88020 

nnnnnnnnnn nnnnnnnnnn nnncctgggc aacaaagacc cttcctctac aaaaaaaaaa 88080 

aaaaaaaaaa aaaaaaaaaa aaattatttt aaattagcca gacatggtag tgcatgcctg 88140 

tagtccaagc tacttgggag gctgaaggga gaggatcact tgagcccagg aggttgaggc 88200 

tgcagtgagc cgtgatcgta ccactgtact ccagcttggg caacagagtg agacctcatc 88260 

cctaaaaata aagaagaaaa tatggcaatt tgactgtaca tctctaatgg gatatatcct 88320 

aaggatgaga aaggaataag gaaggacaga aaaaaggaaa caaagaagta gcaacagtat 88380 

ttagcaattg tattgttatc aagtaacatc aatattggta aaaccagtaa ttatatttaa 88440 

aatactatat atgtgtatgt acatttacat atgcatatgt taggaaccaa gtttatcaga 88500 

ggaagagaaa gggctacaaa tgtaaaatca aggaaataaa aatttgaata aaaatatcag 8 8560 

tattaagtat ttatgatatt tttcttataa aaaaattata tatatgttaa ctctatccaa 88620 

aacccaaaag cagtgacaac ccaggagcaa taaaaaacct cagcatccag actgtagtct 88680 

ctaccatttc caattaaaga aacccagggc tagttgggaa aaatgacaat ttcatgtcta 88740 

gggcaagaaa cacacctagt gaaatggacc tgaacattta attgtgttag aaagtaagga 88800 

aactctctag aaataatgtg atttcatcta aaagacacag attctgggct ggtaaagttt 88860 

tcaatggcca aaggtgagac aatttgagca tcaagaagaa tcatgacaga acagattaaa 88920 

acatgtcaaa tatattttaa aatgaaatat tataaaagaa acaattagta gccatccctg 88980 

aaggtcacta gggcaccaac tcatatttca aactggtaaa taaatgtgta agccaagcat 89040 

ttatttctgg gtaacaaaat agtaaggaat gtttttcttt ctagaagaat tctagtgatt 89100 

aaaagtagaa gatagaaata gaaaatcatc cttttggcca ggggcagtgg ctgtaatccc 89160 

agaactttgg gaggccaagg caggtggatc actggagatc aggagtttga gaccagcctg 89220 
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gccaacatgg tgaaacccca tctctactaa aaatacaaaa attagccagg tagtgggtgc 89280 

ctgtaatgcc agctactcgg gatgctgagg caggagaatc gcttgaacct gggaggcgga 89340 

ggtcgcagta ggctgagatt acgccactgc actccagcct aggcaacaca gtgagactct 89400 

gtctcaaaaa aaaaaaaaga aagaaaagaa aatcatcctt ttgcgatcct aatgaaataa 894 60 

tgggcctagg cattgatcat taatggctcc taaaatcact aagtatatgg ttgatgggaa 89520 

actttatagt ggatggatca gactcgcaat gtctaaacca gttgatcaat cttaacatcg 89580 

taacaagaca acagacacca ggggctgctg acaggagaac agaggaaacc catagctcta 89640 

ccactgagtt attcacggca aaaaaaaaaa aaaaaaaatt aaactgcgtt tcctccaagc 89700 

ttctaatcct gttgtttaca ggaaataccc aaggaaagga atacttttaa atgacacatt 89760 

aaaacaacgc caaatccaaa atatggggaa atgacccagt ttcttcaaca aataaacaag 89820 

aaaaggtagg ggggaggact gttctagatt ttaaaagcta tagaagacac agcaaccaaa 89880 

tacactgcat ggaccaggca tggatcctaa ttggaacaaa ccaactgtaa aaggatgtat 89940 

ttgaaatgat tggaggaatt tgaacagtga ctgcacagta gatgatatga agaaattatt 90000 

gttatttttt aggtgtgatc atgattttat ggtgatgttt aagtaaaaga ggccttattt 90060 

gttagagata catgtacggg tatagagaaa tatttacgga tgaaatgata cgatgtctga 90120 

gatttgcttt gaaaactcta gcaggtgtgg gagaagcagg tgcatgggtg ggggaaggga 9018 0 

tagatgaaat aagtatgcaa aatgttagtc tacttttgtc cctcctgacc cagcaggtta 90240 

aaatacctca gcatacctct actcctccaa ccaggtccaa ggatcaggcc aaaactccct 90300 

gatgtggtaa acagcctgac cccttcttac ctctctctct ccagccactt ccctaagatt 90360 

ccccagtgct ctgtgcccta gccagcccga ctcatctgcc cagattcctc aatgtttcac 90420 

tctctcattc accattttga ccccactgtg ctcctgggcc actctccaag gccccgcctc 90480 

ttcatctcct ccctccttac tcatccttca ggtcttggct taggtgccat tccctccagg 9054 0 

aagccttccc tgacaccaat cccatcctca cctagaacag attatgtgcg cttctttgtg 90600 

ccccccatgg ccccctgtgg gtttgcttca cggattataa ctgcctgact acctgccttt 90660 

ctccaccctc tagactgaga acaccttgag aaaaagaaca catctatctt gtctgtcatt 90720 

gaatccctgg tgtctggcac catggctgac acataactat cactcagtga ctagtgtttt 90780 

aatgaatgaa tgagtgcaac tagacagggt taagaacaaa- agagaagacc aggcatggtg 90840 

cttcacgcct gttatcccag cattttggga ggctgaggcg ggcagatcac ccgaggtcag 90900 

gagttcaaga ccagcctgac atggtgaaac cccgtctcta ctaaaaagac aaaaattagt 90960 

gggggatggt ggcacacgcc tgtaatccca gctacttggg aggctgaggc aggagaatct 91020 

cttgaaccca ggaggtgcag gctgcaatga tctgagatca caccactgca ctccagcctg 91080 

ggcaacagag taagactcta tcacaaaaaa aaaaaaaaaa aaaaaaaaaa gagcgagaga 91140 

agatgtcatg gggtaaatga agacctccct tcctggttcc ctgaccagcc cctgccctcc 91200 

cccgcatctc acctgtcttt cttgcttcct tctggtactt ctgtttaagc cggtccatga 91260 

gcaggccatt ccagggggca cacagcactc cgaactgagt gaaggcaaag gcatttgtgt 91320 

aggtgctgac tgcagaagga agagagaggt tggttgatga gaagtttcca aaactccctt 91380 

ccaggcaggg actctcccac cttacccttg tctgcatgtc cctcctcccc acaccatcag 9144 0 

accctcctct ggtgtgtaca gccctgctgg gaggctctgt gttcccagct gggacatgca 91500 

gatgggctac ctcccagccc taccacatac ctcgtgccat gtccccaccg gccatgttgg 91560 

tcagcaagga gttgagagtg ccaatgaaga ggtagtgcca caactgtatc acagacagcc 91620 

acaccaggtg ccaggcaaag cgccgagaga aagcgtagct ccagaaggag cggagttcct 91680 

gcttctgccc tgcccctggg gtctctaatg gggagaggag gatctgggcg tgaattacga 9174 0 

ggaaagtgga caggtaggat ggggagtgtg gaggcttcaa tggaacattt cagatccggg 91800 

cccaccttct acccttggct cccagaactc acccggcgtg ctctgcacct gcctcggact 91860 

cacacatccc cacctccctg ctttgtcatg ctggccctac caccttggat gaccctctgt 91920 

gttcttctct attgaaatcc gatctgtctc tcacagcctg gtcaatgaca cttcttgcag 91980 

taataccttc ctgatctttc tcagcgagaa aggtgaaagg aacgacaagg agaggagaaa 9204 0 

gtcagaaggg agaggagaat gagtgtggat actctgttct aacctgctcc tcagcacctc 92100 

cctttctttt gataccagta tcctgagttt ctttgggaaa tcttcctcta ccctaatctt 92160 

catggtccag atgggaccat gaattcagtg ttctgttcct ctctcaaggt taaccaatga 92220 

gatggttcct cctaacaagg cagaccggcc atgagtttag attaggatgg acttaatcta 92280 

aaataggtcc tacaccctgg caagttcaat gtcctccctt gattttggaa gcttcccaga 92340 

accctattct tctttcttta aaataaataa ataaatacat gttttggatc caattgtcag 92400 

atggtaaaaa taaaaacaaa aaaatcaatt ttattctgta tatttaagat atacaatatg 92460 

aggtcatagg atacatatag ctactaagat ggttactaca gttaagcaaa ttaacatatc 92520 

catcatctca catttctacc tgttttgtga caagagcagt taaaatctac ttgtttagga 92580 

aagtcccaaa cacaatgcag tttgatgacc tacagtcttc gtgctgtgca ttagatctct 92640 

aggcctgttc atcctgctca tctgctcctt tttgtccttc gacctgcatc tcccatctcc 92700 

tctcccaccc cgttcttatt tctactgtag ctagctgcgg tttgtgatgt gtgtaaccaa 92760 

agacgcagaa cagagaggaa ggaaaggaag cagtgataga gttgggacaa taagagaggg 92820 

cggacccagg agacctggag aaatgggggc actgtaccag acttagtgca atggcatcac 9288 0 

agaagagggc agaaccgagg agtgggggga agggaaggca acccatggca ggcgggcttc 92940 

aaggggtggg gaagtgatag gatgcgaaat agagaaaaga gggacagaaa agagacgaaa 93000 

gccctggacc ctccattaag tgagagggtt gggaagatgc ctaaggccct ttttctgtcc 93060 
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tgcctttcct 
aaggagaggc 
tgaaaggaac 
ctttgtggtg 
aggactgtga 
ccaccgcact 
ttaaatcaga 
ctagtgattc 
ttcaaatccc 
gagcctcatt 
gatgaaatca 
ttcctctact 
atttccccca 
ccttgtgaac 
aaatctcaag 
atgagcccac 
ccaaacctat 
atcagatgta 
agccagctac 
tttgagatag 
tcctgtgata 
cccacaatca 
gtgcccccgg 
gaagatgaag 
gaaaaagtca 
aagcaaacta 
atggaatatt 
tcccagcact 
catagtgaga 
catacacctg 
gtttgaggct 
gaccttgcct 
catggatgaa 
tagtgcttta 
gggaaacagc 
gtgcacgcct 
tacatgcctg 
agttccagac 
agtggggcat 
tcgcttgaac 
ctgggcaaca 
tgttatgtat 
gttttctaag 
cctggtacac 
acgacccatc 
cagtgccttt 
aaattcatct 
cttggtgact 
accaccgacc 
gtgcagtgaa 
gaattcctgt 
catggtggct 
aggtcaagag 
caaaaataag 
ggagaattgc 
tccagcctgg 
aggaaagaaa 
gtgactctta 
ccctccaaac 
agagatcctc 
ctaagcccag 
cactcagcca 
catggttgcc 
ttgtttttac 



gattctgggt 
ttgggtctgg 
tcctttgact 
ccattcccag 
gccgaacctg 
ttggaatcac 
atgtctggag 
taataagcag 
agctctgcct 
ttttgcattt 
gagagcacat 
ccttgtggca 
atttgaatga 
tgcttcagcc 
aacgcttgtg 
gctggcttgc 
gtgagtgagg 
tgaatgagcc 
ccacagactt 
tttgttatgc 
ggtttgcttt 
ccagatagtc 
ggcatcagga 
gaggccctga 
gtgtcacacc 
agggtccatc 
attcagcctt 
ttgggacgca 
ccctcacccc 
tagtgccagc 
gcagtgagtc 
ccaaataaaa 
ctttaagagc 
gaagaccaag 
aagacctcat 
gtaattccag 
taatcccagt 
cagcctggcc 
ggtggcacgc 
ccgggaggcg 
gagcgagact 
attttattgc 
gagtcctagt 
acatgccccc 
cccacagccg 
tttcccccaa 
tacaaaacac 
ggtatgacat 
tagcccaacc 
ttgctcaata 
tcctgctccc 
catgcctgta 
ttcaagacca 
ctgactgtgg 
ttaaacccac 
gagacaaagt 
gatgaggcag 
tgccaggagt 
ctccaaagct 
ttctctgccc 
agagaagtcc 
tctgtggtgt 
acaactccag 
tctgctttca 



ccctggggga 
aggagagatg 
gtagctccct 
ggcacaggct 
agactcagac 
cttttagcca 
gtgagagcca 
caaagtttag 
cttcctagca 
agaaaatgga 
ggggtctgac 
aatgactgca 
gactgcagct 
aaaagaatgc 
catttctgca 
tggaggatga 
ccatcctagc 
ctgtcaaaat 
gtgaaaaata 
agcaatagct 
accctgtcca 
cttacccata 
ggaaagtgcg 
ggctgatgcc 
cacgttcata 
agcagatgaa 
aaaaaaggaa 
cgaggatcac 
ttctctagaa 
tactcaggag 
atgactgggc 
aaaaaaaagg 
actatgcagg 
acaggaggat 
ctctactaaa 
ctacttggga 
actttgggag 
aacatggtga 
acctgtaatc 
gaggttgcag 
ctgtctcaaa 
atacaaaaac 
acaagccatc 
acccacccca 
gtggagcctc 
cgacagcaaa 
accaatgggg 
caaagagaat 
tcctcccatt 
aattgacaga 
tttccccttc 
atttcagcac 
gcctggccaa 
tggcacgcct 
taggcagaga 
gagactccaa 
ccatctgggt 
agaaaggctt 
tatgggacta 
ccataacctg 
caggttggat 
gctggcttta 
ctatcacatc 
ccttgcgtca 



gctctggagg 
ggttagccag 
gttttcatgc 
atggaaacaa 
tggagggaat 
ctctgatgcc 
ggcttccttt 
gaagcatgaa 
acagaaagat 
gataaggata 
aattagtaag 
aaaatgatca 
ctatttcccc 
agcagaagtg 
ctctttcaga 
cagccacgtg 
atagccagcc 
catctacatc 
ataaatgctt 
aacagatgca 
tcccacccta 
gctgtagttg 
tgctacatgc 
tttttcataa 
gcagcactat 
cagctaaaca 
agaggcaacc 
ctgagcccag 
aatttttatt 
gctgagtggg 
cactgcaccc 
aaaggaaaga 
gccaagctca 
tgcttgagtc 
aataaataaa 
ggctgaggtg 
gctgaggcaa 
aaccccgtct 
ccagctactc 
tgagccaaga 
aaaaaaaaaa 
atcagcagaa 
acctactatc 
agacacaccc 
ctcgtgtggc 
ggccttttgt 
agcttgccag 
ccctaagttc 
ttacagagag 
gtaagcagca 
caactctaga 
tttgggaggt 
catggcgaaa 
atagtcccag 
ttgcagtgag 
ctcaataata 
tctccagggg 
gagtgcctca 
agctagactc 
caggcagtgc 
ggcttgagat 
gtcctccagc 
ctcacaccac 
gggaagaaaa 



tgaggggcca 
cagggctcac 
tcagctgttt 
aagccccacc 
agcatggtga 
caggttgcag 
tctaagatct 
aagagtaggg 
ggctcagact 
tctcagagga 
tgagcagcaa 
catttcttca 
atgccctgaa 
gctgtgccaa 
accctgaaat 
acccaggcat 
cccatgtaat 
tggccctgat 
aacattttag 
ctgctccagt 
gggccaatga 
gggggcagtg 
caggtactgc 
agaagctgca 
tcacaatagc 
taatgtgatc 
atgctggctc 
ttcaagacca 
taattagctg 
aggatttctt 
cagcctggac 
aattctgaca 
gtggttcctg 
caggagcttg 
taaatcagct 
agaggatatg 
gcagatcacc 
ctactgaaaa 
gggtggttaa 
tcctgccacc 
ggttaagata 
gaggcagggg 
ctgtaagctg 
ggcagtagag 
tccccagaga 
tcaaagaaaa 
tcatctccct 
ctcaacagct 
tgacgttgag 
gcaaagtcag 
gagacaggag 
caaggaaggc 
ccccatctct 
ctactcagga 
ccaagatccc 
aaaaaaaaaa 
aaggagggag 
ggggctcagt 
atgtctgggt 
caactgcctg 
ccacactctt 
ttgctgcctc 
aacattcaat 
gtccccttga 



ggaaaggcac 
cttccttcgc 
ccttctcttc 
agcaaggcca 
atcccacatt 
accagaccag 
ctatgtgaat 
caggcccagg 
taacccttct 
ttattgtgag 
aggaatgccc 
cctcctctgt 
tctgggccag 
ttccaagctt 
cacggtgtga 
ccctgtcact 
cctccaaatg 
cagcggaact 
gctgctgagt 
cctcctcctc 
ggggctctgg 
ggtatgggat 
agacagagat 
gaaggagaag 
caaaggatgg 
tatacacaca 
acacctacaa 
gccttgacaa 
ggtgtggtgg 
gagcccagaa 
aatgaaacat 
catgctgcaa 
cctgtaattc 
agaccagcct 
gggcgtgatg 
attacatgat 
tgaggccagg 
tacaaaaatt 
ggcaggagaa 
gcactccagc 
gtaaatttta 
ctggaaccct 
attagggaca 
gagtcctcat 
tcttctagcc 
ttttacacaa 
ctttattctc 
cagtttgaaa 
gtccagagag 
attaaactaa 
agaggctggg 
ggattacttg 
actaaaaata 
ggctgaggca 
accaatgcac 
aaaaagagag 
aacccagaaa 
ctctgcataa 
ggtgactgcc 
tgacctaaca 
cccttccttt 
ataattgaag 
gaggaagact 
atcttccact 



93120 
93180 
93240 
93300 
93360 
93420 
93480 
93540 
93600 
93660 
93720 
93780 
93840 
93900 
93960 
94020 
94080 
94140 
94200 
94260 
94320 
94380 
94440 
94500 
94560 
94620 
94680 
94740 
94800 
94860 
94920 
94980 
95040 
95100 
95160 
95220 
95280 
95340 
95400 
95460 
95520 
95580 
95640 
95700 
95760 
95820 
95880 
95940 
96000 
96060 
96120 
96180 
96240 
96300 
96360 
96420 
96480 
96540 
96600 
96660 
96720 
96780 
96840 
96900 
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atatacactc cctttatctc attaaaaagg actggatcat atgctgacct ccacctatca 96960 

ctagcaacgg gtaaatggat tgccatggtt ggctttaatc aatcaggatt catcccctgg 97020 

gctaagcggg tcactgccca gataaaactg ttcgcaatga ataagacaga atggttgttg 97080 

attgacctct aatagccttg gcaacagttc atcccctgat accccaacat cagccactgg 97140 

gacagctgga caagcctctg tgtctgcccc tgctgtaccc actagccact tgccaccttc 97200 

ttgtccaaac tagaagctca cagcagcaaa cgccccactc taaaggtccc ccagcctcta 97260 

cccaacactg gcccaagcac attatgacca ctgccacaaa agcttgggca agtctgaaga 97320 

aggggcttag cggttacaag ctcaggctct agaaccgaca agcctgggtt caagttccag 97380 

tatcatggct actagctgca gaaccttcaa caagcttttt aacctcagag actcaaatgc 97440 

ttcatctgta aaatgggggt aacacagtac ctacctcacc gagttgatgg agacaaataa 97500 

tgcaggttca caagacaagt gtctggcata tacaagtgcc cagtgaatgt aggctgttgc 97560 

tatatttacc ttaataataa ggaagactgc cgaggaagag tcaaatgctc cattgtacag 97620 

agtgatgatg gtcgaacggt gttggccaaa taggttccca atctggggat gataggacta 97680 

gcctggatca cttatttatt catgaaacag atacttcctg agcacccagc atgtggcaga 97740 

ccctccttat acccaaactc accctccacc gctagagctc ccacctcagc ttgggccaac 97800 

cccatctgag gcagccaatt atagaaaagg gtctctcctt ccctccacct tcccgccacc 97860 

ctgccgagtg cctgggatta gggaaggctc ccacctgcag gttggtgatg agaaacagga 97920 

ttcccccaat ggtgagcatt ggcatggcca ggaagagcag cacggctgag cctggaccat . 97980 

caaagtcaga ggtagggtgg tgtcatagtg caacccaaac atggagcccc aaactctgct 98040 

cccacctgct ccaaattccc aacaatcctg gtatccaggc cccaattcta gccagcgttc 98100 

cagcgtcctt caagggtttt taggataccg gccaaggctt ccccagatat ctctgtggaa 98160 

gtcttctgag ccaccttctt cccaaccaaa gttggtcctc agtctgtggc aggccaggaa 98220 

gtctacagac agaggcagag ctctaagtga agccacctct ctcttccctc agtaaaccac 98280 

aagctgcctc tccctttcat ccttgacact cctggaaaag aagaccctgg actcaggtcc 98340 

ctggctcaac cctctagccc attccctaat tcatggtatt ggccttgagc ttcaatcatc 98400 

tgtttaatgg gaacaacagt tcctgctctt cctgtctcag gtgctatgag aactgagtga 98460 

gaaaaggacc atggtctttt ctttgttcac taaactctga gcacttcttt ggtgccaggc 98520 

attgtgcttg gcactggaaa tgcaagatga atcagatagt ccttgccctt aaatagactg 98580 

acatgcaaac aaatggttat aacaggtctg gtaagtgtga gaccacagca aaaaagctca 98640 

agagctgggc taggggaacc cttgacaaat tcttcctccc caaaccagac ttctgcccac 98700 

cattattctg gccacaacct atgcctgtcc tattatttgc taaaatgttt taagttgact 98760 

cacttttatc caaaaagtat ctatttttaa aggacacttt atatcactac tgtagatgaa 98820 

aacactggca ttacttgtca tgaatagaaa gtaactgtca aaataaatac aatgaaagga 98880 

aaacaatgtt attcaattgt agctggatgc atttgacctt agaatgttca aagcctaaga 98940 

cctgctcttc ccatcagtgt taaaatcaca ctggccccac atgaagacat tctttcatga 99000 

aatcagaagg actgaaagag aaataaaaag ggaatagctg ttctaccagg tgatttgatg 99060 

tttgttagtg tagttcacgt agtatgcgtg tgcccctaac atcctcttaa ctaccgtgct 99120 

ataccttaag aagcactgcc aagagctaat tttagagtat tcacacagtt taccattcaa 99180 

tttctgtctt tataaaatgt acatctctcc tactactaaa ggttggagac tcctttcaca 99240 

atagagtcct tatgggctca atgctttttt caaaactgaa aagccctata ttatggagga 99300 

agaggaggat tgttgctcag acgatttgca ggcacgagtc aaacattacc cagccaccac 99360 

ctccacattc agttgcttaa aaatcattta caggctttta gagtagatga tgctggtttg 99420 

ataaggagag tggtttgaaa taattggttt gaggtgctgg gccatctcat gagatctgtg 99480 

tgaacaaaga cactcagcct ctgtgtttgc ccagcatgag tgcagacaat ctcatgatgc 99540 

tgtcagcttt agcatagctt acacacacaa gagtaatgta ctttctttcc taaaccaaaa 99600 

attgagccac gggtctaaca ctaggaagga atattgggag gcatctcgtg gccaccataa 99660 

ccaaggcaat gacagaaaga agagtgaggg atcaggaggc ctgcacatca ggcccacctc 99720 

ccacttgctt tctctgtggc catggacatg tctttgcaag gggtcctgct gtggcttcag 99780 

tttctcctct gtgtaatggg tggaagggtg gtggaaaata aaccagattg gagttccaga 99840 

cttaacagac tggtgaaatt ttaaaacaaa gattttgagt acaatagggt tgtcaacttt 99900 

taccctgcta agtaaggata tttgcaaaat ggtcattcat ataatcattt cattaaaaag 99960 

agaaagagaa cattttaaca cataggagaa ggatgtaaag gttttttgtt gttgttttgt 100020 

tttttagggt tttttgtttg ttttttggta gagtctcact ttgtcaccca ggctggagtg 100080 

caatggtgtg atcttggctc actgcaacct ctgcctcctg ggttcaggca attctcctgc 100140 

ctcagcctcc tgagtggctg ggattacagg cgcgcaccac catccagcta atttttgtat 100200 

tttttttagt agagaaggga tttcaccatg ttggccaggc tggtcttgaa ctcctgacct 100260 

cagatgatcc acccgcctca gcctcccaaa gtgctgggat tacaggagtg agccactgca 100320 

cctggccaga tgtaaagttt tgaataaatt ctactctctg aagtaatccc tctccatcat 100380 

ccttgctttt cacattttct caataaactg ttttcacaga ccagcaatag ctcaagatcc 100440 

ttccaggatt ctttcaagct gcagatctct gaataaccat gtggtctgta tatcttgcct 100500 

atagccctct gctcacacct gccccagcca ccaggtgcct ctgagcttgc atccctccca 100560 

cccacctgac agcactcacc tgcagaggtg aaggctatga tgagtgtggc ggtggtgtag 100620 

aaaaatctga aacacataac aggaaaagca gaatattgtc aaggagggag aaacctggga 100680 

gaaaaaacat gattctgctc agccagccca caagtsgtagg acttgaccgc accctcagcc 100740 
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gcccagctaa 
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cctgggaggc 
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tgattctggg 
gcatcggcat 
atagtagcat 
tgctcaatat 
cattttggga 
catcatggtg 
gcgcctgtaa 
cagatgttgc 
ctctgtctca 
gtaggtggat 
cttcagcatc 
ttttgaacat 
agaggatatg 



agatggggtt 

ctgccttggc 

atgctcttaa 

catgagttct 

gtaattcttt 

tctgtgtctc 

aagaggaaat 

cattgagagc 

tgaacactgg 

ttccatggtg 

tggggtggga 

gatactcaag 

actgcggact 

ctctggagcc 

acctctgtga 

tgtacagagg 

gctgcacggg 

ggaagctgaa 

tcttatagat 

ccactctcaa 

aacacagact 

agcaggcctc 

ataatgacaa 

taagagctat 

gaggtactat 

aacttggcca 

ccatgttttc 

ggaatcctgt 

tgatagtata' 

tccctcccct 

cacactaaac 

cactgagaaa 

tctttcagga 

tacaaagtca 

tataaagcct 

ctctcctttg 

catttatcat 

atttaataga 

atactagttg 

cttccttata 

ggtttgaatc 

agtttcccct 

ttcaatgagg 

nnnnnnnnnn 

nnnnnnnnnn 

gagacagggt 

cccgcctcgg 

acacctttta 

tgtgggtgaa 

cccacactcc 

ggcaaaccat 

cagctgttgt 

atgtgtataa 

attattaaat 

aagaacatat 

ggctgagatg 

aaacccccat 

tcccagctac 

agtaagccga 

ataaaaaaaa 

gttctaaaat 

atatgaaact 

ttctagaata 

cattttacat 



tcgccatgtt 
cccacaaagt 
tctgtgccct 
ggaatcgggt 
cagacagcac 
cctgccctgc 
ggatccagga 
tttgcccaac 
acttttgttg 
ctggaggatg 
ggggaacctc 
gcatggcatt 
gtagacagag 
ccctgggtgg 
agcgagtgct 
ctgacattaa 
agagggcaga 
gtggcctgag 
tgtacatgca 
ggaaagagcc 
caatccttat 
aaatgtgtgc 
cagtaatgac 
atgtaatata 
cattatgcca 
agtcactcag 
accattatgc 
cttgatgttc 
tcttttcata 
acccctccct 
aaactcataa 
tgaaaccacc 
aggacaggct 
gagggaaggg 
ccatcctgcc 
gacagtgctg 
gattgtggaa 
actgaaaggc 
tgttgccctt 
atggagggac 
ctggctctgt 
tctctgaatt 
gaatacacat 
nnnnnnnnnn 
nnnnnnnnnn 
tttgccatgt 
cctcccaaag 
aacaaccaga 
ggtggcactg 
tcaaattagc 
tccccagccc 
tacccgtgtc 
ttattatatc 
tgttctgtac 
agacctggcc 
ggtggatcac 
ttctactaaa 
ttgggaggct 
gatcacgcca 
aaaaggaatg 
ttgtgtaacc 
tacaaacaag 
attgcaggat 
ttaatagata 



ggccagactg 
gctaggatta 
acccagcctt 
tgatgggggt 
ccttgattaa 
tcacacagac 
gaaggagacg 
accccaaaga 
tggaaaaagg 
gggaagtggg 
cacttgccag 
agaccagaag 
cactggacaa 
gctgccccgg 
gaacctctct 
gcagggatct 
aaaggcctat 
agtgactcag 
gctctcctcg 
ctatctgcaa 
tccttttaag 
ctttctagaa 
agtcatttac 
tattattatt 
actttaaaga 
ccagtaaatg 
tgaagtacct 
ccttccccat 
ccattctttg 
gggagcttag 
gagactgatt 
cagcagatgg 
tttgggaaag 
agttgatgga 
aaggatcaga 
gatttttctg 
ggctttttgt 
aatttcccat 
tgcagagaat 
agagatggta 
tacttataag 
ggggatataa 
aaagtgctca 
nnnnnnnnnn 
nnnnnnnnnn 
tggccaagct 
tgctgggatt 
tttcattcat 
ggagaagtta 
ctgagtctcg 
tgccctctca 
ctccatgttc 
tatatttcat 
attattttat 
aggcacagtg 
ttgaggtcag 
aatacaaaaa 
gaggcaggag 
ctgcactcca 
tatagacctt 
aatctcctat 
gttgcattga 
aaactcctaa 
tttgtcaaat 
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tgtcttccaa 
tttccccaca 
gaagctgttg 
agagaggtta 
cgatagcaga 
gaatggagga 
ccaccctgaa 
agtatttgga 
gaacattata 
atataatgta 
tttggcaggc 
catagcgaga 
atgcatgctt 
gagtttgagg 
aggaagacca 
aaaagcagag 
ctgttactta 
aacatataaa 
gagataatgt 
ggaaattcat 
ctttgatctg 
aggtataggg 
accaggcagg 
cccaaagggg 
tttatcgcgc 
gcttttacaa 
tttaaaatta 
tagcgataca 
tcagctcaaa 
gtaaaattag 
gataatgttt 
gaggatgatc 
caggctggct 
actgcacact 
agggtatatg 
acaagatttt 
ctaggaggac 
gtctcaaaaa 
acgtgctcac 
tcatcacagt 
tttgttttta 
tcttcaagta 
catattctcc 
tgcaaattct 
tttctgcata 
aactgcagtt 
gcctggagct 
ttagtcagtc 
tccagagttt 
agtgctgggt 
cacacggctt 
tgtttactct 
ctgttgaagt 
ttaattgctt 
aactacaccc 
gtcactggcc 
tcccccacta 
ccatttgccc 
agtggcttca 
cagaaatgag 
ggtgcctata 
cagagtctct 
tcacggccac 
ttacccactc 



agtggtcgta 
ccctggagag 
gccgttatat 
gttagtgggt 
ggagggtgac 
cttaaaatat 
tgtgccagat 
tgggagaaat 
cattctatat 
tcagtaaaga 
cagggtggga 
ctctgtctcc 
atagtcccag 
ctgcagtgag 
tgtttctaaa 
tggtaagaga 
cccatcgtgt 
atagaagtaa 
atataaagct 
tgttgtagtg 
gatttggctc 
tggtctgagc 
tgctgataag 
ggtgtacaaa 
actgaaaaac 
aaaatgctaa 
ctaagtcata 
ttaatgactg 
ttctgcaatt 
gcaggagggc 
gtttttatct 
tggctgtgac 
gggcaggtgt 
tggtcaaaga 
agcacctgtt 
attgtgcatg 
tgcttgagcc 
aaagaaagaa 
ttctctcttt 
gagcacaggc 
cagaactcat 
aaaggaaaac 
agactttgtc 
tttatcctga 
ttctttataa 
ttcttaacca 
tttttttgtc 
aaaaagtaat 
tgcacactgc 
gatggtgttt 
tcatttctat 
ctgcataccc 
cttgaggctt 
ccagccagat 
tggacagatt 
tactgtgttt 
ccagtagcag 
cacctcctac 
acccacagaa 
aaaagcgttg 
aggaaatggg 
tgccccatga 
tctaggcagg 
tgctgaacac 



ccaattaaca 
atgaaaaatt 
aaccctcgtc 
gcaaacatac 
tatagttaac 
tccaacacat 
cttgtttgtt 
gataaatgct 
atgtaacaaa 
gagggctggg 
ggatagcttg 
acaaaaaata 
ctacttggga 
cctacgactg 
agaaataaat 
aaggactttg 
gataagcaaa 
aaatcatacc 
tctggaataa 
ggcagccttc 
aggcaagacc 
ttccccagca 
accggaatgg 
acgatctcgt 
aaaaccaaca 
agtaggcatt 
gtacataatg 
aagttcttta 
acacaggctg 
taacctcgct 
cacagagttg 
acctgtcacc 
cccctttctc 
ggaagacctt 
ctgtcctgcc 
gtagcatgag 
caggagtttg 
agaaagaaaa 
ggaatgtagt 
taaagcatct 
tatgttgtga 
cacccatcat 
aacatgaata 
cttttataaa 
ttatcatttt 
tttcactgtc 
tattgcatta 
aatattttca 
tcccagagat 
ataaacttct 
ctctttgatc 
tcttgggtga 
gttttataaa 
ttggccactg 
aaattaattg 
gatcctatag 
gaggctccat 
aataccccac 
tttaagggga 
ttcacatggg 
ggagggttgg 
tagagggttc 
gcttcccaat 
agatgttacc 



ccccgacctg 
tatgggccca 
attaataagc 
aattagatag 
aacaatgtat 
agaaataata 
cttggaagct 
tgaggtgata 
atatcacacg 
cacggtggct 
aggccaggag 
aaaataaaaa 
ggctgatgca 
caccactgca 
taaataaaat 
gggctcaaca 
tgccttaacc 
cacttcaagg 
tgcctggcac 
tgaatctgtg 
tggggaaggg 
gagtgaggct 
gaggctggag 
atgactcctt 
tatttaatga 
cacatgttta 
tgtgagccac 
aacatcaaca 
gggttgaaac 
gaatctcaga 
ttgtgaagat 
ccactgatct 
cctcaccact 
tcctgataga 
agaatctccg 
cctgtaatcc 
agaccagcct 
gaataataat 
aagtgtacct 
tgactttatt 
aaataatttt 
tctcccaacc 
cttactttac 
caaatatgat 
gtggctgcat 
tggggaaatg 
tattcttaag 
aggctcttgt 
gtaggaacac 
gctaatttaa 
attaacaggt 
agtagtcatc 
tgtgagcgag 
aggctcctga 
gagaaaatgg 
tgggggcctg 
gtgtccccca 
atacatgtac 
aaggaattgt 
atcacctaat 
agagttgtgc 
tcaaattaag 
gcttccccaa 
catagcacct 



taatgaatga 
ctttggagtg 
ctgggggtgg 
aagtaataag 
tgtatatttc 
aatgcttgtc 
aagcagggtt 
gatatcctaa 
tatcccataa 
cacatctgta 
ttcaagatca 
cgaattagcc 
ggaggattgc 
ctctccagcc 
aaataaaaat 
gtactagcct 
cctgtgtgcc 
gtcattataa 
acagtaggag 
tcctctttgt 
cagagactga 
gggaaaggtc 
cataaggcag 
tatactgtta 
atgattccaa 
aaaattgagt 
agctatcccc 
tacaatgcca 
ccagcttttt 
tgtctagtct 
tcaataaaat 
ccagagttga 
ccgcatgcat 
ggaggaccat 
aaggagctct 
cagctactca 
gcgcaacata 
agtaataaat 
aataaatgtg 
ctataagcaa 
ccaacattaa 
ttcaataatt 
atggtcgcaa 
gttataaacc 
aatattgcat 
gaggataatg 
atcaaatccc 
tatattttac 
agacgtcatc 
taagtatgaa 
tgaactattt 
cacttccttc 
cacttcagag 
gcaggggaat 
gctgagaggc 
aactggggca 
tattagagct 
tcactctccc 
tctgtcctgt 
gaagggatgc 
aaaatgcaac 
agagtctaca 
ccccacctcc 
tgcaccatga 



gagtgccttt 
catggtggag 
ggggggagaa 
ttctaatgtt 
aaaatagcta 
tgcggccatg 
gaacctggtt 
ataccctgtc 
atatgtacaa 
atcccagcaa 
gcctgggcaa 
aggcgtggtg 
ttgagcccag 
taggcaacag 
aaaaagactg 
tgaaccctgg 
tcactttctt 
aaagccaata 
tttaataact 
ccactaatgg 
gggcaactgg 
tgggagacag 
ttcagttttt 
atgttttcat 
ggggattctt 
tgatttaaat 
aaaatcatga 
attccagaat 
tgctaactgt 
gtaaattgaa 
cacaacatgt 
ttcggctgat 
tcctcccgaa 
tcttcagtca 
cagtaaaatc 
ggaggccgag 
gtgagaccct 
cacctgtgca 
atcattgtaa 
taaaagagga 
caaagaacat 
ttcaattttg 
tcagtgttca 
ggtctccatg 
tgactatgtt 
ccagggtcat 
agcagtgaga 
tagattgttt 
caaccttgcc 
atgctatcct 
tccaagtatt 
acctgtttat 
tcaatagaca 
gcatgatcaa 
agagatatgt 
acggcctgag 
tgcggcactt 
ttgcaaatct 
tcacttactg 
catccccaac 
agggaatcat 
gcaactaatc 
atcctagact 
ttgtttgatt 
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agcacctccc 
ctgcagggct 
tcagcaagtt 
aaacaatggg 
ggttaagact 
agcctggctt 
ccagctgccc 
acagcctggg 
cctagcctcc 
ctaggttcaa 
gccaggcccg 
cacttgagga 
aaaatacaaa 
tgaggcagga 
attgcactcc 
gcagcttccc 
atgcacatga 
ccccttggcc 
agcccacaaa 
ctaagtgtgg 
cttcttgccc 
acagggctca 
ctgtttctcc 
atcagccagc 
tataaataat 
aggaggatgg 
aaaaatacaa 
gctgaggcag 
ccattgcatt 
catacataca 
accctaaaaa 
aatttaggtt 
ccaggcagaa 
gtccctccac 
gcctggtgac 
aatctgattt 
tgggatctag 
ctggatgata 
ccttcaatga 
tcggaggcag 
cacagtgggc 
tagtacacgt 
cactccttat 
tgggaggcca 
tagtgagacc 
agctacttgg 
agccaggatc 
gtctcaaaat 
caatgcctat 
accaaataca 
aaacaactct 
ttatctatgt 
agtcaccagg 
tttattgatt 
tcgacagggc 
ttgctgggat 
actggggttt 
gagccagggc 
acactaaccg 
ggaggttctg 
cattgagagt 
gtgccaataa 
atggggtggg 
tctccagaga 



acagtagact 
gtgccaggtg 
cagagtcaag 
actcaacttt 
ggtgtggagc 
tgcagctttt 
ctaacacctt 
aggcaccacc 
caactgcaag 
atttagaatg 
gtggctcacg 
cagaagttcg 
attagccagg 
gaattgcttg 
agcctgggca 
tccacttccc 
atatccaacc 
accctttggc 
gggccgggaa 
tctagaagag 
ccatggaggg 
gggaaggact 
atctgtaaga 
acacgtaagt 
aggccaggcg 
atcacaaggt 
aaattggccg 
gagaattgct 
ccagcctggg 
tacatacata 
tgagacaggg 
ttcttgcctg 
caaggtgtgg 
ctcccagcac 
cagtgggaga 
tcccccaggg 
ctgagaggct 
cgaagatgta 
tgggacactg 
agaccaggct 
agccctagca 
gggtaggaag 
gagaccacag 
aggcaggcag 
cccacctcta 
gaggctgagg 
atgccactgc 
aaataaataa 
ttctccaggt 
gggtcagcta 
atgagctggg 
atttatctat 
gatggcctca 
gattgattga 
tggtcttgaa 
taccaacacg 
gaaaaggtaa 
acaaacccat 
ccaagtgttc 
tggactcact 
agctgcgctg 
acccctcctt 
cgggggcggg 
tcttggcttc 



gtgtttctga 
cacaaaataa 
tgggataggg 
ctaaccaaga 
ttcattaaag 
aagtcatgta 
gcaggggcag 
ccactagtgc 
catcaatctt 
atccagctcc 
cctgtaatcc 
agaccagcct 
catggtggta 
aacccaggag 
acaagagcga 
aaccacagct 
aacatgtcta 
catgggttca 
gtccacttgg 
agatgcagct 
aaactggcta 
ctttttgacc 
aggggacatt 
tcatgacatg 
tggtggctca 
caggagtttg 
ggtgtggtgg 
tgaacccggg 
tgacagagca 
catacataca 
aaagagaaaa 
ttttagaaag 
aactcactgt 
agtaggcagc 
gctgaggagc 
cccacatcac 
ccatttttgg 
gctttgcagg 
aagcccacag 
agaactcagg 
agtgccaggt 
tcatatctaa 
gtgctgggtg 
attgcttgat 
ccaaaattag 
tgggaggact 
acaccagcct 
atgacagcag 
agtcactagg 
ctctgttatg 
aacaagtatc 
ctattcattt 
aacttgagct 
ttgattgact 
ctcctagtct 
agccaccatg 
gcgacttgcc 
caaagcctgt 
tacacagtga 
cggtgggtat 
ttagtaagtg 
ctacctggtc 
tgtcctcctc 
ccgtggggct 



taggtcagca 

acaaagccaa 

aggctctctt 

gaactccagg 

aagaaaagat 

acctttgatg 

ggctggagtg 

aagccgggca 

gcacttggaa 

ttgaagttct 

cagcactttg 

gaccaacatg 

cacgcctgta 

gcagaggttg 

aactccatct 

ccatctcaga 

aggcccaacc 

tgcactggca 

gctttttgag 

tctgggaggc 

gatgagggcc 

agatctaaga 

aatagactct 

aagcaagggc 

cacttgtaac 

agatcagcct 

cgtgcacctg- 

aggcagagtt 

agactgcatc 

tacatacata 

acatgttctg 

ggcctggaca 

ggaagggttc 

acgtgtctcc 

ccagggtggg 

gtgcccagag 

tagcttctag 

actctctaga 

aggagaggtc 

acttttaatt 

agggtggaac 

gaactgaccc 

cagtggttca 

tccaggagtt 

ccaggcgtgg 

gcttgagcct 

ggatgacaga 

atcatcattt 

ataaaagtaa 

ttctttcatg 

gtccttcttc 

atttatttat 

aggaactagt 

ggttaatttt 

taagcaatcc 

cccagcccct 

caaggtcccc 

gctctcgccc 

aggtgacaaa 

gcccagaggg 

agaactcgga 

tcctttccaa 

cttctcaggg 

gcagatccac 



acatttgctg 
agacaacatg 
cactggaagg 
gagctaaaat 
tcacccagac 
aagttatgtg 
caaagggagg 
acctctgccc 
aggaacctca 
atacagaaat 
ggaggctgag 
gtgaaacccc 
atcccagcta 
aagtgagcca 
caaaaaaaaa 
caacaagggg 
acaccctctc 
gaaaggtagt 
attccagggt 
acattccttg 
aaagcagagc 
gcagcactac 
ccccgctaga 
ttaatatata 
cccagcactt 
ggtgaaacct 
tagtcccagc 
tgcagtgagc 
tcaaaataaa 
catacaatac 
acaaccttgc 
ggagccctgt 
tgggtgacaa 
attgactggc 
gtctgaagga 
ctgggaaagt 
tttgggagtc 
acatggagtc 
tgtcccagtt 
tggaccagga 
tgtgaaggtc 
ccagacctgg 
cacctgcaat 
cgagaccagc 
tggtgtctgc 
gggaggcgga 
gtgagacaga 
ttctttctgc 
aaataatatt 
ctttgtttct 
ctcccatctt 
tttgagacaa 
gtcaccaccc 
gaggcagggg 
gcccgcctca 
cccatcttct 
tccctagcta 
attgagccac 
gaggtgaagg 
aagggggatc 
agtccagact 
agccagctgt 
aaaatccgac 
ctagagccac 



agcacctact 
gaccctgaac 
taactccaag 
tctgacttct 
ttgagttcat 
acctctccac 
cactggtacc 
ccaaggcatc 
cctttgaaat 
acagccagca 
gtgggtggat 
gtttctacta 
cttgggaggc 
agatcgtgcc 
caaaacacac 
cctcatgtcc 
caaacatctg 
tcagagaaga 
ccaggataac 
gtcttaggga 
cctctaaagc 
ctctctgagc 
gttactctac 
cccgttgtac 
tgggaggccg 
catctctacc 
tacttgggag 
caagatctca 
taaatacaca 
atggacaggg 
cctttatact 
tcccctcagg 
gtgcagcccc 
tcaggagcag 
atccctagaa 
ggaggcagca 
acagagacac 
caagatattc 
actcagccat 
ttccttttac 
atccgagggg 
ctctgccact 
cccagcactt 
ctgggaaaca 
ctgtagtccc 
ggttgcggtg 
atgacacact 
ctctagactg 
atcagcattt 
tttaagcctc 
atttatttat 
ggtccttcta 
cccaatttct 
tctcgctaag 
gcctcccaaa 
gaataggaaa 
gagagcttca 
cggacctcgt 
gaagagccag 
ttgggtggca 
catccagtct 
tctccagaca 
gctgagccca 
cagagggcgg 
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gccagcactg 

ccagtgtcca 
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tccagcagcc 

gggccccgag 
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tttaaaaggg 
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tgcactcgat 
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tctggtagga 
aagggtgtgg 
aggagaactc 
gggggagggg 
ggtgaagacc 
agactatgga 
gggaccctgg 
gggatggtga 
atggatggga 
agggactggg 
tccacgtggg 
gtggctggtg 
gacagagggt 
gctcctgtgc 
accacggtgg 
cagctctaaa 
accaccacct 
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gtctgggacc 
atcctttcct 
ctgcccccta 
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ttgatagcca 
cttcagcctt 
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ttccttgcct 
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ctgcagctct 
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ggggagagag 
ccctggagtg 
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gttacagctg 
gcagcagctg 
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aggagagtgg 
ctagggagag 
cggaaggaac 
ctgtctctcc 
gggctagcgg 
ggatcagaga 
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tccctgccaa 
tcatcacctc 
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accccacgtg 
aggggtgaga 
tgtcactgag 
acgcggagga 
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ccgcacctgt 



gagatggagg 
gccagagact 
aaagatggag 
gagaaggaat 
aaggaagagg 
cagggaccca 
ggactggaat 
ctagagaagg 
gaggggctgg 
gattgggggt 
aagccctcca 
ggccagatgg 
gaggactttg 
tgccatttat 
ctgcttattt 
cctccgtcac 
ccatcccact 
ccccagccct 
ctgtgcccct 
gggaccactc 
ctttgggtcc 
agtggcctgt 
gagtcctgga 
ccgggatcaa 
atattctgga 
ccgcaactgg 
tgaccctgaa 
gcaatttctc 
gcggaggatg 
gtccctcttt 
tcatgtttct 
ggggattatg 
aggacaaaat 
agccatcccc 
aaagggcttt 
caatgaaggc 
tcccctcccc 
ccgaggtatc 
aattcacaaa 
gcctggaggg 
gtggcaagaa 
ggaaaggcac 
ctcgctccag 
tccttgcctc 
ctttgcacgg 
ttatggagga 
gggctggggg 
ggaaagagaa 
aaagaggagt 
cctgatgcca 
tggggcagcg 
ccatcagggc 
aggatttggc 
acaggaaggg 
cactttgctg 
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gttctaagcc 
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gtggggccca 
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gaatcctggg 
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ggtatcagga 

agaggaggga 

gctgcccagg 

ttcccaaagc 

ggggtgcttt 

gaaggggagt 

ttatccggcc 

tggggtgtta 

ccgcgtcagt 

cctcagctcc 

tccctgaagc 

ccttccagag 

tgcaccagcg 

agttttcttc 

gggctgagaa 

cagtggtccc 

taccatgttc 

caggggtcat 

aagcctagga 

tagccaaaca 

tcctgtcttt 

tggccacctc 

tccaatctgg 

ggggctccct 

tgtgttgtgt 

aggtgcttct 

tgctgacttt 

cctctgtcca 

gagctgtaga 

agccctcaat 

cccctttcca 

atgaaggcca 

aaggagtggt 

agaaaagggc 

agctctctag 

ccccagcagg 

cagtgagagc 

cagtggcaag 

gcccccagca 

aggtgagccc 

gagaagtagt 

aggcaccctt 

ggggaatggg 

gtcagtccgg 

tcacactctg 

ttcttttggt 

aagtgccctg 

tgtgaccttg 

atgcttgaac 

ccaaacctca 

ctgcctggtt 

actcgggagg 
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cacccagcct 
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aaagcgagga 

ctctggagga 
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gggaagctca 
tgggaaaagc 
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atggaggtag 
aatggaaaga 
accggcgaag 
agaggaggag 
gatgagacaa 
gcctgcctgc 
gagctcaggg 
gacagcctcc 
cggacgttgg 
cattctggca 
gcctccccag 
cacctcctca 
agcccgttgg 
cgagaggcct 
actctgccct 
tcctcaagtt 
ctttgtttct 
tcagtgaaca 
tggcctctct 
ctcttcttga 
ggtgaaagtt 
gactcagcat 
acctcaaggg 
ttaaagaaag 
gttccacctt 
tccttcatct 
tcaatgtata 
tggtttcctc 
agaggagggg 
tgaagtagtg 
gacctggtca 
ggtgttacct 
gctctgatac 
ggagtggagc 
ggtgctgagg 
acacactggg 
tcccccaacc 
cctggggcag 
cgctgcgtga 
ggccagaaat 
ccgggtcgct 
tggttggggg 
ctaatgtggg 
gccagctcct 
ccttgcctct 
aggctggggg 
gtgagcttgt 
gtcaagtcct 
gctgggcatc 
ggtaagtcgc 
cagaggaatc 
accctcatga 
cagatgaggg 
agaaagtaat 
tggtgtgaac 
acacacgcac 
agcaccgacg 
gcagcacact 
ggaacaccag 



ctactgattg 
tcagggggag 
aaggaggagg 
gtggagggaa 
tcaggggagg 
cgggaaagag 
tcagaaacca 
agggttggag 
gaagagaaag 
agaggcttct 
cacatttctt 
ccctgggggt 
gagggtgggc 
attctgcagc 
gagtgagaag 
gcccctgcgt 
gcccctgccc 
gtgtggagcc 
ctgctgcctt 
gtgggtgggt 
ccttcttcta 
gagccttggt 
gatacttttg 
tggctccctc 
ctcccacatg 
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atcagagtgt 
gtactagctg 
gcaaattggg 
aagaagttaa 
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cacactcggg 

gagcacctgg 

gcaacctcag 

gccttggaga 

cccaaggagc 

agcctaggac 

cctggctttg 

aggagtcacc 

tatcagctat 

tatgagtaca 

catcagtatc 
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gccaaggtgg 

gaccccatgc 

tgtcacccag 

gttcacacca 
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aggatggtct 
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ggcggccccc 
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acccttcctg 
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aagttgcgct 
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gcctgtcagt 

ctcacgccac 
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agtgacggca 
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gctttggctg 
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ttatcgtgtc 

aaattttttt 
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gctgcaataa 
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cagagtgcag 
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ctcccttttt 

aagaaagtgt 

aaggccaggg 

ctccactctc 

tggatcatat 

gatgcggtag 

tgaggccagg 

tttttttttt 

agtggtgcga 

tcagcctccc 

tttgtatttt 

acctcgtgat 

cgcccggcca 

gctcctcagg 

gctgtgatcg 

aaaaaaagaa 

gattcccagg 

cttagagacc 

gaaaagtaca 

gtgagtccat 

ctccaatggg 

tgtcacaact 

aaatgcctta 

gaggcagaga 

caatagtgaa 

caatggcctt 

ggccacttga 

aggcctttat 

gctggagtgc 

tcctcctgcc 

aatttttttt 

aactcctggc 

tgagccacca 

tggcactcct 

cctcggcctg 

tgctctgcac 
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aaggcctttg 

gagagagaag 

tcactgggca 
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gatgctctga 

aattattgtt 

cccaggaagc 

gagcctccgg 



cacacctggc 

cacagataca 

aaaaccacca 

acccctcagt 
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ttcagtcact 
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agaggtcaag 

atgatgggag 

tagcccccag 
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cttatgcctg 

agtttgagac 
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tctcggctca 

aagtagctgg 
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ccacccgcct 

aaaattttta 
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agaagttttt 

ctgtcccccc 

cgacccttaa 

ggaaagtaag 

ttcacacggg 

gtgactttgc 

ggaggcaggg 

tatagggctg 

aaccctgacg 

aacaggaatg 

ttggcattat 

ggctataacg 

ttttattttc 

aagggtgtga 

tcagcctccc 

tttttcttgt 

ctcaagcaat 

tattcagccg 

gccctggaag 
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gtccctgcca 
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cctgcagtac 

cccccaccct 

tggcgtgcgt 

tctgtctatg 

tctctacctg 

cttcacctct 

ctcactaact 

ccactccgca 

cacgtgtttg 

cagctgtttt 

gcagagccac 

atgggacggg 

tgcaggggaa 

aaatggaagg 

tattgttgtt 

aataacacac 

ctggaagggg 



tgagcagcac 

tacacacatt 

ctaagcaagt 

ctctcccagg 

tcctcccctc 

gtgtgcccag 

aactgccaga 

ctagttccgg 

gccctgcacc 

gctctctggg 

aaagttactt 

aaatcccaac 

cagactgagc 

tttgatacag 

ctgcaagctc 

gactacaggc 

ggggtttcac 

cagactccca 

aaaaattagc 

aggaggattg 

ctccagcctg 

aagtaactgc 

aggaattctg 

ggcccctccc 

ggcactgggc 

cctcagatct 

cccccagggg 

tgctgctggc 

cccccacaac 

ttagtctttt 

aagagatgat 

tttttaatat 

ttgtcccctg 

tattttttcc 

tcgttgctca 

aagtagttgg 

aaagacaggg 

cctcctgcct 

ggtctaggcc 

cccacctagt 

atgctcctgg 

tccccgccca 

cccagcactc 

tttgggtcca 

ggcactttcc 

cgctcgctgg 

cgctgccagc 

ctctacctcc 

cagccccttt 

ccctcctctc 

tctcttttct 

tttaggtccc 

ctgcctctgc 

tgcctccccc 

tacaccccca 

ccttcagggc 
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agccactggc 

agaagcccac 
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aataccagta 

atgctgactg 
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gacattaatc 

cattcttcaa 
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ctacagcctc 

gactacaggc 

atctccctta 

tggcctccca 

ttttaccaag 

aagttctgct 

ccctgcccct 

ccgtgagctg 

agcgactctt 

acctgctcac 

gccacttgca 

agcccgacac 

tcagcagcct 

aggagaacag 

ctggtttcct 

tccccaggcc 

ctcttacatt 

ttgctgttcc 

ctgtctgtct 

atctgtcttc 

tgtctgtctg 

actctgcctt 
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ccctctccct 

ctagttatca 

caatagccat 
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gagggaaaga 
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gccacagtgc 
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gtgcttggca 
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tcatggtctg 
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gggcctgagc 
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tcagtcatta 
tgagacagtc 
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atggctttgt 
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cagcccaatg 
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aggaacatga 



tcctctgcct 

ggtcagtgct 

gagcacccac 

ctgccttcac 

ggaaaaatat 

ccgccaccac 

actctgccat 

tcatctgtac 

gcatgaataa 

tgcttgttaa 

aagtggccca 

gtcatgagtg 

tctcatttct 

tgaggctggg 

caaccattgt 

caacctggtg 

acacacccag 

tggagtacaa 

ggttcaaatc 

tccatttcct 

tgagacgacg 

gcaacctctg 

ttacaggcat 

ccatgttggc 

ccaaagtgct 

agaagttttg 

ttagcacata 

aaatactact 

ttagttattt 

tttctgactt 

ctgcaaactg 

gagttagtgt 

agcacttcgg 

gccaatatgg 

ggtgtctgta 

gtggaggttg 

actctgtctc 

atgttattat 

ggtcagcaaa 

aagtgtttgt 

cttcacacct 

ggtagggctg 

cgcagggtgc 

gaaactccac 

tgagttcatt 

gtgccaggga 

actcaagtgt 

agtctcgctc 

tgcctcccag 

atgtgctacc 

ggccaggctg 

gttggattac 

cagtggcctg 

tcactatgtc 

ctggggtcaa 
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tggtctcaaa 

tacaggcatg 

aagcagatca 

gcaagttatt 

accttggcga 
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aactgaagct 
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gatgatgcta 

tacgagctag 
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gtcttatatt 

ttagtgggta 
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tttgtaaagc 
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acttggactc 
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ggccctgaag 

cggtgtgacc 

atagggacag 

gaagagctgg 

ttgagcaggc 

ctggctccta 

catctctaaa 

tcccactcta 

cctcccaggc 

gtgtcactac 

caggctggtc 

gggattacag 

taaggattaa 

tgggcactat 

acaaattgat 

aggaggaatt 

atttaacttt 

ggaataatga 

atagaaagcc 

gaggccaagg 

tgaaaccctg 

atcccagcca 

cagtgagctg 

aaaaaaggaa 

gaccccatcc 

ggcttgggca 

tctctacgcc 

ggccttttat 

ggctgggaga 

aggtgtgacc 

cagaggacca 

catcaacaga 

ttcaggagag 

agccagatga 

tcttgctcag 

gttcaagcga 

atgcctggct 

gtcttgaact 

aggtgtgagc 

catggttttt 

acccagctgg 

acaattcccc 

agctaatttt 

ctcctgaact 

agccaccgta 

gatctcagtt 

taaccactct 

gcaggggttg 

agtgagtatc 
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cagcttttca 
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taaagacact 
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gtttgttttg 

agtgcagtgg 

catcttagcc 

tctattttat 

taagcaatcc 
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agagtgcagt 
ttggacccta 
agataataat 
gtagcacaga 
gccgatgatg 
cacagagtag 
aattcataaa 
tcacagatga 
aagcagctca 
aagaacagga 
cagagagctg 
ggatgacttt 
aggggcgggg 
ttttaacagt 
ggatttggca 
gaagggaggg 
agaaattctg 
ctctggaact 
tgctcactct 
tgtcctactc 
agggattcca 
tgagtcaggc 
gggtggtggc 
ggacggatct 
agggcggaag 
tgggtgtagg 
gtggcaggca 
gggaacaggc 
agacctcctg 
cagtgaggtc 
ccccagtgct 
aagtcagctg 
ccactggccg 
caaggtcctt 
gacagcgcct 
cctctggggg 
ttcccccagg 
tattgtcata 
cttaacagca 
gaatattgta 
acagctaggg 
cttcataatt 
tctttatgtg 
ccccatttta 
taatggctaa 
ctcatggaac 
ttcatttaat 
tgcccacggt 
ccagtctcca 
gccacactgg 
ctcctgaatt 
aatacacccg 
aggttagagg 
cttctgccac 
tgcctcaggc 
gcggacctgg 
gagcacgtgt 
cagggcgtgc 
aacaacagcc 
ctgcggctca 
ttccagcgcg 
ggccgagacc 
acgcggccgg 
gaggccgggg 



tgttcagagg 

ggcgagtcac 

agtatccacc 

gcttggcaca 

acgatgatac 

acactcggta 

aatcagatgc 

aaagacccag 

tttgattagt 

tgctgt^-agg 

tgaaccggcc 

agcagaggct 

ggagggctag 

ggagagagga 

acagcttgga 

gcaggacttt 

tgggccaaaa 

ttggtctact 

cccccaaccc 

ttgctgacac 

agcctacccc 

tgaggccagg 

agcagggggc 

caatgtggcc 

ccagatggga 

ctgggagtca 

tggggaggaa 

cccctgggtg 

gaacctgctg 

agggagctga 

ctcctctaaa 

atcatcagaa 

tgggccacac 

tctgcactaa 

ctaaggggag 

tggagctgat 

gagctcctct 

gtagccacca 

tttatcatcc 

catactgagg 

aaaaggcaag 

ggaccaaatc 

aaccatcctc 

cagatgagga 

caaacaaaaa 

tcatagaact 

ctctacatta 

ggcatcagtt 

tacttcacca 

cttccctggg 

agaaactctg 

aaagtttagg 

aggcagaatc 

ggtgcacccc 

ccattctctt 

ccaacctgag 

ttcgcggcct 

accgcgcggc 

tggcctcgct 

acgctaaccc 

cgcgcgtgtc 

tgcgcgcgct 

gcagccgcgc 

cgcccccagc 



tctctggagt 

ttcacttctc 

tcatagggtt 

tggtaagagc 

ccatcctaga 

cagctctgtg 

agatgggatc 

gcccagagag 

gtcagagcca 

gaagcaggca 

agtggggagg 

ctcagcccag 

ggctgtgaaa 

aggaccggat 

tggggggagg 

ttgcagagaa 

cctggggctg 

cttctgacct 

ccggcccctt 

ttcataaaaa 

ctggatgggc 

gaggtgcaag 

ggagaatggt 

aagaggaggg 

gggggcgaga 

atgccctccc 

agcgtctccc 

aactggcctg 

accacagtga 

agtcctgctt 

aaagtgagct 

tcccctggtg 

acacccgccg 

tgtggcctga 

tagtaatgca 

gcccctcacc 

tttaaagaaa 

tttattgatg 

aaccctcctt 

aacctgagac 

ctggattttg 

cctgtgtgct 

tggaatcctc 

aacaggcaca 

caaaaacaaa 

ccacaaggaa 

tacagatgag 

aatgacagat 

aaccagaagt 

gaacctgtag 

ccccccgccc 

aaccactgac 

caaatccagg 

cttccctccc 

cttctgtgcc 

ccacctcttc 

gggcagcctg 

cttccgcggc 

gcccggcgag 

ctgggcgtgc 

cagctccgac 

ccgcgaggcc 

ccgcggcaac 

cgatccctcc 



tggaagccac 

tgaggctcca 

gtgacaatta 

tcaatcagtt 

ctgatgagct 

gaatgaatga 

ttaaagatca 

gtgcttggag 

agagctggga 

gggatgctgt 

caagggaaat 

agggagggga 

gtcaagagct 

ttgaaagcta 

aggcaatgga 

acaaaaggag 

tgggtcaaag 

cccgaggtcc 

atcgatcctc 

gaggaacccc 

ctggaagaga 

gagccagctg 

ggtgtcagag 

ctcttggcac 

acaggcagga 

ccaacctgag 

caggcagtga 

agcagagtgg 

tgccctgcac 

ccctctctgg 

gggctgatgg 

agctggttat 

ccccccgctg 

ttaggtgact 

atgtggcttc 

ccaataccca 

agggacagga 

gttgactatg 

tagcctgctg 

tccatgaggt 

aactagggct 

gggcacgtgt 

agaacaaacc 

gagagatgac 

aattaaaaaa 

aggtgttcta 

gaaactgagt 

ccaagatttg 

tctgaaactc 

acatggggat 
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ctcaccaata 

atgctatgaa 
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agggggttat 
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tcgcgggggc 

ggtgaggacc 

tcccgaggcc 

gtgccccacc 

tcccgtccac 

cctccctttc 

ccccctgggc 

gccctcccca 

ggcagccgct 

cagagcgtct 

ctttggggga 

tcaacctcct 

gcacctggag 

tgtttgggaa 

acttgggacc 

ttggagaagg 

cgctcccccg 

gccaagagtg 

caaccctgag 

acagccctct 

gtcacataca 

ggcactgctc 

accaaggctg 

agcgtacaca 

aggtagaagc 

agggaccgag 

gcttgaggta 

gaagagctca 

gcacttggag 

gtggtcgctt 

ggccaagaat 

ttttaagaat 

gggaggctga 

ggcagaaacc 

ctatagtccc 

gactctgtct 

gcctagatgc 

aaatgatctt 

tagactccgg 

ccagggagct 
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gcctgggcag 

ttatcattgt 

aagaggagtg 

acccctccca 

cccgagctct 
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gcacacccac 
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ggatgttcaa 
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ttgtaattaa 

cccagccaca 

cattacctca 

atggaatctc 

aatttgcata 

taaaggtaaa 

ttctagcctt 

catactggcc 

aaaatttctc 

cagagccctt 

gcagcagaca 

accttgggaa 



gccagggcgg 

agcgagggga 

ctgcgctctc 

acctctgact 

ccggggctgc 

ccctcccagc 

tgtcctgact 

gttctggcca 

gacccgcact 

tccaccagca 

tgcctcagtc 

gacactggag 

agaactttcc 

aaaaaattta 

ccgaaagcgg 

ggagccgtcc 

ccatgaggag 

ggtgctggtg 

tggtctggcg 

tccagaccct 

cactcctggt 

cctcagggaa 

ggtttccgcg 

cctgcacaca 

catcagggat 

gaactcgggt 

caatcagaac 

ggctaagaag 

agaagtttgg 

gcacctgtaa 

ttgagactag 

ctagtttgga 

ggtggacaga 

ccgtctctac 

aggtactcag 

aaaaaaaaaa 

tagggcttga 

gagaagaaga 

ttaagggtgt 

cggagtctgg 

gattggaggg 

ggggagggga 

gaaatattca 

tgtgagcctc 

ccaccagccc 

cacacacact 

gtcccacaaa 

tcaatcagga 

tccaaacaac 

ggccatccag 

gggatttgtc 

tctgagtgta 

ctcctccctt 

gtctatccct 

tgcctggaat 

tccctccctc 

tgttatgatg 

acaccttagc 

ttctctcaca 

accagttatc 

agtctctcct 

tcccaaggcc 

gaggctcagg 

gcacagccag 



ggacgcgcct 

gcagatgtgc 

ggccgggctc 

gcggtgctga 

ggctccggcc 

tcctctcctc 

tgtggcagcc 

ttaactcttc 

cctaagggcc 

gagcctttgg 

agggccaggc 

gaatactttt 

tctccactgt 

ttaaaaaatt 

gatgaggtct 

cccgccatca 

gagctgaagc 

gcacctcttc 

cctgaggaac 

gttccttcag 

cacagacagc 

acacagcccc 

tacacttgcc 

gcaaccctgt 

gaaccaggtc 

gatcagaggg 

cttcatcttt 

gacttgggtt 

tagtttgaat 

tcccagctaa 

cctggacaac 

gtggggtgtg 

tcacttgagg 

taaaaattca 

gaggctgagg 

aaaatctagc 

ggaagcaggc 

gaagaaagga 

ggggtgggct 

ggtcttgcag 

acagccagac 

gctgctggta 

tgggccatcg 

aggaagaagc 

agacagaccc 

cgcaccaagc 

caggcaggtg 

cccagcagag 

ccagattcac 

cccaaccccc 

tgagattcct 

cccccattct 

tctctaattt 

tccttcccat 

ataaaagaaa 

ccacccagcc 

tcactcagga 

agagcatcaa 

atctcatccc 

atgtatgtga 

gaagtaggaa 

aagactgcac 

ctctggctcc 

gtttcagtcg 



actgaggacg 

cccggcgctg 

cccagccctc 

gatcgaagag 

ccagtcgccc 

cccggggagc 

ccaagagggc 

cccatcccaa 

cacagcggac 

aagctccccc 

tgaccctgac 

ctcctaagtc 

ggcccctgcg 

ctattatttt 

cagaatgtaa 

aagagcttcc 

cctgcattct 

tcttcatttg 

taagcctggg 

gaaacaagag 

cccaacatgg 

atgccagagc 

tccttggcta 

gggtatgtgg 

agagaagcag 

acaggaatcc 

ctggtgtgtg- 

gggatttagg 

ttggagctaa' 

ttgggaggct 

atatcgagac 

atgtctcaac 

tcaggagttc 

aaaaaattag 

cacaagaatc 

ttgggaggtg 

tgaggttctg 

gaagaaatca 

gggggagagt 

gggcttgtgt 

aagagctctg 

aattagaggc 

gtccagatgc 

ctgaagcaaa 

acggacgccc 

agagccatgt 

tcccctaaat 

atcacctcca 

caccttgacc 

accaatgctc 

ctatcccctg 

gcattttcgt 

cctgcatcta 

tctccccacc 

actgcgataa 

ataccgtggt 

gcctgtaatg 

agatccctca 

aaaccttcac 

aacccaccaa 

acctcttccc 

ctctctgacc 

ctccaagcag 

tctagaacag 



actactgggg 

cctgccaggc 

tgctttgcct 

gccagtgtcc 

caccttccct 

aggccgcctc 

gtgtgtggtg 

ggctggggtg 

accagagggg 

agggagcccc 

ccctgcttac 

taccctggac 

tggtgaagat 

atctactgta 

ggattgcagg 

tggtggctgg 

aggtgaggcg 

tccaggggaa 

gaagacctgc 

cagttctcct 

ctttgggtaa 

aaacacctta 

agtgcccttg 

tctctctctc 

gtttccaaac 

caaattggga 

gaagagaggc 

ggtgagtctc 

gaatctagct 

gacgtgggag 

tgagtctctt 

gtctgtaatc 

aagaccagcc 

ccaggcgtga 

actccagcct 

ggaatagaaa 

tgattctggc 

gcatctaagc 

gagagcagct 

caggctggct 

gtggaggagg 

aggctgtcca 

tatttcagaa 

gccactctcc 

atcacgtgca 

agcacgtgca 

tctgacatgc 

gcgatctcac 

cacacaccct 

tgatggggaa 

aaaaaagcaa 

ctgccagaca 

ttcagttacc 

acctccctgc 

cctcctcgct 

gaccagattc 

gcttcctaat 

gagtctggta 

tccagctaga 

ccgactttga 

cctcctcaga 

atatacaggg 

acggctgccc 

agaatgagca 



gggctacggg 

gcccccggac 

cctgctcctg 

gatccccgct 

ggccttgctg 

tccttgcctg 

gctcagccct 

gggcccccca 

cttttgtctg 

acccaggacc 

cctagtcccc 

actttttagg 

caaaagaagt 

agatttgttg 

gccaggaggg 

aggtggtgtg 

cagtgtggca 

gagctgcagc 

tgtctggtta 

gcaaggagga 

atgtgaacaa 

gcaaacagag 

tgcagtgcac 

agctcctgtg 

aggctagaag 

tgcattactg 

tggggactgg 

atcagactga 

tgggcagggt 

gatcacttga 

aaaaatgttt 

ccagcactct 

tggccaacat 

cggcgggtgc 

gggtgacaga 

gatagagggg 

tagggaggtc 

ctgaggcagg 

ggtcagaaac 

gtgaggaggt 

ggctgctggg 

ggtcatagaa 

cagtgaaagc 

accaaccccc 

cacccacact 

agcacaccaa 

acactgacat 

atgcgcagac 

gagataggag 

actgaggcca 

aattcattca 

cccagcctag 

cagtccccca 

tccagctact 

ggtttcctac 

atctgatcaa 

gcctataggg 

ccaactgctt 

acgtttgtat 

gtgcccccct 

tctcagactc 

gttcttcaaa 

ccgactggcc 

tctaaccgcc 



139200 

139260 

139320 

139380 

139440 

139500 

139560 

139620 

139680 

139740 

139800 

139860 

139920 

139980 

140040 

140100 

140160 

140220 

140280 

140340 

140400 

140460 

140520 

140580 

140640 

140700 

140760 

140820 

140880 

140940 

141000 

141060 

141120 

141180 

141240 

141300 

141360 

141420 

141480 

141540 

141600 

141660 

141720 

141780 

141840 

141900 

141960 

142020 

142080 

142140 

142200 

142260 

142320 

142380 

142440 

142500 

142560 

142620 

142680 

142740 

142800 

142860 

142920 

142980 
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tggggagagg actaggacac 
ttacacccag agtagggggg 
attttcgatc cctaatccga 
aggacactgt ggaagggaaa 
accaggtggc aggcccaagg 
ccagatagac ttgtcagatg 
gcccttctcc aaccccattg 
ctctggagaa acggttttgg 
taagctgtgt gaccttgggc 
tgtttttgtt ttgtaagctg 
tcagtgcaat agcccatggc 
aagttctgcc tccctcttcc 
tcgaaggtag gaactttggt 
aagatccttt ggcctttgta 
tgcaaggctg gtggagtgag 
agtgggcagt ggagagagg 



<210> 16 
<211> 215980 
<212> DNA 
<213> Mus sp. 

<220> 

<221> modified_base 
<222> (1001) . . (1100) 
<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (2123) . . (2222) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (3728) . . (3827) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (5168) . . (5267) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (7481) . . (7580) 

<223> a, t, c, g r other or unknown 

<220> 

<221> modif ied_base 

<222> (8849) . • (8948) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (10375) . . (10474) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (12270) . . (12369) 

<223> a f t f c, g, other or unknown 

<22 0> 
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cagatgataa 
ggacggttct 
gaggaactcc 
tttcccaaca 
tccagctggg 
ttctgaaaac 
cccctgcagg 
gacagtagaa 
aagttactta 
gggacaatag 
aaagtgacct 
agcttttctt 
tggttctccc 
gaaacttcag 
a <jaggaggat 



ggtttataag 
cagccctgtt 
tctccaatga 
gactctgaga 
gtcaggtttc 
tgagcatctc 
agggcaggcc 
gggtaaatga 
acctctctga 
tgccagccta 
acatgcttgc 
ccaaccccaa 
ctttctcttc 
gtgaggaggt 
agatggcgaa 



cccttaagcc 
tccctagctg 
aatacagact 
gtccaggagg 
tatatgaatt 
ctttcacctc 
tgggacagat 
cctagttatg 
acattagagt 
aatcaatttg 
tgttattatt 
agatgtctct 
aggcccaaac 
ggcagagaaa 
gccctagcag 



tctaaggttc 
cgggctccca 
tgggactctc 
ccagggatag 
tttaatgctt 
tgtacatgat 
attcagtggc 
ttcccactag 
tctgtgggtt 
ttgtggggac 
ctctttcctc 
ggctattgct 
tccccacctc 
taagaaagtg 
aggggaggga 



143040 
143100 
143160 
143220 
143280 
143340 
143400 
143460 
143520 
143580 
143640 
143700 
143760 
143820 
143880 
143899 
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<221> modified_base 

<222> (13438) . . (13537) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (15902) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (15939) . . (16038) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (18223) . . (18322) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (20974) . . (21073) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (24403) . . (24502) 

<223> a f t, c, g, other or unknown 

<220> 

<221> modified__base 

<222> (27574) (27673) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (30892) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (30901) . . (31000) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (34443) . . (34542) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (38205) . . (38304) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (42373) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (42386) 

<223> a, t, c, g, other or unknown 
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<220> 

<221> modified_base 
<222> (42393) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (42461) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (44809) . . (44908) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (51380) . . (51479) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (56740) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (56765) . . (56864) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (62818) . . (62917) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 
<222> (68518) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif ied_base 

<222> (68534) . . (68633) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (74552) (74651) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (81446) . . (81545) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (88519) . . (88618) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (93791) 

<223> a, t, c, g, other or unknown 
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<220> 

<221> modified__base 
<222> (93794) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 
<222> (96565) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied__base 

<222> (96570) . . (96573) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 
<222> (96579) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif ied_base 
<222> (96590) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (96596) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified__base 
<222> (96602) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif ied_base 
<222> (96616) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif ied_base 
<222> (96629) 

<223> a, t, c r g, other or unknown 
<220> 

<221> modified_base 
<222> (96633) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96668) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (96715) . . (96814) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (104447) (104546) 

<223> a, t, c, g, other or unknown 



67 



t <WO 0229059A2_l_> 



WO 02/29059 



PCT/US01/31488 



<220> 

<221> modified__base 
<222> (114521) 

<223> a, t, c, q, other or unknown 
<220> 

<221> modified_base 

<222> (114527) (114626) 

<223> a, t, c, g, other or unknovm 

<220> 

<221> modified_base 

<222> (127063) .. (127162) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (139133) (139232) 

<223> a, t f c, g, other or unknown 

<220> 

<221> modif ied_base 
<222> (151051) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (153242) (153341) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (164706) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (164708) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (164710) . . (164809) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (182242) . . (182341) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 
<222> (192158) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif ied_base 
<222> (192192) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (198842) (198941) 

<223> a, t, c, g, other or unknown 
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<220> 

<221> modified_base 

<222> (199437) (199438) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (208276) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (215974) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (215976) . . (215977) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (215979) 

<223> a, t, c, g, other or unknown 
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<400> 16 

ttgggggtat aaacccagaa gtgggattac tgcaccatac aataatcctc taacttcaag 60 

caatttttcc acaatggttg tatcatttta cattcccact ggctacgaga agggttccca 120 

cttctacaca tcttcaccac catttctgtt tttgtttttg agtaacagct gcctaatgac 180 

tgtgaagtgg tatcttatct cagtgttgat ttgcatttct ctgatcatta atgtgggaag 240 

gcatcgtttc atatgtttat tggctgtttg tgtatcatct tctttggcga tgttgattca 300 

agttatttgc ttgttttttt aattggagtt ttaaaaaatt gttgttgagt tgtgggagtt 360 

cttcattagc tctgcatatt aataccctga tgaaaatgat taacaagtat ttgcttccat 42 0 

tttgggggct tccattctgg gctgttttta ttcttttgat actcttttga ttctcaacag 480 

tttaatctga ctaaaattca gtttatttct tcttttaatg gccatgctat tgacacatcc 540 

cgtaatcact gccaaatcca gtcatgaaga gtttctttca agagatttat agttttagct 600 

ctttaagttt gtcatgtctg tttcacttaa ttttgtatag tgtacaaaag tctaacttca 660 

ttcttttcta tatggcttgc tactagtata cgaagagcta aatttctctt tccttgagtc 720 

tcaacctctg atgtgtagca atttcttcag aggaaaacat ggtgggaagt tccttaaaca 780 

taggatgctc catggaggtg aaatagttca tcctacaggg aagcttgtta aacacaggaa 840 

gtacatactc agcagctcta gtaagtgagt gaaactgact ggaggcacta ggtccctcct 900 

tccctacgca tatagaagct gtaaggattg ggaagagata ctgtcaggtc agctcagctg 960 

ctgcccggaa gaagctcaga cccactggcc tggctccaag nnnnnnnnnn nnnnnnnnnn 1020 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080 

nnnnnnnnnn nnnnnnnnnn atcactcttt actcaggcca cctacacgct gtttatagcc 114 0 

tgcctttgtc tctttggcta tacttcctgt ttatgtctat gcctcccctc tttctttttc 1200 

tttctcttct cttctcatct catctcatct ttcttcaggg gggagcctgg tctagaactc 1260 

acaaagattt gactgtctct gtctccttgc actaattaaa aaatctttta caagcatctt 1320 

ttagcaattc ttacagggaa attttggaat gttaaactct gattgttagc gggctgaaga 1380 

taacaatagc tctgatgata aattgcttgc caggcaagtg tgaaaatctg agtttgatcc 1440 

aaaaagccgg gtacagaggc caaagagtcc ataatcctag taggggcagg aatcagggat 1500 

gggtgggtcc ctggggtttc ctggtttgtc agcgtagccc aattgggaat agccaggttt 1560 

cagtgaacga tgctttctgc aagctgagag aggtccttgt tcaatctctg tgacccaact 1620 

ggagggagaa gagagccagc tctccagaag tggtcctctc aactttgtgc atgcatgtcc 1680 

atgttcacac agggaatgga taatgcttaa aaggaagacc ggcagggggt tggtaatgca 174 0 

cctcctttgg tgacatgctt tcctcttgtt catgctgctc caggtgtggt cggcagcacc 1800 

aaaaaccagg tgtatgtttg taatcccagt attctctggt cgtcagtagg aaatgaaaag 1860 

cgaggtcatc ttcgtataga gttagcaaac tctaagccag cctcggctac atgagacttt 1920 

gtctcaaaac aaaggaaaaa tcaaggagga cggctcccga gcactgtcac ctgaagctga 1980 

cctctggcct ccacatgcat gtgcgcaaac acatgtcctg cacaaacaca cagacacccg 2040 

catctgctcc ccgacaaaag aacctgaaac cagtatactt tgagaatttc ccattcatag 2100 

ttaccattgt gtgttccttg tgnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2160 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2220 

nnggtggtgc ctttctctta cccagtctag aagggctgga ggcagggtgg atggggcact 2280 
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ttgaactccc acctaggcaa aaaacccagt gatctctggg ccagtgtgtt gtttgcaagg 2340 

gaataaggta gagagccgcg gaggaagaca ttgggggttc tatgagtatg tgaaggggtg 2400 

cacacaccac acacacacat tttttttgtt ttaaatttac aaacattaaa ataggctgta 2460 

atgtggctca gtgggtagaa aaacctgctg tctaagcctg gtacgagttc aatccctgac 2520 

aagctggaag gagacaacca accacaactg ctagcagcca gaagcactgc ttgctaacac 2580 

tcaagagagc ctggagtgga agacactgga tccccagcag gcaagcctgc aagaagatgt 2640 

gccttgccta gacaacggca gaacaaacat caaggctggc agagctgtcc aggactgttc 2700 

atattaatca tgtatagata agagggaatg gcacagacag aacaattcaa cacacggggt 2760 

atgaaaggaa aggaacaagg cacacaaagg acaaagaacc tagcatacaa gaaagcctaa 2820 

gcagagagtg gcacttccca gaagggagtc ataaaataga ctgaattcat taaaacaaga 2 880 

gccaaagata aacggctcaa aaaactcacg gaaaacaggt caaaataacg tcacccatct 2940 

gacagttgat actgtcaact taaccgtatc tagaactcca gcaggcacat ctccaggcat 3000 

gcccctgaag gggtctttgg actaggttaa ctgacgtggg agtgacacca tctatggacc 3060 

gaagcctcag acagaataaa aaggagccag tgagctgagc gtcagtgctc attgcttctg 3120 

gcttcctgtc tgtggctgca gcgagacacg gtgcttcctg ctttagctgc catgacagac 3180 

cacaccctca aaccgtgaac caaaataacc tcctctctac attgctttta ccaggcattt 3240 

ggtcacacca atgagaaagg ttaactaata cagcactcaa tacttaaaaa cataaacacc 3300 

aaccttgttt gcatgtgtga gactttgaag ctcacgggcc agttatgccc aatgccaggt 3360 

ctgctggcta agggtgagag tgcacaccta taatcccagc tgctgtggaa tcagcaaaag 3420 

cgctacagat ggaaggcagc cagggcagct gagactgact caaactgata gaggtgggag 3480 

gcatagagaa aaccagatta atagagtgtt ccccactatg caagaagccc tgggtttcag 354 0 

gacgagagaa ctaagaatac agaagtctac tgtgtagaag cactgctagg tcacacagaa 3600 

acatcactca agtgtctctg gatgctacac ggagggcgtg tgaagtattg cttcctgatg 3660 

atctgtatct actacagcac tgctgtttta gtatgcgctc ctccactaca gctcctcacc 3720 

acaccaannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 378 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaat taatcaaaga 3840 

aaacacacac caccagttag agaaagttaa tcaggccgaa- tggcggcttt cccctgtatc 3900 

caggctaccg tcaggacggc tcactgccac tggcaactct gcctgaacaa agcccgcagc 3960 

caacgtgggc ttcaggggct ctaaacactg caatcaaagg ttgtgtgtgg gggtgggggt 4020 

gctgctgcta ttcaaggatt cccaaagctt agatgtattc atcatactca caggaaagcg 4080 

tgttcaaccc atcactcatg agcagtcggt accggggtga cctattccct gtagaaatgg 414 0 

gacggatgtt ctggaaaagt tgacagaaaa gttgattcat taggcaggct ctttgcccaa 4200 

gccctgaggg taagcaaagc taactggcag gagactaggt ttgccattaa tctgagacaa 4260 

gatgaaccac ttgcccatcc tcctgacacc taaatactaa tgaaagaaca atggattgag 4320 

ctggcattat taaaaacgat agaaacagaa gtatcaatag tcatgtgttc tttctcccat 4380 

atgtcaaaac aatgtgtaag atggcatcga acacatgcag aaactgttta gggaacatgc 4440 

tgaaaatatg aagtaaaatt aaaattggaa agaaagacaa tttgcctaaa gcagctcaga 4500 

gctggagaag ggaccgaggc agagataaca gcaacgtgtg gacatacgga tctggggcag 4560 

agcagtcacg gactcagccg gaaagggtgg ggcagcctct gaaggaagtt aaggtaaata 4 620 

gagccacaag gtgattggcc caggagtggt gccaccttca cctcctgcct caaagtctga 4 680 

aggaatgatc ctggagtctc ccatctattg atatatgaaa ttcacagtat gttttagaac 4740 

ccactgaatg atgggtagat taactaaaag aaatttaagc ggggtggtgc aggtctttta 4800 

atcccagcac ttgggaggca gaggcaggtg gatctctgtg agttcgaggc cagcctggtt 4860 

ccaggacagc cagagataca tagagaaacc ctgactcgaa aaaacaaaat taaaagctca 4 920 

tcaaaacaac aacaacaaca aaaaaaacaa aaaaacaaaa caaaacaaca aaacacccta 4980 

tagtacctgt tggtgagttt gagtgagtga gtgagtgtgt gttagagaga ggggcgggga 504 0 

aagtgtgttc tggaaatggg agaaagagaa tgtgcatgtg tgtttctggg atgtagacaa 5100 

aactacatgt cttccatcaa atgcaatgtt taattatcta tgagttgaac catcttcatt 5160 

ctgctaannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5220 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaac aaaaataaac 5280 

caaaccagta aacaaaatcc tgtaagataa agcctaagac aagacacttc ctggggctgg 5340 

ggagttgctt agaccataag gagttcataa tccaggcgtg agagcccgag ttcaggtccc 5400 

tgggcttcca agtcaggagg agaccaagga atcaacaagt ctcgactttg gtctctagtc 5460 

ccatgcacac acacgcgtgt aaatacgtag atgttcactc acacacagaa gactgcacct 5520 

ggctctctca catctcagcc aacatataaa gcctgcatta tcagaacatt ctaggttcta 5580 

gtttcagtca actcttacac agaatggcca tcatactccg tctacaactt ctcctgatct 5640 

acccacgtgt cattgcttca gtattaacaa aacccagaat aaccagctgc gtagatcctc 5700 

cctgatgccc cagtcattgt cttactgaga ctactaagtc acaaggtagc actctggatc 5760 

caaaaagcaa tatccaattg agagttacaa cctataagga ggagtttacc ttcattatag 5820 

ggcactggat tcccaatctt taatccaacg tcttcagcag atttcataac ttccaagtcc 5880 

atcaaaacaa ctactttcct acaaagacag acacaagtta gaattaagaa ctctgcagcc 5940 

tttcagatga gttactaaga agcttacttt agtagttgtc tggctaaaac tgtatccttt 6000 

accaaccttt tctcattctg gactaacttg agaagtatta attcctaagt aaatacttca 6060 

cttattcttt ccccacatct ccaatgtttt tgtctttaat ttattatagg gcaattcatt 6120 
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tcctatctag ttccctgatt aaaacagtag accttgctgc atgccattat cctcatggag 6180 

gcactgatac aatttagatt attaaataca aaaccctaaa acacaaaaag atgatttttt 6240 

tttaaaacaa gattttaaaa aaagcatgtg ctacgcttcc ttctgccact aagcctacac 6300 

atggtcctct gactgaattt ttcccctcat tctgcttcat ctaatatgtg cttttcaaac 6360 

ctggaattga accagggact tattcatgct aggcaaatgc tctaccatag agctataccc 6420 

ctccaactcc catctcaaat atcatttcca aagacatttt cttggtctct tatttagatc 6480 

aggtttcttt gtcctcctgc agctatgact tcattccttc agaacactcg tcttagcttt 6540 

aagttctgta ttaattagtg attgttttca ttctctctgc tagaatgcac tttcaataaa 6600 

ggcaggtagc cagccacagt gcttaattaa gcaacagccc aacgatgtca ttcactacat 6660 
actgggacaa gatgcctaac atcatctgca gataaagacg aactactggt gtcaggagac . 6720 

agctaagggg tccagggctt gggcacgctg agtgtgagca ctggagtccg ggtgcccaga 6780 

aacgcacata aatgcaatat ggatgtggca atctacctct aattccttct ttaagacagt 6840 

ggctctccag agcaagctgg ctagcaagac aagccatatc agtgagctct gggcttgacc 6900 

aagaccctgc ctccaggtgt aactcccaag caaaaggatg atggctcaca aatctcaggc 6960 

tatcatgttc atgtacaaaa tgtcaaccgg catacacaca tgcacacaca tgaaaactgg 7020 

gagaaaataa gaagaattgc aaccaaaaaa tgtaatttga ggacacataa ttgcaggcgg 7080 

ggagtggggg gatgacagaa ggtgaactga gtggaccgag ggaaagctgt gctagcggca 7140 

atgagaagaa gggtggggca gtctgagcaa gggttcagca atcaccacgc tttactgtct 7200 

gcacagcctg gctgtagaat gctgggcttt atcacacaga attattcagt atgtgctatc 7260 

tttacagtaa agttattcta tcaggctatg ctacttcaat agaacaagcc tgaaaaagtg 732 0 

gtctgctgct gagaacctga caaagatgac ctgttagaac tgtctgccaa gtgtggaatt 7380 

ccagcactgg ggaccaggag ctcgagggtc accccagatg cagggagtta gaggccagtc 7440 

ttggcaacat aacatcatgc ttcagaaatt aaaaacaaaa nnnnnnnnnn nnnnnnnnnn 7500 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7560 

nnnnnnnnnn nnnnnnnnnn catgagatag ttaataaact gaagaaagcc atacaaggag 7620 

taaagtagat agttgcaagc atgaagaaag acaaaccact tgagcttttc ttttgtcgta 768 0 

aggaggaaac cagacaggtc cagagagatg gctcagagat' taagagcact gactgctctt 774 0 

ccgaaggtcc tgagttcaaa tcccagtaac cacatggtgg ctcacaacca tctgtacagc 7800 

tacggtgtac tcatatacat taaataaata aataaataaa taaataaata aatcttaaaa 7860 

gaaaaaaaaa aaaaacctaa ccaatcagcc aggcgatggt gacacatgtc tttaatccca 7920 

gcacttggga ggcagagaca ggtggatttc tgagttcgag gccagcctgt tcttcagagt 7980 

gagttccagg acagccaggg tgatacagag aaaccctgtc tcaaaaaaca aacaaacaaa 8040 

caaacaaaca aacaaaaaag gaggaagcca gacaggatgc actttatacg tgaatggaat ' 8100 

tgacaaaaga caagttctat aagtgttagg gaaaggggga ggacaacggg ggttcatgtc 8160 

tgtggtggaa cacgtattag aaggctctgg gtatcctgtt tccgacaaac aggcactccc 8220 

aatcacacag gccactggat gtctcaggca gagaaagatg tgatagattg actttttaac 8280 

aatcacagac tgtgtggaaa atatttgtaa ggttgtcatt gtcacccagg atagagctga 834 0 

tggttattca aacgaggatg ggacaacaga aatgggagag agggatgtga gaaccatttt 8400 

gaaccagggt gatttactgc gcacgtgtat agggtctaca gggagtggga tatgtagagg 8460 

aggcctatgt tcctaacttt ggtaatgagc ttattacagt tactatgcac agcctggaag 8520 

atactggaaa aggtgcaggc taggctagaa aggtactaac tgagggtttg acagcccctt 8580 

ggatgtcagg atgcagcaag cctacctctg tatgtagtca atcccttctc aggctatggg 8 640 

tcctgcagat catccgtctc tgtatccatt attcccagtc catcctctga gtggctccct 8700 

cttatccagt ttaacaaaat gctgactgca agctcccaag cccagggctc tggctccttt 8760 

actccttgtt attgtacttt accctgtttg cttgggatag agtgtgccct ttataaacat 8820 

ttgtgaaagg gggaatgaag aagaataann nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8880 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8940 

nnnnnnnnag agagctcaat ggttaggagc actggatgct cttccaaagg tctccagttc 9000 

aattcccagc atcaccatgg cagctcacaa ctgtctgcaa ttccagttcc aggggattca 9060 

acactcagaa acataagtgt aggcaatcta cgtaacataa aaataaataa atgagctgga 9120 

aaagaaaaca tgtttcaaaa tatacaagta atggggctgg aggagatgtc tcaatgggta 9180 

agatcattgg ctgctctttt ggaggttctg ggttcaattc ccaccaccca catgacagct 9240 

cacaactgtc tgtaactttg gtcctgtggg agctgatgcc ctcttctggt gtgcagacat 9300 

acatgtagac aaaacacctg catacataaa ataagttttt aaaaaagtta cacatacacc 9360 

cgtgtgtaat ataacacaca ctggcttaac ttcctcagca ctgactgttc accatacgga 9420 

ttcccatgag gttttggttg cattctatca ccgaaaaaaa aaaaaaaaaa ttagaagaaa 9480 

gtatatacat ataaacctct ccctaaaata aagttttctt ttctaaaagt acatccttat 9540 

ttttttattt tttttttttt ttaagaaatg ggaacaacag ttctgctcac actgtatttc 9600 

tagcatgtaa catcttgcaa gtacttaacc gtattctata tcagctcaac acacttacta 9660 

ccgaagactc aagatcacaa aaaaaaaaaa aggacccaga ctggataatt aaacgtttct 9720 

tttgttgtag taagcgacct cttccttaga agatactaca gtaatgctga agaaatgaca 9780 

catctactgt aatctgttct ctgggattcc aacttgtttc ctctgctact cctcccttgg 9840 

cggcaatgtt cgtctgcatc cggctgagct cctcgctgcc ttgttaaacc tccttcctga 9900 

acttccgacc tgtagttccc gctctacagt gcaagcgagt ggataaggaa gcgcatacct 9960 
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gccgtctttc agggtgttga cgatgaactt gtggacctgg cagacacagt tgctggccag 10020 

ctgccctccc tcgaccaggg tgttcagctg cgtggccagc atgaacgctg caaaagcaga 10080 

gagagagggg ctcagtctcc aagcctttcc ttaacccgaa agctcatcac aaggagaacc 10140 

attaaataca gctgtttaaa actcctccgc cctgcagaga ggaaagcagc atcaatccgc 10200 

cccatgtaaa agtctgaggc tcttcctaaa tggtatctgt ttctcacagt ctccaaatca 10260 

tttttactgt aattctagtt tctggggaaa gacctttctc ggtctttagc cccgtgacta 10320 

gagacaacag gcaaatattc cagaaaggcc cccattttct ttttaaagct tctannnnnn 10380 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10440 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnngcacat cttgtgaagt gtccacatct 10500 

ttcggtccct cgaatttggg tttcttctgg gacgtggtag catgtgactg tcactccagt 10560 

gcttggagca gcagaggggt caggaactcc aggctggcat tagctgcaga gctggagcag 10620 

gtcctggaga acagaaactt tggttgcagc attaatgaac tagaagaatt tttttgtctt 10680 

ctgttaaata taaatacctc cattatcttc tcataaacag tgttgccttt ttatttaagt 10740 

ttttaaggat caggcacaga gactccatgc cagactacca ctcaaccact gagctacacc 10800 

cccaacttgc ctttctgcta ttttttaaat tgtatcagtg gccaccaaac atggggagag 10860 

gtcagggggc tacgtggagg aattgtttct ctcctaccaa gtgggcccca ggtttcaaat 10920 

tcaggtgacc tggcttggca gcaagcacct ttacccctaa gccatctcat tggcttcatc 10980 

ttttaatggc cccttcccct gctctgaggc aggctctccc tatatagccc tggctggcct 11040 

caggctcgca ggtccaccag tgagcaccag gtttctgctt gtccttacct ccccagcact 11100 

gtggttataa gcatgtgcca ctgtgtcaaa ctcagtcact aagctttgcc aagccatagc 11160 

ccagcccttg agtttactgt ttgtctgtgt ggtgatttgt caaaccactt ttgttccact 11220 

gaggtatttt gtcaagtttg acaaaattag ttgagtatgt aggtcttttt ttctggaatc 11280 

ttctgttata gcttagtctg gtcttgaact catgatcttg cctcaacctc acgattattg 1134 0 

aggattattg agatggacag gctgtgtgac catgctcggc tgtgtgtttt agcatgcatt 11400 

agtcatttga aaaacgttgg ctcatgacac tttacaggtc ttccatgttt gatatgtttt 11460 

atttaatcca aagtaattcc agcaccagag gctgagacag gaggatctca aggtcaacct 11520 

agagatgcat agcaggcggg gccccactcg gttaggttaa- tatcatcact gacttcagga 11580 

gaaaagtctt aagtattggg gactaaaagc aggaggatct gaagttcaag gtcatcttta 11640 

ggaacttagc agacttgagg ccagcttggg cgctgtggga ccctgttttt aaaccagaaa 11700 

acaaattgaa aggaaaaaaa aaaaaagctg gaggaagtga atgtgagtgt tcacatagtc 11760 

ctgtttccac aagaaaacag ggttactttt ggcaacaaat aggtgctttc tttgaaggct 11820 

ggcatttttg tgacttgtca ttggagaaat gatttaatta agacttttct actgagtgcc 11880 

tctgaagagg ctcttttaaa tttagtttaa ttttatctca ttgttagtgt ggtgtgcttg 1194 0 

tgcacacaga aggcagcttt ctagagtctt ttcactctct cctccacagc tcctggagtc 12000 

aaactcaggc cctggctagg caagctctta ggacagtgtt agctgtagct tattaagttt 12060 

ttaagaattt ttataagact ctgtttttct ttctcaggtc atgatacagc aggaaaatac 12120 

atccataaag cccatcctgc aggtcattgt aagtaccggc atgtgtgttt agcataatga 1218 0 

agatggttca cttatagtta attaaacatt ggattggatg gaagacatgt agttttggtt 1224 0 

acttcccaga aacacaaatg cacattcttn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12300 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12360 

nnnnnnnnng aattcagagc tgatatgtag tactaactcc tactcaatga atcctttgtt 12420 

cttctattcc ttcattacat tactgttaat agtggtaact atgtaccaaa gagtcaaata 12480 

actcttggac catccaaggc agaaggaagg ctggcaaaaa tgtatgatga tctgggatgg 12540 

gaatgtactt cagtttgtac aggaggccct tggttcattc catttctggc aatgcataga 12600 

cctgtaggat ctcagcactg gtggggggtg gggggtgagg gtgaaggggc gggaggttaa 12660 

aggcagaata gtcataaatt caaagtctgg gtcctggaaa gaggactaaa cgattaagag 12720 

ctttagctgt tcttctagag aacctggtgt gatccccagc acatggtgcc tcacgactgt 12780 

ccgaaactct gattctaggg ggatctgaaa accctcttct gccctctgta gatacagaac 12840 

acacatggtg cacatacata catgcaaccc aaacaaccca tatacataaa atattttttt 12900 

ttcaaaaaga cattcaaatt cttcctcggc tatatagtgt ttaccaaacc tcaaaaacaa 12960 

aacaaaacaa aacaaaacaa agaatcatta atgttttgcc ttcatgtatg tctgcccacc 13020 

acggacatgc ctggtaccca gggagattaa aagaagacat tagctcccct ggaatggaga 13080 

taggtatgat ctaccacttg ggtgctggga acctgggtcc cctgcaaaag cagtaaatct 13140 

ttttaacccc taagctgtct ctcccaacgc ctaaagattc ttgtaacaca gcatgatgag 13200 
cactggcaag catagcatgg taatctgact tcagggcgcc agattttgag cttaatgctt 13260 
gattattaga agtaacgtac tagatttaat gcctggagct tcaagcaaca aaattaactg 13320 
aagaataaaa ataaaaaccc tgccagccat gatggtaatc ccagaacttg agaggcagag 13380 
gcaggtgatc tctgtgtttt gcaaggccag ccacaatcta catagcacgt tgcagtannn 13440 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13500 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnncga ataaatctac acatgtaaaa 13560 
agaaattcaa agaaacaaat gccaaataaa tacacatatt gtaataaaga gataattgtc 13620 
taaaaaactc aaggctttaa atggtaagat atcatattct tggatgaaaa gatctaatgt 13680 
caaaatatat caatttaatg caattatgta tattcaggag atctctggtt ggcttttgaa 13740 
cttgatagca ctcttataat tcacatagaa gaaaaaaaac catgaaaact gccaaacatt 13800 
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attagaatac 
tgtagatcat 
gggaaaataa 
ttctctctaa 
atctgtacgt 
aaattatctc 
tctggggcgt 
aatacttctg 
atagtacact 
aatgcaaaca 
gtgctgggcc 
agttcaaggc 
ctgaggatat 
gaggaaaaaa 
taccacaggg 
ataaagaagt 
agaagatcag 
gtttctttat 
tgtcaagtgt 
tgtgcagtcc 
cctctgcctc 
tacttttaaa 
aaactcagga 
cagagaactt 
aagatacaag 
tgcttccatc 
agccatttga 
catattaagg 
tgaggctcta 
aggatgacta 
caggcaggct 
atggctacat 
caaacactat 
gcccatctgg 
tcactggaaa 
tntcagagga 
nnnnnnnnnn 
nnnnnnnnnn 
atagctttca 
accaggcctg 
tgacaatcag 
gtgaacgcag 
gtccttaagt 
acaacttgat 
aatattgaac 
agcttccttt 
aaagtaggca 
ttattgcctg 
tgtgtgtaca 
cctttaacaa 
acaagcatga 
cattgacctg 
tactggcaac 
agatgaaggg 
aggccagtct 
accacctact 
taagatggct 
ggaccccaca 
gtgtatgcag 
cctttaaacc 
agcctacata 
aatgaataga 
agaagaggac 
gtgcatacac 



tccacagatg 
ccacgccttg 
cctggtaatt 
aatacattca 
ggatcaaggt 
tagattactt 
aacaagaaaa 
aatacacgga 
aataaccatg 
gatggaaaca 
tagaggccca 
cagcctagac 
agctcagtat 
aaaaataagc 
gaacgacaca 
gtcactgagg 
tgagcctcta 
gggggtaaga 
tcaaaaatca 
tggaactcac 
ccaagtgctg 
aattgcccag 
taccagtgtc 
tttggtttgt 
gctggcctca 
tcccaagtgg 
accacacccc 
caacaaaaga 
ggctaaatgt 
tgcagtcttg 
ctgtagagga 
tagatgagac 
gccttcccat 
aagagaaacc 
tacgaaaata 
cataatgcag 
nnnnnnnnnn 
nnnnnnnnaa 
gggatcaaac 
cttctttgtt 
caagtacctt 
tttctcttgt 
tgcttgggtt 
ataatcatta 
tgttctttca 
ctctattttc 
gacaacacag 
ataataggtc 
catacataca 
aacaagtact 
caaccagagt 
tagccctggt 
aaatctgccc 
cgaggcgcag 
ctgtgaccag 
gagactgtct 
tagaaggtaa 
ctgtagaacc 
acacacacac 
cagcattggg 
aggaatttca 
aggaaggaag 
accggcttgc 
ccacatacaa 



gtattttggc 
ctagggagag 
cctcattagt 
ccctcagcat 
agtagcatct 
atagtagcca 
tgtctctact 
tcaagaagcc 
tacaaccata 
atccaacagc 
ggtctataat 
aacttagcaa 
agagcatctg 
aaaatgtgat 
gtaacacaac 
atcaggaaat 
ggactcaaga 
aaacattcta 
tacttttttt 
tctgtagacc 
ggattacagg 
tgactcatga 
ttttatgtct 
ttcatcttat 
tactatatag 
tgggattaca 
agccttttca 
cataatgcgt 
ccagcactta 
tgtacaaagc 
cccagcttaa 
cctttctcat 
cacactcaga 
taagttacat 
cagtcttaag 
aaagtggaca 
nnnnnnnnnn 
tcagaggaca 
tcaagtgtgt 
ttttatgggt 
tctctacctg 
gaagtgcatg 
ttatgggaaa 
tttttaatcc 
ttcttacata 
aaatttgttt 
tagaaccaag 
ctgttttgtc 
tatatgtatg 
gggctgggaa 
tgagatcccc 
gctggatgaa 
agtaggtaag 
cggcacacac 
cagcctggcc 
cagaaataag 
aggcacttat 
aactcctcca 
taagtaagtg 
aggaagaagc 
aggcagccag 
aaagagagac 
tggtgtcttg 
ttgtactcag 



agcacataca 
ggcgggtgtg 
taaatcatag 
ctgcattgcc 
gcatttaatt 
agatgatgca 
caggttcatt 
acagaaagag 
tagtacacta 
ctgtaagctg 
tccaactaag 
gaccttgtct 
cttagcatgt 
ctgtctgcac 
aaaaacttag 
gcatgactcc 
gatttgggat 
aacttaactg 
tttttttggt 
aggctggcct 
catgcgccac 
atacaatcag 
gctgcccaac 
ttgttgcttt 
ctgaggacga 
ggggtgtacc 
gagaaacctg 
ggctaccata 
taaaaaagag 
cccgcattca 
ggtcatcctt 
aaacagaata 
ataaagcact 
tccttgcttg 
cttggtcaga 
aatgcaaann 
nnnnnnnnnn 
tctttcagga 
actgagcgct 
ctgaatcaat 
gctagtaaga 
gacttgatct 
ggctcttggg 
tttaaatagt 
ataaatcctg 
tatatttttg 
tccccatata 
tctgttttct 
tatgtatgta 
gacagcacag 
accaaccgca 
agctggggag 
ctctgggctc 
ctttcgtggt 
tacatgtcaa 
ttttttaaaa 
cactaagcct 
agttcttctc 
aatgtaaaaa 
gggtggatct 
ggctacctag 
agacttaaaa 
acctctggct 
acacacacaa 



tcgaagggct 
tgtggggggt 
tcagaacctg 
aggaaaccac 
gacattctcc 
aattatgtta 
caggtgcagt 
ggctaaccat 
atattcagtg 
aaaaacaaga 
gtcgaggcag 
caaaacaaaa 
gcactgacag 
aacaggatat 
ccctgaaaat 
atttacatta 
tggcagctaa 
tgagaatgac 
ttttcaagac 
cgaattcaga 
cattgtccgg 
aggcgggaga 
tatcaatttc 
tggtagaatc 
ctttgaactt 
gctatgccca 
tacaagcctt 
gagtgtttgc 
ttaaaaacac 
atccccagca 
aggtaagtta 
aataatttaa 
ctactggccc 
tgtcatatgt 
aagcttcccc 
nnnnnnnnnn 
nnnnnnnnnn 
gttagttctt 
tatgcccagt 
tagcaccatt 
gaagtaagtg 
ttgctcacaa 
ttttttgaaa 
atgacttatt 
ccttaaaaaa 
catgattttg 
gctgtgcaca 
cacagggatg 
tgtatgtatg 
ttagttatgt 
taaaaagctg 
gcaggtagat 
agacatccta 
agtgcttgag 
gttgcaggac 
aattgagatg 
gaagccccga 
agacctccag 
acatgacgta 
cttgagtttg 
aaagtagctg 
aatatatgct 
tgtacacata 
acatgtactc 



gtgaaagatg 
atagctgttt 
gactagcaac 
tcctagcagg 
taaatgcttt 
cactgtatta 
acttcccctg 
atacaagcat 
cattactcaa 
taagcaaaat 
gaggatctca 
agtaaagagg 
ccgtatcaca 
cacagccccc 
actatggtaa 
tatagaaatg 
agggtactgg 
tactcaacaa 
agggtttctc 
gattcacctg 
ctcaaaatca 
ggacagtggc 
ccatagttac 
tcaatatagt 
ctaatcctcc 
gcaagcacaa 
agtgccttag 
ctaccatgtg 
tcatgactca 
ccgtgcacat 
gaggccttag 
agctcctgat 
tttaaggact 
gataacaaac 
agcaacatga 
nnnnnnnnnn 
nnnnnnnnnn 
tcctccctct 
gcgccatcgc 
acaacaatgt 
cctttggtgt 
cgttttttag 
agattttact 
ttaacagatt 
taatcctctt 
aacatttata 
tagcttcaga 
tgttattgtg 
taatgaactc 
gtttaaccgc 
ggcatagtgg 
cggcagagct 
tataggaaaa 
aggcaggggc 
agccagagcc 
aaggagctgg 
gtttgaccct 
cagagcacaa 
gtggcactgg 
agaccagctt 
tttatgaatg 
ggagagtaac 
cacatgtgta 
attcatatac 
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tgcacacctc aacactcaga aaatgaaaaa acaggtacca tttacacctc cgtgttcggt 17700 

ttccaaccac tcatatgtat gggttgtaaa tgcttatatc tgtatgtgtc tgtatatttg 17760 

tgtatacatt caaagttgag tcaggatcca acgtaaactt ggatagtagt gggttgatgg 1782 0 

tctggaagcc tgctcgcagc tgtctttttc tcctcgtacc ttttcccctg tttgtttcta 17880 

cgacagcagg tcatttgtct ctaagtgtta gtttcccatc ctctctcttt tgctgatggt 17940 

agccttgtag tagtcacctg tgttctctgt aaaatggctt tgccgtgtta tttcaatatg 18000 

ctatcatcct catcttgcta tatttcattc aatatatgta tatattacaa gatagattaa 18060 

aattatttta attttatgct tatgaatgtt ttgcctaagt atattgcacc ttgtgtgtct 18120 

agtgtccaca gaactcagaa gaaagtgtca catattctgg aactggaatt gcaggtggtt 18180 

gtaagccacc atgtgggacc tggaaaccaa atccaggcgc cannnnnnhn nnnnnnnnnn 182.4 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18300 

nnnnnnnnnn nnnnnnnnnn nncctttggt caaagatcct cagtttcgac tttgattacc 18360 

cagacttcct gtttctctca tggaacagtt tccccctgag atttactagt ggaagaaagg 18420 

cactcaaaaa gcagggagcc ctcgtacaaa tgagacttcc tagctatata attaggccga 18480 

gatgcacaca tccacagtca ttacccttct tcagagcctt tgtcatgtca agtgtatttc 1854 0 

gcccatgtga actttagaac tggcttgttg tgtttcataa aaagtgtagt tgggctcttg 18600 

attgggattg tgttaaattt atagattcat ttagggagag ctgacaactt tacagtatta 18660 

aatgttttca tccgcggaaa aggttgtctt gccacttact tgggctttct tttatgccct 1872 0 

taagtaaagg tttatagttt tctttatatc agtcttgcac atttcctgtt agatttattt 1878 0 

ttgcttgtaa ggtttccttg gttggtattg tgtataaaca ttttctccca attacatacc 1884 0 

ataattgatg gttaggagta taaataaaag gtagaatttt aaaattgctg tatgaatgac 18900 

tgctttgcta taattcaaca tcttttcatt tttatgccaa tctacttgac atgtttaggt 18960 

taaatgatga tatctgtaca gagtaatcct ctgatccagt atttgcacat cttactttct 1902 0 

aacgtccata gcatagatac acatcttata ctgttgagta catatatatt taaggtattt 1908 0 

accatagtct tataatatgc agcgtgcttt ggttcaagac agttgccctg tgttcctcaa 1914 0 

cattaacatt tttttcatca caaatacaca ttgaccttta tcaaattttt aaaactatct 19200 

tgagagaaat gaccattttt cttaatctgt taatgtaaaa tttttataaa aatagttata 19260 

aataatatta gcctacatat ttcttctgtt ctctttttca actcttagaa tcagagtatg 19320 

gtagtctcag actaaaccag gagcttccta tctgtttctc tgttcttaag tcacttatat 19380 

aatgtaagga tgctgtgtat atctgccagc taggccttat atacaaaagg cacccatcac 1944 0 

aaccttctaa aacagtctta ccacttagag accatgttca aacatatggg cctttgaggt 1950 0 

aattgccaca ttcaagctat aatattgtta tctaagggaa tatcttcact tctagcagat 19560 

gcctaaaaat atctaaaggt aaacactggt aattgctgtg tttgttgatg ctgctcttcc 1962 0 

tcctcctcct cctcctcttc ctcctcttct tcctcctcct cttcttcctc ctcctgcttc 1968 0 

tccttttctt catcctcctt tcttttctta tttttgaggc atgatttcac catgtagccc 19740 

taggtaaccc gtaacttact atgtatgtag accaggctag cctctgtctc ctgagtgctc 19800 

atattaaagg tgtgtatcac catatccagc aacacttgct ttgagatggt tagaggaaaa 19860 

aaaaatatac gtaaataaag atggatgcca attactaaat tgttacttcc agtcaaactt 1992 0 

tgtacctagt ctaaggccaa aatagggatt ttttttctac tttgcaagtt ggctccatta 1998 0 

agaggctttt cttctcttgg tctcactaga taggaaggag agagaggagg gaaggagaga 20040 

aagcggttga ggagtgggag gtagtgtgac cgagaatacc cagtaggctc atatatttaa 20100 

atatttggtc cctagttgat agaactgttt agaaagatta ggaagcatgt cttaggggct 20160 

ttgaggtttc aaaatttaat gctagaccca gtctttcaag ggagggggcg gtctgtctct 20220 

ctctgcctgc tgcatgcaga gctctcagct actactctag tgtcaagcct gtgtgcttcc 20280 

tgcctcaatg atcataaatt aactgtaagc aagcctccaa ttaaatgctt tcttttatag 20340 

ttaccgtgat catggtgtct cttcacagaa atagtaacct gtggtgattt taatatgcct 20400 

ggaccaggga gtggcacttt taggaggaat ggccttgtta agaggaagtg tgtctctgtg 20460 

ggggtgggca atgagaccct cgtcctaacc atgtgagaac cactcttctc ctattggcct 2052 0 

tcagatgaag atgtagaact ctcagatcca cctgcaccat gtctgcctgg aagctgcctt 20580 

tgttcccacc ttgctgcccc aattaaatgt tgtacttata agaattgttt ttggggggct 20640 

ggagagatgg ctcagcagtt aagagtactg actgctcttc cagaggtcct gagttcaatt 20700 

cccagcaacc acatggtggc tcacaaccat ctgtaatggg atctgatgcc accttctggt 20760 

gtgtcagaag acaggacagt atacccacat acattaaata aataaataaa taaataaata 20820 

aattcttttt aaaaaagaat tgctttggtc atggtgtctg ttcacagcag taaaacccta 20880 

acataaccct gactaagaca acaagtgagg aaaggtgttg tgtgacactc tggatctctg 20940 

gaagctcacc tcagcatgaa gcttgtcgaa gcgnnnnnnn nnnnnnnnnn nnnnnnnnnn 21000 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 21060 

nnnnnnnnnn nnnatctaag tacactgtac tgtcttcaga cacaccagaa gagggtgtca 21120 

gatctcatga cagaggttgt gaactcagac ctttggaaga gcaatcagtg ctcttaactg 21180 

ctgagcatct ctccagccca aaataattct tactagtaac atggaacaat caagttttat 2124 0 

tatatgatac atattaatca acttataagt acatgattat gcacatttat catatcgtgc 21300 

aaccatcact gctgtcgttt tgttttgttt tgttcttttg aggcccggtt tctgtgttgt 21360 

tctggaactc actctgtaga ccaggctggt cttgaactca atgatctgcc tgcctctgcc 21420 

tcccaagtgc tgaaaacaaa tgtgtgcacc accacctctg gctatcactg ctgtcttttt 21480 
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ttttttttta 
catggtatgc 
tgggttctag 
gccatcctgc 
caaacaaaaa 
aagtcctgaa 
aagccttcct 
tttctcctct 
accctgccta 
gacactgtcc 
cgtttccaga 
gataaaaagt 
ggaggcagag 
aggacagcca 
aacaaacaaa 
ccaaaaacca 
agaatttcac 
aggcattcat 
tgcttagtca 
ggagtggcct 
tcacatccaa 
acacaccttt 
ttgaaagtga 
agaggaaaga 
accagggcat 
agagaatgag 
gcattttagt 
agcagcaaca 
ttactcagca 
ggctagacgc 
gacaacagtt 
ccagcgtcat 
caacttttcc 
gggaaggttc 
gtcactgtta 
tgcagatcaa 
ttataaaatt 
gttggtttgc 
ggtggcagat 
gaggacagcc 
gtcttgaaaa 
agctttcaca 
tattcattct 
catggataaa 
acattctact 
ttgtgtctat 
tgctccatcc 
atgtatgtat 
acacttggga 
nnnnnnnnnn 
nnnnnnnnnn 
gaaccgtgtg 
tagtttaaat 
accatgtgga 
gagtgacaga 
ctcttaatag 
tttattttgt 
tcccctcaaa 
accacatgct 
gccttagaga 
aatgcacact 
ggttgtgagc 
ggtgctctta 
gtaaggtggg 



acagttattt 
atgtggaggt 
gaaccaagct 
tggccctact 
ataaacctca 
cgaatgtgtt 
ctggtcctgt 
ggatgactgt 
cttaagctgt 
atcagtgaga 
accaagcctt 
tttcaaggag 
gcaggcaaat 
gggctataca 
caaacaaaca 
aaaaacaaca 
tgtttaaaaa 
gtgtgtagtg 
ttgagtagca 
tgttgcagga 
agacattcta 
aatcccaaac 
cgtttaatta 
gaggcagcat 
ttgtacagag 
aaggagccag 
cagaaactga 
gctgagttaa 
gcaaatctca 
ttccagggct 
aatacagaag 
gcctgtctgc 
agagatggtc 
caagagactt 
tttagtataa 
cataaagatg 
cctgatttga 
cagaaagatg 
gcctttaatc 
tggtctacag 
acaaaacaaa 
actacaccaa 
agatcaggtc 
ctgttgtttt 
gtaactttcc 
aaaggtatag 
acccaaataa 
atacatacat 
agttgatgag 
nnnnnnnnnn 
nnnnnnnnnn 
cctctgcctc 
gtttcttatt 
tgcatggtgc 
gggttgtgag 
ctgagcctta 
tttgagacag 
gtaaagatcc 
cagcacagcc 
taaataaatt 
gtagctgtct 
caccatgtgg 
cccactgagc 
atcatctcag 



atttcgtgca 
cagaggacaa 
caggttgtta 
ataggtcctt 
aaaaacaaaa 
ctgttgacat 
gaggatgctg 
aaccccagat 
acacaatgaa 
gcacagccta 
gctggataga 
ggccaggtgt 
ttctgagttc 
gagaaaccct 
aaaagccaaa 
acaacaaaaa 
aatggctgaa 
gtttaaataa 
gtactagaga 
attgtgtcac 
ggtctatgtg 
aatgaaggta 
agagtgatga 
aagagggagc 
acaggttgca 
gagattagga 
gagaagtcaa 
actagccaac 
gaggctaaaa 
aggcctaggt 
actatgtaca 
attgttagga 
cactgttttt 
catcccagga 
tgactcctga 
aactttcaca 
tttaaataat 
taataacgtc 
ctagcacttc 
agtgagttcc 
aagaaaagta 
aaagagacat 
caaggatgaa 
gagcttaaag 
tttttataaa 
aaggacagag 
atatgtgtgt 
gtgtaggtag 
acaagtgaga 
nnnnnnnnnn 
nngaattcac 
taaagtgctg 
tttttgttta 
ccttagagtc 
ctgggacttg 
tctccaggcc 
ggtgtgtagc 
tcctgcctct 
atgattcata 
attttttttt 
tcagaccccc 
ttgctgggat 
catct caeca 
tetcegggaa 



tgcatgtgtg 

ctttcaggag 

gaettgeatg 

atataaaaag 

ggaccatata 

gtctgtactt 

atagtctgtc 

tgttcctctt 

tgagctgagt 

gectaacagt 

gcatatttgt 

ggtggcacac 

gatgccagcc 

gtctcaaaaa 

aatccaaccc 

gtttttgagg 

taatatttca 

aaatagcccc 

gggaattgaa 

tttgaggtcc 

atctggctgg 

aagttagttt 

attagagaaa 

atggcagagg 

gagctagaac 

cagattgeca 

attgaatcag- 

agaatccaga 

acatcttaga 

tagcagaegg 

gacatggata 

gtcatctagg 

tgcatggctt 

ttgetgetta 

gctttagccc 

aattaatget 

tttaggaaga 

agaatcaaga 

ggaggcagag 

aggacagcca 

agtaaaggct 

gcttgggaca 

gecacaagtg 

aataaaacac 

ttttatctat 

agaaagaaat 

gtgtgtatgt 

gtatatgtgt 

ggttgggccc 

nnnnnnnnnn 

tctgtaaacc 

ggattaaagc 

tgggtctttt 

cagaagaggg 

aacctaggac 

gtcccatgga 

teatgettga 

gtctacagct 

acaataaaaa 

taaagattta 

cagaagaggg 

ttgaacttcg 

gccccgagat 

tettgectgt 



tataagcatg 
ttagttcttt 
geaagtgect 
ateatatgee 
atattgtggg 
cagacccatg 
taaaaactag 
cagagactgt 
ttccaggtta 
acacatgtct 
ctgtttggct 
gectttagtc 
tggtctacaa 
accaaaaaaa 
cccccaaaaa 
tttaatttat 
ctatccattc 
cataggcttc 
ggtgtggcct 
cagcaacaag 
aattcagaca 
ataaaaagaa 
gatctgetgt 
gagagggagg 
aggtgaagac 
atgttaatag 
ttagcttgga 
aagaactaga 
cctagattag 
agagagtaat 
tgaacctctc 
aaaggctaag 
ttgagaggcg 
attacgacat 
atcctattgg 
gtttagatga 
aagctttagg 
atagaatgtg 
ataggeggat 
gggctacaca 
gcataataaa 
aatttgtgat 
tgtgatatga 
tgctttgaaa 
gagataattt 
aaggtgtggc 
atgtatgtat 
atgtatataa 
cennnnnnnn 
nnnnnnnnnn 
atgctggcct 
atgtaccacc 
acctgtatgt 
tatcagatcc 
ttctaaaaga 
tttggggggc 
atttactatg 
gctaggattc 
aagaaagaga 
tttatttatt 
agtcagatct 
gaecttegga 
aaataaatta 
tactccttcg 



taacgtatgc 
cctcccactg 
ttaccacaga 
gggcaaaaac 
ggagtggatg 
ggaattggca 
agatcacagc 
ccaccaagct 
cagcacagta 
gctttcttca 
tatttcactt 
ccagcactcg 
agtgagttcc 
accaaaacaa 
aaaaaccaaa 
tgcatgtcac 
aegtatttgt 
tacagttgaa 
tattggagta 
gttgctctga 
tgcccttaat 
gcacccatgt 
cacagagcag 
aggggttttc 
agaacaagee 
gctaagcaga 
aaggagtttg 
aaagatgagc 
actgcatgga 
aagccttgga 
agccacttct 
ggcaggcaag 
agctctgaga 
gecttttett 
gcatatttcc 
ataaatgatt 
agatagttta 
gctgggcagt 
ttctgagttc 
gagaaaccct 
gaatacaatg 
caaggaaaaa 
tgaacaagac 
ttaactatca 
tctaaagaac 
atctgggctc 
gtttatctat 
gtatgcatga 
nnnnnnnnnn 
nnnnnnnnnn 
tgaactcaga 
acaacccagc 
atgtgtgtgc 
cctggaactg 
gcagcaggtg 
tttgtttcat 
aagccctgac 
gaggtcttgt 
gacctaaatg 
tattacatgt 
cattacagat 
agagcagtcg 
taatgtatgc 
ctctcccttc 
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25320 
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tattcatgct tgggtaactg gccctggctg attgatgaga gctgatttcc ccactgccct 25380 

gtggcaggga ccactgcgcc cacagggctc cctcaggatc ctcagtacag agctgcacag 25440 

ctgggtggaa gtagagggct gcatatataa cacgatctca actttatttc tttaaataaa 25500 

aattttattt aaattttata cagctctata taaacgaagg aactattgaa ggttcagcaa 25560 

ggacctgcca acggttgtca agggtaatgg cgatgtagtg attttttttc ccccttccat 25620 

tttacttcca tactttctac attaccccac aactggcaag tattatttta aaatgaaagt 25680 

aaatagtgac agatgacttt gaaggaaaat tgaatcggta aaaagaaagc tgagagacca 25740 

cccgggaagc ccaggctaaa tgtaatctgg gtcaggcctc ccaggcctgg ggtctcaaga 25800 

tggtcagctg agggaccctg gtgaccctct tgggccagca gggacgggga ggagccggaa 258 60 

gctgagtacc caaagtgctc ctctgggctc caagggcctg cacagagact gtgtgggaat 25920 

caaaggatac aggcatgagg actgaggcct gacgaaccca gctatcattc gtcctagaac 25980 

aggaggcaga gctccaagag tccaaccaag aggcaggaag ctttgggacc cgagatgggc 26040 

gatgggatta gaaaggcatg tttgcaaata ctttcaaatt tacgatgcac actcactgga 26100 

aaccccaccc ctgggtgtcc cttccctgcc tcttgccaca cccaatagct gacatcactg 2 6160 

gagaaagtcc caagaccagg ctggctggag ctcctgatag gttccaccct cctgcagagg 26220 

gccctcgaag actagcttgc tcgcccacac cgccagatgt ctgtgtcttt ctctcttttg 2 6280 

cctcccaccc tcgtctcttc ctccaacctc agtggagggt cccctgcttc ctggggaaag 26340 

tagaacttgc cagtgctcac tgtaatgtcg tccctgtagg tgtcatggtc ccccattact 26400 

gggagcaggt atgcctcaga tctccctcta ttcgctgccc tttcaggctg tctcagtttc 26460 

tctctgacag ttcctctcct cctgaatcct gcttgttggc atgcgaacag gctcaatatc 26520 

ttccatctca aaaaacaaac actgggaagg tgttgagaga cagagagcat gggtaatggg 26580 

tgccccagct tggctgggaa ggggtaactt acaatgctct actgcccagt agggtagctg 26640 

cagttgtcaa ttaattgtaa atttcaaaat agctagtaga gaggatttta gatgttccca 26700 

atcccaacac aaagaaatga taaacattca aggcgatggg tatgctaatt gctctgatct 26760 

gatcaccgca cattgtatac atgtttttga aatgtcaggc tgtaccccat aaatatgtac 26820 

aattaccgtg cagtgattca agataaaaac tataatttta aaaagctaaa aacagaagga 26880 

aatagctgcc cttgaccccc ccacccccac aaggtccttc- ctgtttgtcc agccacttaa 26940 

tgtcagagct tcctgtggga gggtggtttt ggtgtacaca gacactcctt cctccctcct 27000 

tccccataag aggagtcacc cctgtcccac gatgccatgc' agggccacat gcgtgatatt 27060 

aaccagtaag atgtgagcag ggatgatacc tgtctcttat aacaaacgga aaaaaaacca 27120 

caccaaacca aaaacaaaca aacaaacaaa caaacaaaaa cagggttggt ctgtccctgt 27180 

gtcttttccc acataaagtt aagcacacaa agtagccacc atttatttat ttgtcccctc 27240 

ccccacccct ccccgagaca atgtttctct gtataacagc cctagctgtc ttggaactca 27300 

ttttgtagac caggctggcc tggaactcac agagacacag agattcacct gcctctgcct 27360 

cccaaatgca gggattaaaa gcatgagcca cgaactaacc agtaccccag agctcttgac 27420 

tctagctgca tacgtatcaa aagatgacct agttggccat cactggaaag agaggcccat 27480 

tggacacgca aactgtatat gcctcagtac aggggaacgc cagggccaaa aaaatgggaa 27540 

tgggtgggta gggaagtggg ggggagggta tggnnnnnnn nnnnnnnnnn nnnnnnnnnn 27600 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 27660 

nnnnnnnnnn nnnggcagga tcctgtgttc atgtgcaaca ctcgatgcaa gctgtgtagt 27720 

gtttggttct gagcacctga aggggaccaa gcaggctgat gcccaggcca cgggttcttt 27780 

ctggccccac tgcccactcc caccctctgg catccccatg atgaacatgg ccacagatca 27 840 

cactactctg ctcctctccc agatqcacgg agccataggg tccccagatt catctctgca 27900 

gctaacaagc tgggcagtgt cacctccctc aaggttcctt tcctgctctg agcagcagtg 27960 

tctcccacag tgagacactc atgtccactg gaagatattg tagccattaa attcctgtgc 28020 

taaaataact agggggactt gtcaatcact acactcttag ccccggactt ctgactcata 28080 

gagggtggtg acagctcagg gacctgcatt ctaccaaata gccatgtgtc cctgatggag 28140 

gaactgcccc tggacaacct ctgcagcaac tgaaccctct gtggtctcct agttcttctg 28200 

gacaggtgtg accccagtac ctagtgccag gtgagagagt gctagggcca cactaagggg 28260 

tgacaggaca aggttggagc tggtagatgt ttgggccacc aaagagaaca ggtcagtagt 28320 

aaaagccatc atggcctgag ccagcctgcg agtctcctct gcagttggga cactcttgca 28380 

gtgtcctggg gacctcttga gggtagcatg gtcaccaaaa tcctacaagg acagatcaga 28440 

agtcagtgag gtcaagggaa cagctctagg ttctctgtgt ccctcacgga cctttttttt 28500 

tttttttttt tttttaagat ttatttattt attatatgta agtacactgt agctgtcttc 28560 

agacagctcc agaagagggc atcagatttc gttacggatg gttgtgagcc accatgtggt 28620 

tgctgggatt tgaactcagg accttcggaa gagcagtcgg tgctcttaac cactgagcca 28680 

tctctccagc cccctctcag tcctgatgcg acagggcagc aaaggccttg tcccagatct 28740 

gaggagagtc atgctgaagt ccttcctacc ccaccccttc cgaacccctg aacatcagcc 28800 

ccataactac tgactccccc acccccattc ccttgcttcc actgatccgg tcctcctctt 28860 

ccctctggcc ccacccattc ttccccagcc ccacctgatt gtacctggtt gtccaacttg 28920 

aagagggcag gcaggggcag cttctgctgg gcctgctcac tcactggctg tagaaatgag 28980 

aaaggagatg aagaaaaggc ccttcccatg ggtccccatc ttgccaagac ataggtgagt 29040 

ccctttggct cttcccccta aacctctcac ttttgagtac ctgctggccc gggagatcca 29100 

cggcgctcac cggagagaac tgttgagaaa agggagaaca gagaactcag cgttcctccc 29160 
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tctccaccct tctggcctct cccagatttg cccccgcccc ccagcatctc cttcagcctg 29220 

actgaccact tcccactcag acctcagctc tgcctcaccg tgaaacaggg accttgcagg 29280 

caggacaagc tgagtacgag gagcccccgg agcagtgcca tgttcctgta tccagaacag 29340 

ggagtgttag ttcctacctc acgctcgaag gccaagcagt agactgctat ccatgggttc 29400 

cttgaccgca ccaggctgcg gaacctggac tcaaaacata gcagctgtgg acctcactca 29460 

ctctgagagg tgggatttcc ataagctttt tttttcacct gtacatttag tcttcattct 29520 

tttcgtctta cactgtggat cagtcctggg ttcaaattta aagccctcat cttgcaagag 29580 

gaccttgcgc atctcccttc atgcctttgg ctttaccctg tcttggtaat tcatggcaga 29640 

agttcttcct gctcccatgt agatgttgag gacccaaata agaatctctg taaatactga 29700 

gcatgatgcc tggcccccac cctagcaaag ccacctgacc tgttgttcat ttcatccagc 297 60 

ctttctcagg ctgccctggt cctacccaaa ggctctgaga gctaatctgg gctggcaggg 29820 

cagccagaaa cttctttgtt gaccaatgaa tgactggccc agacaccttt ggacttacgg 29880 

gaactacaag cctcatccca cttctgctcc aagttctgat ccagggtgct tcggggaagc 29940 

ccagctggcg gaagggggga ggctctcagc ctagagagcc ttcctttcca tcctcagccc 30000 

cctacccagg ccttatttca ggcaccagct cttctaaaag gtccttctgt tatccctaga 30060 

cctccacaac tgtgttcaag aaccttcagc cagggcctca tctccaatct ggatatatga 30120 

tttttctcgc caagagtagg cctccaggtt ttggagttct agaggtttct cctggagctg 30180 

cctggacctc tgctcctcac caccccagga cgctgtgaag ctgcaggctc cctgaataaa 30240 

ttcatccaga ccccttgcca aggtgccagc tgtctacttc ctctgctgcc caagcagcag 30300 

gctgcaccac ccctccatcc tacctcttca ggcttcttag cgcagcacac gcagcacacg 30360 

gtgttctcct ggaccagctt gctccccacg ctcccccagt gcagccagca gggcctagct 30420 

ctctcctccc acaggacctt tgctctcagc caacccccgt tcagcttgtg ttcagtgctg 304 80 

gtaaatattg acctgtacat ccggttaaac attgatatgg gggccagaag accctttccc 30540 

atcaaggcta cccagaaccc tgcctgagcc tggagaaggg gtttacagga gcagataagt 30600 

gaggaggttg ggcctggcaa gccttctaat gatccctcaa cataggggat tatccacagt 30660 

cagtgaggct cagagaggct gtgtggcctg tgtaagggcg cagagtgggc tccagagtca 30720 

cagccaaagt cccaccacca ccaccaccac caccaccacc ' accaccacca ctaccaccac 30780 

caccaccacc accaccacca ccaccaccac caccaccact accaccacca ccaccaccac 30840 

caccaccacc accaccacca ccaccaccac cacctcatct acccatacta anttgaggct 30900 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 30960 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn tctatgaggg ttacatttta 31020 

gaacatctct ccttttttcc tttttgagac aatcttacta tatttaggct ccccttgaac 31080 

gtgtgatcct tctgcctctg cctcccaagt gctgaggtta caggcatgca cagtcacatc 31140 

tgcctacaca atgtcttagc agcccttagc agcaccaggg gtcaggaagc cctcaactgt 31200 

ccctttagct ggcttctctt gtgaagggct atgtcttctt ccccttccta gcatggagac 312 60 

ggctcttagc cccagagcct tccttccttc aggttaaaca gcaccagttt ttggtgggac 31320 

ctcccatttc cctctatctc cctaagcaac gaccttttct gctctgactc tcatctggca 31380 

cttggaccac aagacaaaac tgcagcctgg gctgtgtgtc ctcgcacatc attcctgtgc 31440 

ccccctggag tcaggtctag gggaggaaga cagggttcac gactcagaaa agaccactgg 31500 

ctgtcctagt gtgccctcac ccatcctata gcacgcacat gctgatgtgc cccctccgct 31560 

ccatcaccat cctctcatgt acacgtgccc tccctcgcca gacacatgca tcactaactt 31620 

ttctgacttc ccagaaaaat atctgatctg agaagttagg agtctgccat catcagctat 31680 

ggtccttaaa attaagtcag acaatccatg ggacatgaag ggcaacaacg agaagactcc 31740 

tcgttccttg ttcactctgc ttttggcagc accaccagca ggaaccaacc tggctctccc 31800 

taatccctca tctatagcag gtctcccggt gggaatttta gggacctctg tgttctcatc 31860 

caggggcact gccactcagc tgctcaggga gagacccctt agaacaacaa agaaatcaat 31920 

gcagatttag gcttcttgtt tcccttccca gcccctccca tcacaggcaa cagcctccct 31980 

tggctgagcc tcaggaggct gatttatcag agaggtgctc agagaggcac ctctggtccc 32040 

tctgggtagg tagcaactga gacaggagga gatggtcacc ctgggcatcc tctaccagga 32100 

agtaaatgag atacccttgc agatgggacc cctgaagttc ctcccggggg cgggggtggt 32160 

ggtggtggga gtctaagtca cagatctttg ttaccacgtg gttagactga ggactgaatc 32220 

tgaggtggga aatctgatgt gcatggggaa acacagaggt ccaatgctgg ccaagagcta 32280 

caagcaggga caggtgctag ggggatgtct gaatgttcca ccccaagcca caggaataac 32340 

ggaaatggag actctaaagg gcagaaagtg agggtgtgca gcaggggctg cacaggacac 32400 

atgcaaggcc ctggctgcaa taactgggtt ggggaggcag tcattggcta gccaggggca 324 60 

ccaggacagt gatgccatcc tgtccaaagg gcagtgtcca agccagattt ctaggctcca 32520 

gggggaggag ggtccgggga gaggggtcaa gattctcccc ctctgagtca aggttggcct 32580 

tcccatgtgc cccaaatcag gaggcacaga aactgggatg ttgtggtctc acatccaagc 32640 

tgagaagaca agtgggagcc agtacatgtg tttcagatta aacccagtcg gagacaaaca 32700 

tgttgctcct cctcctccca gagccaagct gccttcaagc cacatggcag tgaatatgcg 327 60 

gacagtgcag gggaggacac ctctctctcc actggctcaa ggacagtttc aaggggttca 32820 

ggctggctgg ctcatggcta cgccgctcac cccctggaca gtttggggtt tttccctcct 32880 

gaaatcttgg aatctgaatc agcctgagat accccataat tgtacctccc aacaccccca 32940 

gaaaggtcag ccctgcagaa cagaactctt tggtccccac ccatccccct cagccctgga 33000 
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ggctgaactg 

ttcctttgac 

agggcagcca 

atgacctcat 

aacctagcag 

tctgaaaagc 

tcactgctgg 

ggtactgctg 

cctttcccaa 

ctaatagagt 

ggttcctaaa 

tgctctctgg 

tgtcttcccc 

gcctgcactc 

ccagcaggct 

tgtgattgca 

tccaagacca 

tcttcattgg 

ggggtaggtc 

tggagagact 

gctcactgca 

tgggactgta 

tggcccagtg 

cacatagcaa 

ctnnnnnnnn 

nnnnnnnnnn 

catgccttta 

cattgtctac 

ataaataggg 

ccccagcaac 

tcaaggtgga 

agtatgctcc 

ggtaacatgg 

ttcagtggcc 

tttatgctgt 

aataatttaa 

acatgtgtgc 

gggttgttag 

ttaatcactg 

ggctcagagg 

ccacatggtg 

agacagctgc 

ttgttttttt 

tgaaactcgc 

ccaagtgctg 

caccaccacc 

gaaaatgaca 

tgcataaaca 

ggtttggatt 

agtagaactg 

ccaaggggaa 

ggctgcttca 

gctgttgatt 

tgttagtaat 

tcctcttcct 

gcagagcccg 

aacagcttca 

ggctacactg 

tctgtagagc 

gggcccccgt 

gcgggaaggt 

cattccactc 

tcttgctgaa 

attaacacca 



atgggcagct 

tcagaccact 

gaccccacaa 

ttgtctccca 

gcctcaggca 

agcgatccga 

tggaagcaac 

agtccagccc 

caccattcaa 

cctgggcctc 

ctgcaggggc 

ggtgcccctg 

aggctagacc 

tgaggggctc 

tcagattaca 

tcattgtctg 

cacgttggcc 

gaccttcatt 

atgagtctga 

gtgagccagc 

ggtgggagca 

cctgacagct 

cttatgagca 

ggactcgaaa 

nnnnnnnnnn 

nnnnnnnnnn 

atcctagaac 

atagcaagtt 

ccagttggca 

catgtaaaaa 

gacagacgga 

cagccagtga 

tggctcagtg 

actaaaggtg 

ggtacacaca 

aatgttatgt 

aatgcctgtg 

ctaccatgtg 

agccatctct 

ttaagagtat 

gctcacaacc 

agtgtactca 

tgtttttcga 

tctgtagacc 

ggattcaggg 

ccaccaccac 

cctagcattg 

cacataaata 

gtgtcccccc 

ttagggttta 

tctctggctt 

ccacatgccc 

tccccagata 

tgtgtgtgga 

catccacctg 

ttcatgttac 

aacaggagaa 

aagctaccgg 

actgagacat 

gtccgtgtcc 

gaggcggacg 

gaaagcagcc 

gtcgggtgat 

cacacacaca 



aaggtccaga 

ccctgccgtg 

accagctagg 

gctcttttgg 

gcccactgct 

atggagtcag 

ttcaggaaga 

cagagcttag 

aaccccagtc 

tgtcccccca 

tactataacc 

ccactcccag 

aggcgctgcc 

caggaagcag 

gctactcttt 

gtgttaactt 

accctcctct 

gggacatgat 

acttgaacat 

cggtatggag 

aggtgattct 

ctgaaggttc 

ctggctgctc 

ctgcttggaa 

nnnnnnnnnn 

nnnnnnnnnn 

tcaggaggca 

ctaggctagc 

aaatttagct 

gaagcaaggt 

tccatggggc 

gagactgaaa 

agaaaagata 

gaaggagaga 

cacgcacaca 

gfcatgggtgt 

aaggctagaa 

gattctggga 

ccatcccctc 

tgactgctct 

atctgtaatg 

tatacataaa 

gacagggaga 

aggctggcct 

tttgcgccac 

actacctgac 

tcctctggtc 

cataagtaaa 

aagggaaaaa 

ggcctggtgg 

catcttttcc 

agaagcaagg 

tttgtagtag 

ggtgccctcc 

cactccgaga 

aagtatgtaa 

ggaagaagca 

ccgccttcct 

taagtactac 

ctcagaacaa 

ggctggaggc 

ctgtgttcca 

tcaaacaacg 

cacacacaca 



cagtggctgg 

gcttacatca 

ctaaatggtg 

taagagagaa 

ccttgctgta 

gctgagctgc 

gcccagccta 

ctccctgctt 

cagcctctcc 

attctctctc 

ctacctccac 

tccctctagc 

ttacctgtgg 

tgaggggagt 

tcttaaagtg 

taacccactg 

cccaccacat 

ggctaacttc 

ctgaaaggat 

atgctgggtt 

tctcccctcc 

caaaggacag 

tcgcagagga 

ctccagctcc 

nnnnnnnnnn 

nnnnnnnnnn 

aagcagatgg- 

caggtacatc 

tgccctccta 

gtggtggcac 

tcactggcca 

aataaataaa 

cttgtcatac 

accaactcca 

cacacttgtt 

tttgcctaca 

gaggacatca 

acaaaaccat 

aatagtatat 

tccagaggtt 

gaatctgatg 

gtaaataaat 

cagggtttct 

tgaactcaga 

cgccaccacc 

tatttaactt 

cctacatgac 

cttagtctgg 

ttgggtccca 

gaagtggcca 

ctttgctttc 

ggagcagtca 

taagagacag 

ggggacgggg 

acaggaaatg 

attcataagg 

aacattaagg 

ggaatgtata 

tggggtccag 

agaggaaagg 

ctgggctggg 

cttgggtgag 

actgtagctc 

cacacacaca 



ctcttggaaa 

ggaggtgcaa 

cttattgttc 

tgagaggaag 

agggaaccag 

aggaagctca 

tgggactata 

cccaccaccc 

tactggtcta 

ccctctcatc 

ttccttgcac 

cagggagcct 

ttgcggcagc 

agctgcctct 

acctgactcc 

ctgcccttcc 

ctcccttgga 

aggggcactt 

tggctgagag 

cttccaggcg 

tcacctggaa 

tggggtgggg 

cctgagttct 

agagaatctg 

nnnnnnnnnn 

nnatgtcagg 

acatctgaat 

ataagaacct 

acacaagaac 

ttgcttgtga 

gccagctagc 

taaataaggg 

aagcctagca 

aagaattgtc 

acatgcatat 

tgcatatctt 

gatcccctcg 

gggttttcca 

atttctgggg 

ctgagttcaa 

ccctcttctg 

aaaccttttt 

ctgtatagcc 

aatccgcctg 

tccaaggctg 

ttaaaggcag 

cccatgtgca 

ttgttttgga 

gagtagtact 

tggctagaga 

aggtcctaag 

tggctgggac 

gtgaggaacc 

gccctcctgg 

gtgactttgg 

accagtttct 

aaaagctctt 

atcagcttcc 

gattctgcct 

ttggttaagg 

gctgcttcct 

cttcacgggt 

tgtggagatt 

cactccctgt 



gcctgtctct 

gggctgcagg 

gcaagaggcc 

ctggacagag 

caccgatggt 

ccttccttgc 

gctcctccgg 

accaccacat 

cagtgagcgg 

tgttcacctt 

ccctcttttc 

cttccatatc 

ttctctcaca 

caaccagcgt 

atttggaatc 

gccatgtggc 

tctttatctc 

gggccagcct 

gcaggctgca 

cttggctctg 

aatgaaggaa 

actagagagt 

gttcccagct 

acgctatctg 

nnnnnnnnnn 

catggtggtg 

ttgaggcccg 

ttctcaaaaa 

ccaaagtcaa 

tccagcattg 

tggtctactt 

gttaggaaga 

accctcaatt 

tcctgacagt 

gcatacaatt 

tctgtgcacc 

gagttacaca 

aaagaggctc 

ctggagagat 

ttcccagcaa 

ctgtgtctga 

tttttttgtt 

ctggctgtcc 

cctctgcctc 

ctgctgcggc 

ccatctcatg 

aacacatacc 

aatgtgctat 

attggaagag 

gacatgccaa 

acagtcacaa 

cgctaatgca 

ccacagcaag 

ggcaggacgt 

cagagcttaa 

ctccatatga 

ttattgcaga 

ctctgggggt 

atgaagagga 

tgatagtcta 

gccccctctt 

ttgccagtaa 

cagagattcc 

ttgtgtaggc 



33060 

33120 

33180 

33240 

33300 

33360 

33420 

33480 

33540 

33600 

33660 

33720 

33780 

33840 

33900 

33960 

34020 

34080 

34140 

34200 

34260 

34320 

34380 

34440 

34500 

34560 

34620 

34680 

34740 

34800 

34860 

34920 

34980 

35040 

35100 

35160 

35220 

35280 

35340 

35400 

35460 

35520 

35580 

35640 

35700 

35760 

35820 

35880 

35940 

36000 

36060 

36120 

36180 

36240 

36300 

36360 

36420 

36480 

36540 

36600 

36660 

36720 

36780 

36840 
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tgattttcaa 
cactctgtga 
acctggcctg 
acacacacac 
taatggtgtc 
cagcaaggtg 
catgtcctgc 
cagcacagct 
gctctcttct 
catacttcct 
tgtgagccag 
gctgtaacgg 
tactggcagt 
aatgaacaca 
aagcaacttg 
actaagggga 
ttgctccccg 
tgacactgcc 
acgggtttgc 
aaatgactct 
aagtggaatg 
cattgaacca 
ctgtggacac 
nnnnnnnnnn 
nnnnnnnnnn 
gagagagaga 
attccaacac 
taagtttcag 
cacacacaca 
tggggaaaca 
gccctcactg 
gaactcagaa 
tccccccttt 
ctgggttatt 
ttcatcagtt 
aactggtgtg 
gagcagtctt 
acaataaaag 
cctcgcctga 
ctccgcatgc 
taaatacttg 
caaagttcgt 
agattttggg 
gtaatggtgg 
tgagttcgag 
aaacccctgt 
aaatgttttg 
aaagtttagg 
gtgagacttt 
agagtccagc 
aaaggcattt 
gagtgtgctg 
ctcacataca 
gggtcaaaag 
gtgccacact 
tgagaccccg 
tctgcaagat 
caaacctagg 
agatgaggag 
gcgcttctgc 
agctcgggac 
agcccatagg 
acgaaccgag 
agccttgtct 



gaaagcaagc 
caggctcagt 
agagaatgtt 
acaccaagga 
acatgttccc 
gcaggctcca 
agagacttgg 
tggatagtct 
atcatggtca 
gtcaagggca 
aaaaaacaaa 
ggccccagcc 
ccttccatga 
cgtttgtttg 
gagaggaaag 
ctcagggcag 
tggcttctta 
cattgtgggc 
ccagaggcca 
aacttgtgtc 
tcccccagag 
gaaatgaaca 
agcgactggg 
nnnnnnnnnn 
nnnnnnnnnn 
gagagagaga 
ttgagaggca 
aatagccagg 
cacacacaca 
tctgagcaga 
gaggtgtctg 
ttcagtttca 
atctaaaatg 
tgcatctaca 
tcctataccg 
gtgctacttg 
gactactcag 
aaaagaaaag 
aagcctccac 
accatggctc 
taatgattag 
gcacattgac 
atgcttcatg 
catatacctt 
atcagcctgg 
ctctaaaaat 
taggctgcct 
ggactctaca 
gcagtgggca 
tttccataaa 
gcagtctaag 
actccacaaa 
tgtccacatg 
cagcaagaga 
ctgagcacag 
cctcactatg 
gagcccatca 
caggcaggat 
ttcacatcaa 
cagcacctga 
cccagtgagg 
aagaacaatg 
gactgcacat 
gacagcatgg 



tagaagtgga 
gacctacctg 
ttgggctgtt 
ctgagtctaa 
tgagccaccc 
agactcagcc 
taagttcgcc 
tcagttctcg 
agttctgggt 
gctgggcaat 
acaaacaaac 
ttgtgctccc 
gtagggagtt 
tggttccgct 
ggtttatttc 
gaactgaagc 
gcctgctttt 
tgggcccccc 
gtctggtggg 
aagttgacca 
actgcctgcc 
ataaaatgga 
cttcccagag 
nnnnnnnnnn 
nnnntggcct 
ggataaaagg 
gaggcagggg 
gctacacaga 
cacacacaaa 
ggaagaaatg 
cacaggagaa 
aagcactgac 
agtaagaacc 
aaagaggtct 
ctgactacac 
ctcatgatcc 
tgaatttgag 
gtggcgagtg 
acggaagggg 
gtgcaccccc 
tttctgaaga 
ctgtcgaccg 
ttgcagctac 
tgatcccagc 
tctacagagt 
cactaattta 
taacaaagcc 
agcatcctca 
aggttcagct 
acctgtgcag 
cccggtgaca 
tttgtcctct 
cacacatgca 
tgccctgtga 
gtggtactga 
tagcccaggc 
tgcccaccat 
ggtggactcc 
tctcccgtat 
ccaatgcaga 
gtgctggggg 
tcagctggcc 
gaagggatcc 
gaggggaggc 



gtacctcaca 
agaacttata 
ataggacaca 
tgggaggtgg 
tacaaagaaa 
ccgagcgcaa 
ttcgaagctc 
gtcgatgtca 
cacggtcagg 
ctaacccaac 
aaaaaaacac 
cagcctcagc 
ttcttctcag 
acggtttcta 
atctgacaat 
ggaagccgtg 
ttatgctatc 
cacatcaatc 
ggcattttat 
cacgaatcag 
agcactgctg 
tcctttgaac 
cccccatgga 
nnnnnnnnnn 
catcaggtat 
ttagcccagt 
gagctctgtg 
gaaaccctgt 
taagatcttt 
gggtgcgcag 
cacttgcact 
aggagcagtg 
aaaaaaacaa 
tggggaggaa 
aggggctgaa 
caacacttgg 
gctagcctgg 
aggtggccca 
aaagccagct 
acacccaccc 
acaatatttt 
tgtagtacgg 
tctgatgaac 
acttgggagg 
gagttccagg 
aaaaaaaatt 
caatggccat 
ggaaggccac 
ctttatgtag 
ggcctctagg 
tgagctcaat 
gatctataca 
tatacatgcg 
aaaacgtctc 
tatcgttcct 
tggcctggaa 
gttattgaag 
tgtaatctca 
gcgttggagg 
tgcagatgct 
gaggactgga 
aaaccaccca 
atggctccag 
cattggtcct 



agctatgagg 
ctgaagccag 
cacacacaca 
ccctcccctg 
tcagttctgt 
caaacatact 
tggggacagt 
cagaggtgag 
agatgatgct 
acaggttagt 
cttcccctct 
ggctactggt 
acagaagttt 
acatgactga 
tcttctcatc 
gctttctggc 
tgcccaggag 
aaaatgaccc 
tcacccttcc 
ttaggagctg 
ttgtatagag 
atcctagggc 
nnnnnnnnnn 



gtgacttgtg 
gccaagatgg 
tagagataca 
ttcttcattc 
gccacaggac 
agtggccttg 
agcttcagct 
tgaatgaact 
ggcaggaaga 
agagatgcgc 
caacagctgc 
ctgggctgta 
cctaaaaccc 
ttgtgataaa 
tcgcagggtg 
gaggaacgct 
cagaaccact 
actaatctag 
caattgagtt 
ggcctggttc 
accatttgct 
agatgtgttg 
atcannnnnn 
nnnnnnnnnn nnnnnnnnnn 



cagagagaga 
ggtggtggca 
ggccagtttg 
cttgaagaga 
aagaagaaaa 
gacacccacc 
cagcttgccc 
actggggacc 
aagtgtttgg 
acccagttct 
ggtaatctca 
aaggtaggtc 
gctacatgaa 
tcaggtaaaa 
tctacatgtg 
acccacccac 
cgttgatctt 
gatccgctcc 
agtgttgctt 
cagaagcatg 
atatccaagg 
cctttctaaa 
tcagagaagg 
agaaagcaga 
gaagaagaga 
caaagccctg 
ccttggaacc 
tgtatgcacg 
catgcgtgca 
attcagtctc 
gattgatcga 
ctcacagtga 
caataccatg 
ggacttgaaa 
ctggagtggc 
cacagccaac 
ggagctgaga 
gagctcccag 
atgcatatgc 
gtggaggttt 



gagagagaga 
cataccttta 
gtttacagag 
aacacacaca 
gaaaggatag 
ctcagaggag 
tagggcgtca 
ccaggttgaa 
gatttggaat 
acccctggaa 
atgttttcat 
agaagttcaa 
aactcataaa 
gtgccaacca 
gtcctctgcc 
ccacatgaca 
gtttagggaa 
aggaagctaa 
tctgggccag 
tagatctctg 
ctatacagaa 
cctatataac 
ctcaaaagag 
gcctgggcca 
gtcaacagtc 
tgttaggggc 
caggtggaag 
tgcacgcaca 
cccacacaca 
ccatcatcca 
tcagttgatt 
tcctcttgct 
ctctataaag 
agtagaaggg 
tgttccctgg 
catcagactg 
cgggattgca 
ggactagacc 
agcagaggac 
gatgccccag 
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tgttggagga 
agaggcaaaa 
agaaaaaaga 
acccccaagc 
atacgtgcaa 
gtgcacatgt 
ccaacctaaa 
tgtaacatta 
aaggcagcca 
gctgcagggg 
ctgtaactcc 
aggtgtgcac 
catgtggatc 
ggtggtgcac 
aagaccagcc 
gtctcaaaaa 
gaggcagagg 
ggacagccag 
acagaaaact 
cttccagggg 
acagacatgc 
aactaataat 
tagcacttgg 
agtgagttcc 
ataaaaaata 
aggcgttcca 
tccctcagct 
tctgtgagtt 
aatagaaaca 
gactcagtgg 
aaaatttatt 
gggagcctag 
aattaagagc 
actgtagctg 
agccaccatg 
taagtactga 
ctgagaccaa 
ttctccagtt 
gagtagacca 
cagatgggaa 
tggtgctgtg 
tgattattgg 
tggggtttgc 
tcttcgccat 
cagtcctggt 
ctattaatta 
ggaactctac 
catgtgtgac 
ccaatgttct 
ctgatagggt 
cattctagct 
ctcctggggc 
cacagatgca 
agatcaccct 
gacaacactg 
ttgctgcgga 
gagagagtca 
ggtctcagga 
ggattcaatc 
tttttgtgac 
tgtttgcgct 
ctcatctcct 
cctcagtagc 
cattttctcc 



tgctggagcg 
gggatggggg 
tgtctctgaa 
cccacagggt 
tccatggcat 
aattccatca 
gacctacacc 
aagttagaaa 
catcaaaaca 
cctgaattct 
agctccctgg 
acatacacat 
tgggaggagc 
gcctttaatc 
tggtctacag 
aacaaaacag 
caggtggatt 
ggctacacag 
ccagcccagt 
atccaatgcc 
acataggcaa 
ttaaagaaaa 
gagacagagg 
aggacagcca 
aggccaagta 
tagggctagg 
gcatgtggta 
caaggccagc 
tcntacctag 
ctaagagcat 
taaatgttta 
gtttgagtcc 
ataagaactt 
tcttcagaca 
tagttgctgg 
gccatctcta 
gaactaaagc 
ctgatcagct 
gaggccccat 
tttcagacca 
gctgccacag 
ctatgccagg 
cctgtctgag 
gtgaggctcc 
gctggagtct 
attaattaat 
ccatcataca 
cgcatgcatc 
acttctaaat 
ccttgcctgg 
gtctggaggg 
tgcacctgaa 
tgatgggaaa 
ctactggaaa 
gcagtccaac 
accagacaac 
gagacagaga 
ctgagggtat 
tgaacacagg 
acacccctaa 
tatgtctaca 
ctgaaaccac 
aggcagcaga 
cacccttaag 



gtggggcagg 
agaaggcaga 
agtaaaaagt 
ggaaggagag 
acgcagaaac 
ttcaggagac 
caaagaagga 
tctctcttca 
gtgtaaatca 
gttctcagaa 
gatccaacac 
caataaaaat 
tgagaggcac 
ccagcacttg 
agttagttcc 
ccgggcggtg 
tctgagttcg 
agaaaccctg 
ttccagcaca 
ttcttctgac 
aacactcaca 
aaataaggaa 
caggcggatt 
gggctacaca 
attcttggaa 
aatgaggctt 
cctttaatct 
ctggtgtaca 
agcccngcca 
tgactgctct 
ttacttgtat 
ccaaatcacc 
ctttttaaag 
caccagaaga 
gaattgacct 
cagctcttat 
tgggagattg 
cccgtaccag 
ctaaacagcc 
atgtcatttc 
ttggtgtggc 
aaccagtctc 
gccatgggac 
ctggggtcac 
actgagattt 
cacgtgacaa 
tggcgactat 
agccctgact 
gcatgcacca 
catgcatgaa 
ccctcaatcc 
aaaggggact 
gccttctgga 
ggtggtttca 
agacagacga 
aaggtacgag 
cacacagaga 
agtctactgt 
aaaaaagggg 
agtcactgcc 
cagtctcggc 
tgcagcaagt 
gcagagctct 
taccctatct 



agtgggtggg 
tgggatgggg 
acttgtcact 
aaatgagtcc 
gcaaagacag 
agaagcagca 
caaactataa 
cactgtgtag 
acgagaaaac 
cctgcatcaa 
ccatttctgg 
caaaaccagg 
tgggtgaata 
ggaggcagag 
aggaciagcca 
gtggcacacg 
aggccagcct 
tcttgaaata 
cacagggtga 
ctctgtgggc 
aaataaaata 
gccgggggtg 
tctgagttcg 
gagaaaccct 
gaatcccaag 
agcgagagca 
aggctctccc* 
tagctagttc 
aanaaagggg 
tccagaagtc 
tattatttaa 
aagaaaaaat 
aattcttatt 
gggcattgga 
caggacctct 
caggttgata 
aaaaatgcag 
gggtctaacc 
tgcctgcagc 
ccaggacatc 
aggatcaggg 
tcaagcagca 
tcttctgttt 
ccagccgtcc 
accattaaac 
ccccagcgtc 
gaagaacaat 
gctaaaagtg 
aacgcctccc 
gctgtggaca 
actgtgttcg 
gcttaagagg 
agcaactggc 
gtctaggttc 
tctaaacttc 
ctactcccat 
gagacagaga 
agagcatttc 
gggggggact 
tacttccctc 
tcccacttcc 
gacttgtgtt 
gtcttctcgg 
tctctgtctc 



taggtgggga 
gggttgtgga 
aagcatgagg 
cacaagttat 
acaatgagtt 
gagttgttgg 
ggaaaaaggt 
atactgtaca 
cagaaacaac 
gccaagagaa 
cctccatcag 
gatgaagggg 
caataaaaaa 
gcaggcgaat 
ggtctacaca 
cctttaatcc 
ggtctacaaa 
aataagcatt 
ctcacaacat 
accaggattg 
aatctagcaa 
gtgtcgcacg 
aggccagcct 
gtcttgaaat 
gggacactaa 
acttcgctgg 
gaagcagagg 
caggacagaa 
agacctgaga 
ntgagttcaa 
atttaaataa 
gttatcattg 
tattttatgt 
tcccattaca 
agaagagcag 
aaatttaatc 
accaccaagg 
aggcctgtgt 
agctcctctc 
aacacagcag 
gctggcattg 
gctcttctcc 
gatggtcgcc 
ctgctgcctt 
agcaacgttt 
catatgggtg 
gtgacagaaa 
gacaagcccg 
acaggaccag 
cgaggcatta 
cgctgttcca 
gctcctacca 
tgccaaaggc 
tgtgggattc 
caaggcacag 
acaacataca 
gagagtctaa 
cccagcatat 
tcgattatct 
accgccatga 
tcctcccctc 
gactgccaca 
agcttcttct 
tgcttgttga 



gcaccttcat 
ggggtaagaa 
atatgagtca 
tttctgatct 
gggtgtggtg 
aaatctaagg 
ggtcgaccaa 
aggaagagaa 
tcaagagaag 
tcaaaactgt 
catcactcac 
tagggaggtg 
tttggtgcat 
ttctgagttc 
gagaaaccct 
cagcacttgg 
gtgagttcca 
tgttgctgtt 
cataactcca 
catacagtgc 
aaaaaatttt 
cctttaatcc 
ggtctacaaa 
aaataaataa 
gtgtatataa 
tgtatgaaag 
cagaaggatt 
agggcgatat 
ccagagagat 
ttcccagcta 
ataagtaaat 
ctaataatca 
atgtaagaac 
gatggttgtg 
tctgtgctct 
tcgtggagcg 
ccctgctcat 
ctgcttccct 
tctaggtgga 
ccaaatttat 
gcacagtgct 
tatgccatgc 
ttcctcatcc 
gactccatgc 
ctctaaaata 
tggaaaatga 
atgctaacat 
aagcgaaagc 
aggtgcagct 
ttcgcaagaa 
gcaccagtgc 
agcctactgc 
tctggacaag 
caggaaatta 
ctggtagaac 
aaaaagcaga 
agagagtcag 
acaagaccct 
caaattctcc 
agtaaagagc 
tgcttctgtg 
cggaaactct 
ctcttgtcgc 
tccttggacc 
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cttttccttt 
cgcccccact 
ctctagctcc 
aaaatattca 
tgctatgact 
nnnnnnnnnn 
nnnnnnnnnn 
agagcagcca 
accaccacca 
ggttggttag 
agaaactcct 
aaatgtgaca 
acagtctgta 
cctttgcaag 
tatcagattt 
ggaaagccta 
aacagcacac 
gagtacgata 
tactgtgtag 
acagagtcat 
ttcaaggcta 
tggacatctg 
aagcatcacc 
aggtggggtg 
gtctactcca 
tctagttttc 
caggaatgca 
ctttcctttt 
tatgtgtacg 
tctgggttac 
aatcaccccc 
acagacaagg 
acctgagttc 
ttgtcctctg 
gaataaataa 
tgggcaacaa 
gaaaaaacaa 
gggataaaaa 
tgtctacaac 
cctggaggta 
ggagtactag 
aaatttagaa 
attgaggttc 
agcatattgg 
catattcaga 
aaaaattctt 
cagctacagt 
tcaggctggt 
gttcaaatcc 
cttctggagt 
aaaaaaaaaa 
gtaacccaac 
aaagagaaca 
cgagacacca 
agctgtgtcg 
tgaaatccaa 
acgccagagt 
"tgcacaggag 
caattcccag 
aatgcccttt 
tgcatacata 
cgggcgtggt 
tgagttcgag 
aaaccctgtc 



ctatgaacaa 
gtggagaaaa 
atgacttaaa 
gtcaactcct 
cccagatcca 
nnnnnnnnnn 
nnnnnnnnnn 
gtgttcttaa 
ccaccaccac 
tttagtctgt 
gaaagtgaag 
tgtaaataca 
gcctgaatta 
gagaagggca 
tattgctttc 
atcttcatag 
gtgtatgagc 
gcacgtgtat 
gtctcagggg 
cctgccagcc 
actcactgac 
tgctggtagg 
aaggagttcc 
ggtgagcctt 
tgaccaaaat 
tttctttctt 
agagctctgc 
agattttatt 
tatcagtatg 
aggagtttcg 
acccccaccc 
gcctgcaaga 
actgtctgga 
ctctccccat 
aactaaaatt 
taacaccgag 
acacatttcc 
caccctgaca 
atgacctgtc 
gaagctgata 
cctaggggct 
aatgcactac 
cttcttccaa 
gattatagat 
aattaactca 
acaaccatcc 
gtacacacat 
gagatggctc 
cagcaaccac 
gtctgaagac 
aaactatgaa 
agaaaaacac 
gcagaggatg 
aaatcacaat 
ttaaatacca 
gctactgggt 
taactcagag 
agatggctca 
cacccacgtg 
tcacagtacc 
cacaggcaaa 
ggcgcatgct 
gtcagcatgg 
ttgaagaaaa 



aatatctcct 
cccagggcct 
gcatctctgt 
tgccattcaa 
tccacttggt 
nnnnnnnnnn 
nnnnnnnngg 
ctgctgagcc 
caccccacca 
gagacaggag 
gcctacaaaa 
tgctaaaata 
cagacacgac 
gaagcttcca 
tctttctgtt 
cccatgtatg 
accacagcat 
agagttcaga 
tggaacttgg 
cagctttctc 
tctgagttat 
aacaccatcc 
accacctcaa 
gtgatttatc 
ggtgacggtg 
ttttcttttt 
caagtgagct 
tcgatttatt 
ccatgggtat 
agctgtcctg 
agccttcgag 
tggctcagca 
acccgcgtaa 
gtgtgccatg 
agcttagtaa 
agtagaaagg 
tgtacatcgt 
aaaattaacc 
acccagggaa 
cagaagccat 
gtactgccca 
aggtttgcac 
aaggacttta 
attctcataa 
atgtgaacca 
ataacaaaaa 
ataaataaat 
agtgggtaag 
atggtggctc 
agctacagtg 
gaactatgaa 
ttgcctggcc 

agggcatcct 

gataaaagag 
cgcaggaagt 
ggactgaggg 
agcagcttgg 
gtggttaaga 
ggcacttaca 
aggcatgtac 
acttacataa 
tttaatccca 
tctacagagt 
taaaaaaaaa 



taaaggatct 

tgcacatgct 

gctgtcaaat 

aatggatgac 

agcccaggaa 

nnnnnnnnnn 

ttactgggat 

atctctccag 

ccaccccgcc 

ctgtcccttt 

cagccacact 

attcgttaag 

acgagccatg 

tccttgctag 

tttctttatc 

tgagcacttc 

aggtgtgtga 

ggagaactga 

gctcagcctt 

tttttctctc 

cagaactgct 

ccactcggct 

ccagcaaata 

gttaggatct 

gctgaccaaa 

cttttacccc 

acattcccag- 

tgtctatgtt 

acagaaacct 

tgggttctca 

ttcaattctc 

ggtaacgaag 

agggggaagg 

gaatgaacag 

cttttatgtt 

caagcatgta 

tctgggagtc 

tggggaggaa 

gtcagggcag 

ggagctgctt 

cagtgggcta 

acaggccaat 

gcttgcatta 

aaaaaagaca 

gaagctctga 

tctgatgccc 

aaataaatat 

agcacccgac 

agaaccatcc 

tacttacata 

ctacaagaag 

tgcgtttggt 

gacttgagtc 

atcaacaagt 

gaagaggagc 

aggacaacag 

gtagctttaa 

gcattactcc 

atcatccata 

acagtgcaat 

aatactaagc 

gcacttggga 

gagttccagg 

aaagaaaaaa 



cttctagttc 
cagcaggagc 
atacacttcc 
ctcaaagcca 
tgaactcann 
nnnnnnnnnn 
ttgaactcag 
ccaccaccac 
ccacctgctc 
tctatagtgg 
tatttgttgg 
tcagtgaaca 
acagaggctg 
caaatctttg 
atatttgttt 
agcatatgtg 
gtaccacagc 
gagagtccgt 
ggtggcaagc 
tgttatgtct 
tgtgagagca 
tggatgacga 
tttacctcct 
catgggagtg 
aagaaacagc 
acggtactaa 
atctgttttt 
tgtgtgtatg 
gagaaggcca 
aatgcagcag 
atcatcacaa 
ctcgtgtcat 
gaagaatcaa 
ccctcccaaa 
gaaagtggtt 
tgtcactgaa 
tgagttaggg 
gctgtttatt 
aaattcaaac 
ctggcttgct 
ggctctccca 
ctggtagggc 
tgttgacata 
tttagattcc 
aagtaagagt 
tcttctggag 
ttaaaaaaat 
tgctctttcg 
gtaataagat 
taataaataa 
tcaggaatag 
ctctagcacc 
aagtgacaag 
tgggctttat 
tgagtctggt 
ctagctcaag 
tgagactctg 
tcttgcaaag 
actttagttt 
tacatacata 
agataaatct 
ggcagaggca 
acagccagga 
atcttaaaag 



agggtccccc 
tccatccagt 
agcccttacc 
gagtcagcgg 
nnnnnnnnnn 
nnnnnnnnnn 
gacctttgga 
caccaccacc 
attcctgatt 
aaggtgaata 
aaaatactgt 
accttaaaac 
aaataagacg 
ttccaagctt 
atttgttggg 
tgtgaacacc 
acatgtgtgt 
cttttcctcc 
tcctttatcc 
atccactctg 
ggagtaactt 
aggggaaaaa 
atacatggat 
attacagctg 
tacacctggc 
ggattgaacc 
ccatttcttt 
tgtgtatgct 
gaagagagtg 
caccaccacc 
aaacacacac 
aagcctgaga 
ctctatgatg 
cacacatcaa 
tttacatgcg 
cagcattgaa 
tttctatgct 
tcagctttta 
aggtcagaag 
ctagcacaca 
cagcaatcat 
cattttctca 
aaaactagcc 
cacataacac 
taaaactatg 
tgtctgaaga 
atatgaaaaa 
aaagtccaga 
ctgactccct 
ataaatctta 
ggctgggggt 
accaacgtag 
tgataatcct 
ctgaataaag 
aacacaggcc 
gcccacctgg 
cccaggccag 
gacctgagtt 
caggggatcc 
catgcatgca 
taaaagaagc 

ggtgtatttc 

ctacacagag 
aaaaggagag 
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44820 
44880 
44940 
45000 
45060 
45120 
45180 
45240 
45300 
45360 
45420 
45480 
45540 
45600 
45660 
45720 
45780 
45840 
45900 
45960 
46020 
46080 
46140 
46200 
46260 
46320 
46380 
46440 
46500 
46560 
46620 
46680 
46740 
46800 
46860 
46920 
46980 
47040 
47100 
47160 
47220 
47280 
47340 
47400 
47460 
47520 
47580 
47640 
47700 
47760 
47820 
47880 
47940 
48000 
48060 
48120 
48180 
48240 
48300 
48360 
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gactggagag atggctccac agttaagaac acttgttctg aggtctacag agtgagttcc 48420 

aggacagcca ggactataca gagaaaccct gtttcgaaaa accaaaacca aaacaacaac 4848 0 

aacaacaaca acaaaaccac ttgttcttac agaggacttt ggtttgattc tcagaatcca 4854 0 

catgatggtt cacaaccatc agttgcaggg atccaaggtc ctgtcttctg tgggcaccag 48600 

gcatatatgt ggtgtacata catgtataca ctcatataca taaaataaaa agttttaaaa 48660 

aggaggctgg gtttgtagcg cagaggtaga ggtaaaaaga ctctagcttg tttaatgttg 48720 

acatgaaaaa aaaaagacat ttagattcct gcatcacacc atatccaaaa attaactcaa 48780 

tgtgaatcat aagctctgaa agtaagaata agcctagtat gcactgtaag gctctgggtt 48840 

cactccccag cactgcaaaa gatcatgaaa ccagaaatgc agatcctctg aaccacagca 48900 

tgggaatgta actcagccga tgcagtgctc acctgtcgta tacagagcac aggataaatt 48960 

gattgtggtg gtgcatacct ataagctcac tacgtggaaa gtagaggcag gacgaccaaa 4902 0 

ggttcagtga catccttggt cacatagaga atttgaggcc agtctggtct gctggtctat 4 908 0 

ttggaatgct gtctcaataa ataaaagaaa gaaagaaaaa gaaaagaaga agtcctatga 4914 0 

ttgtcttaac ctctgacctc tgtgttcatc aagtctcctc ctcaggaact cactggtcat 49200 

cttgtgaaaa cctaccccag agtctctgtt cagaggaccc aggctccagc tgtggttacc 49260 

acataggatt tttatactag aaaaataaaa tgaataagta tgtatttttt aaaaaggtgc 4932 0 

agagctggat atggtggtgt ctagttatag catccagaac tgagacagga tagccatgag 4938 0 

gttgagaaca gctagactat acggtctcaa caaacaaaag taagggatct gagtagatga 49440 

ggttttaatt tttttctttg tgtttgttac ctaacgtgta tggttgtttt gaatacatgc 49500 

atgtctgtgt atcacttgtg tgcctgaaac ccaaggaagc cagaggaggg catcgggtcc 49560 

cccggaagta ttattacaga aggttgtgag cagccatgtg ggtgctggga atcaaatctg 49620 

aaagagccac ctcgggctgg agagatggct cagtggttaa gagcactcaa tggctgctct 4968 0 

tccagaggtt tggagatcaa atcccagcaa ctacatggtg gctcacaacc atatgtaatg 4974 0 

ggatccgatg ccctcttctg gtgtgtctga agacagctaa agtgtactca aataaataga 49800 

tcaaaaaaga aaaaagaaac agccacctct ccactctccc tttttaaaat cctcttgcct 49860 

ctgtccctta atgttaataa cacaggtata tgatactatg ccttgtttat gaatagaaaa 49920 

tacacgtgct aaagcaagtg tgaaccttaa atacattatg- ctgagtaaaa ggagtgagtt 4998 0 

gcacacaaga cttttctgct caagagtatc tgtatgaagt attgaacatg tgaactctga 5004 0 

aatcgggagc tgaggaagat atggggagtt ctaatggcta caacatttct ttttggaatg 50100 

atgaggatgt tctagaactc aaaaatggtg ataactcagc atatatacta aaactcattg 50160 

aattgtacac tttaaatgaa tgcaataaaa cttgtctcag taatgtggtt tagaagatgt 5022 0 

acagacatgt gtgtgtgtgt gttaaaacat ttcttggcat ggcaataaaa atacagtttt 5028 0 

agccaggtgg ttgtggctca aaaaataatg ataataacaa taataaaaat aatgaaaaca 5034 0 

gaggctggag agatggctca gcggttaaga acactgactg ctcttccaga ggtcctgagt 50400 

tcagttccca gtaaccacat ggtggttcac agacatctgt aatgggatct gatgccctct 50460 

tctgatgtgt gtctggaaac agctacagtg aaagtcattg caaggacttt acaatagtga 50520 

ccatgataac attgaagcta gacttgctac tactgctgag tgtgtctgct ggctctttct 5058 0 

aaggagtaat gttagctttt tgtcctaaat ttgtttcctt cctttcctct ctccctctgc 5064 0 

tgttttttct tacccctctt ttactttgct ttcccctctc atctcctctc ttaacagagt 50700 

tgtcctatgc agcccaaatg ccatcttcct gcctcagcct ccccagtgtt gaaaaatact 50760 

ctttccacag gttatgttag gagactggag tctgctcagt cggggaggga gcctgggtca 5082 0 

agttctgagc tcaattcctt ttctttcttt ctttctctct ttctttcttt ctttctctct 50880 

ttctttcttt ctttctttct ttctttcttt ctttctttct ttctttcttt taagacaggg 5094 0 

tttctttgta taccctggct gtcctggaac tcactttgta gctggcctgg aattcagaaa 51000 

tctgcctgcc tctgcctccc aagtgctggg attaaaggtg tgcaccacca ctgcccagcc 51060 

ctgggctcaa ttcttaacat tgtggagaga aaagtattgt agctgttctg gccacctgga 51120 

attactttgt ttctgatctt ttgctgcagt caaatccttc tcatccatct ttcctcgtca 51180 

ggctataata tagactctcc ttgcaatact tggaaatgct ctacagtcag ctacatcctc 5124 0 

agtcctgctc ctatattttt tcctaagctt ccttctaagg tctttattgg tttatgattt 51300 

acacagaaca tttttttttc ttgtctatag catgcgttag agtgatcgtt gccagataga 51360 

ggaaagagaa atgagagaan nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 51420 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnt 51480 

cagctactga ttcctcctcc tccctcctcc ttcctccctc ctccccagcc tcatgctctg 5154 0 

ctcatcttgg acttctgcgc atgtcctcag cccagacctt ctgctcttgc ttctcctctc 51600 

cccagcagcc ccccagttct cttcctgaaa cttctgaggt actctccatc acctcctttg 51660 

gctcctgctc tgattggtgt cacctgctgg ataggcttgc tcctgactcc actgttcgtg 51720 

tctcaattag ggaccctcac cctctgatat accacacatt tccctagtgt ctccacctcc 517 80 

cacccccacc ctatacgcac atacacactt agctgcatca ggatcctaca ccagggactt 51840 

cttacccttc taatcctccc caccggacac tgcccaggga cactggggct ccagagggct 51900 

attgccacac ggacacacag gagatctcat caaggagatg tgcctacccc agagggtagc 51960 

tctcaccatt cacaagcaca ccacttctgc ctccagcttc tactctctcg caggaagtag 52020 

ccagcccggt gccaagtatc cccaactaca tccccaaaat tctcagacac tgccagcctc 52080 

cagctgtcag cctggccccg gctggcgggc gcctgctcct ggcatagcga ctagggtgta 52140 

attagaaacc cgctagctcc ctaattgcca gttctgagct gtccttgtta ccggctgccc 52200 
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gaggcacaca 

tagacctaga 

gcaatctgcc 

ccagcccggc 

aaagagtctt 

ggaggacaag 

agaggggtga 

actcttgtag 

tgggagacca 

tggcctctcc 

ctacctctag 

ccttgggaag 

gcaagagaac 

tggttagaaa 

gcccagggcc 

tgcaagtttc 

gcctatgagc 

gttagggttc 

gaagaggaag 

ggtagctgcg 

gctagacacc 

ctgcaggcct 

tgtctagcat 

cctcagtcta 

ggatctgcgc 

gaagcagggc 

taggggcacc 

ctgctgttgc 

atggtacaga 

agtgggcatc 

ataagaaaag 

aaaattactt 

taactggtga 

gataatgctt 

gtgtgtttgc 

ttgctctcca 

aagacaggct 

gcctcgaact 

tgtaactaca 

ttgaaatgac 

ccaaaggtcc 

gatctgactc 

taaattaatc 

ctgccccaag 

catgctaatg 

ttcttttaag 

gtaccgtgtc 

gttgtgagct 

taagtactct 

tctaaaagta 

gttgaacaaa 

acagactttg 

tcagcaagct 

aggtggggct 

ttgcccctcg 
ctctctctct 
ctgcttcaac 
cctttttttt 
aggctcaggc 
ggatggaaag 
cactcatgtc 
ctaagtggag 
cccctccaca 
actaaccaac 



tagaggaaaa 

gaggactggc 

ccgtgggtgc 

ccattcattc 

gggagctaaa 

agagagaaga 

cttgggtgac 

ggtaggtcag 

acacttgtct 

agcctctgct 

ccaaaactgt 

aaatgaagat 

agcttctaca 

gagtcacagg 

cctcctcact 

aagtcaggac 

ctgttgtcag 

ctcacactgc 

ggtcggttca 

gctgcctcca 

accagattct 

gcccagaaca 

cctggctaaa 

gacaacactt 

taggtacagg 

cacctctggc 

ctatccccaa 

agatggagtt 

gtatcccagg 

acgaccacaa 

gagagaacat 

ctgtactttt 

agtggaataa 

ttacacctct 

tcatgggtat 

cttttttttt 

ttctctgtgt 

tgcagagacc 

catagctccc 

tgtttggggc 

tgagttcaaa 

cctcttctgg 

ttttttaaaa 

tgctgggatt 

tatcccaggc 

gtttattttt 

cttgcagtac 

gcatggtggg 

taactgctga 

ctagggatgg 

gggctttgtg 

actcttctag 

gtacaaacaa 

actggctttc 

tctttcaccc 

ttaaacagga 

cttctgagtg 

tttttttttt 

ctccctgggt 

atggagccag 

cactgggaga 

ctgagcaggc 

tggctcctcc 

agaatggatg 



ggctgagagc 

atgtggccag 

ctcccagcca 

tttattcagg 

gggtcccttc 

aggaggaaga 

aggacacagg 

cctccaacca 

ctccttttcc 

tacaaatgtg 

gtccacaagt 

tccccatgcc 

ctggtgatgg 

agctgagagg 

tgggggagct 

caaaggtctt 

gggaatctcg 

ccacaaagag 

cagtgaagga 

caccagcctc 

gctcagagat 

gcagatgact 

gggaacactt 

ctagaaacct 

acagaaagtc 

tcaggcacac 

gctccttacc 

taggcagcca 

tcaggttggc 

agctcatgtt 

gaggaccaga 

gagacaagag 

taccacttac 

ctgtgtgcac 

atggagtcag 

tttttttttg 

agccctggct 

cacctgcctc 

tcttttttgg 

tggagagatg 

tcccagcaac 

agtgtctgaa 

agagaaagaa 

acagagctgt 

tgtcctcaga 

atcttatgtg 

cctcacagtc 

tgctgggaat 

gccacttctc 

cggatgtgcg 

catgccaggc 

tagtagtgtc 

gctatgggcc 

aaaatgaccc 

tctcccagag 

tctggttatg 

ctgggattat 

tttaaagaat 

accagcataa 

aggatctttc 

tactgtagta 

tcgtatcctc 

ctgcagaact 

agcatgtgtg 



tgagccaggc 

ggaccaaacg 

catccatttg 

tggcataaaa 

cttgcctcag 

ctcctggcag 

ttggggacct 

tgaagtataa 

cacccagggt 

gagggaccct 

gccagcccac 

tgccttcctc 

tccttccggt 

ctgatccagg 

gaactggggg 

gcctccatgg 

ctgttcctgg 

gggcacgcct 

ggagagggac 

gctgagctcc 

cccacgaagg 

ggtcagtgct 

gaacccagcg 

tttttttttt 

tagaggagca 

tggcagctcc 

cagttggctg 

caccttggtg 

tagtacctcg 

gttcttaaag 

cagcacctgg 

ccagaaattc 

ccctttgcaa 

acataagcat 

aagtaggtaa 

gtgttgccat 

gtcctggaac 

tgcctcctga 

acacagggtc 

gctcagcggc 

cacatggtgg 

gacagctaga 

atgatggcta 

acaacaagcc 

ctctctatgt 

tatgggtatt 

cagaggaggg 

caaaccctgg 

caccttgagc 

ttcatgtgcc 

aagcactcaa 

tccactacag 

ttcctgtcct 

atagaggagc 

agtcagctat 

tggccctaac 

ggtgtaagcc 

ccatgcagtt 

tgcctgcagg 

ctctctgaac 

ttcaaggaag 

tcaccacggg 

ggcaatgctg 

gtgccacaca 



tggcatggag 

tggcacagag 

cccagaactg 

atcactacaa 

tccccaagat 

tgttggcatc 

gaatgtcttc 

caccaaggcc 

gtctggaata 

aagttaggga 

aaaagatcac 

caggccccac 

cccaccctat 

tggggactca 

tagtcttcct 

aagtcagctt 

agctggggca 

atggtgtcct 

attcgattca 

atggtagact 

tctgggccct 

gccccaaggc 

gttgattgga 

tttttttttt 

tatcaaatgc 

cgtactctgc 

agggtggcca 

ggcctctcct 

gacacgttcc 

gggaaatgag 

acctgtctgg 

agggttagca 

ggtgacatgg 

atatgtttgt 

acatcagtcg 

ctttttgttg 

tcactctgta 

gtgctgggat 

tcatggatcc 

taagaacact 

ctcacaacca 

gtgtacttac 

catacttctc 

caagtttgtt 

aattcagaac 

ttgcctgagc 

caccatttcc 

tcctctgcaa 

ttttcttcct 

tggtttatgt 

caactgagct 

cctgagttct 

tgcctctcag 

cacagcaaac 

aattcgagtt 

tatcttcaac 

accacactca 

aggacagcat 

cgggtcctct 

ctcaatgtcc 

aagcaacagg 

ctacagagaa 

ggcccggctt 

cctgggaccc 



gtagccctag 

agggctcagt 

tgacgtcaaa 

aaactttaca 

tcctggcagg 

tccaaatacc 

agcaagggac 

agtctaagct 

tgtctaaaga 

cttgcctaac 

cccctgagcc 

cccacctgct 

cccacaaagc 

ggatgctgct 

ccatgcgggg 

tgtcattctg 

gcgcgctggg 

ccatgatgaa 

tggctacgct 

gatgttgcac 

ggaacaattc 

tatgtggatc 

atctgttaga 

ttttaaatca 

tcccatccag 

ccagaccacc 

ccaggtccag 

gcagcgaggg 

actcaaaata 

ccacctacag 

agtctgggcc 

tgctttcact 

gaccaaatga 

atcggtgtga 

tcttcctaca 

ttgttatttc 

aatcaggctg 

ctaagatgtg 

caagctggct 

gactgctctt 

tccgtaacaa 

atgtaataaa 

tctcgtctct 

gtgttttaga 

gaccttgaac 

atttgtctgt 

ccctgaactg 

gagaagccag 

cctatctcga 

gttgctaagg 

acacatcccg 

ctatctgctg 

ttctctccgc 

agtaggaagc 

tttttttcct 

ttcagtcttc 

gctcacacaa 

ggaaatgacc 

gccagtgggg 

cacagtgaga 

aaggtgagag 

gtctggctgc 

gcccagtcaa 

cagcactcag 



52260 

52320 

52380 

52440 

52500 

52560 

52620 

52680 

52740 

52800 

52860 

52920 

52980 

53040 

53100 

53160 

53220 

53280 

53340 

53400 

53460 

53520 

53580 

53640 

53700 

53760 

53820 

53880 

53940 

54000 

54060 

54120 

54180 

54240 

54300 

54360 

54420 

54480 

54540 

54600 

54660 

54720 

54780 

54840 

54900 

54960 

55020 

55080 

55140 

55200 

55260 

55320 

55380 

55440 

55500 

55560 

55620 

55680 

55740 

55800 

55860 

55920 

55980 

56040 
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acagctgggg cagaagggtc atgagtccaa agcgaacttg tgtaacattg tcagaccctc 56100 

gaacaaacaa aactagcccg tcctgttatc tcagccacag atgatgggcc caaggatcag 56160 

tactctagcc aaggagtcac ggttaggcta gaagcaaggg aagccttagc tgagacagct 56220 

tggcacggag cttcatccaa tcagaatgtt cagagcaata agctttgaaa cccgacttcc 56280 

atctatgaag cactgtgtgg gaactcctct cttcccttac gagcagggcc ctggtcctct 56340 

tgggctccgc taaaacccca gcacagagaa cagttacctg gcacgtgaca aaaactcaat 56400 

atattttctt tgaggagatg aacctcaaag aagctgtgtc ctggatagac acagcataat 56460 

aaacccttca ggagctacct acccagggac cagactttac ctcccagtac caggcctcgt 56520 

ttgccagcca aaggcaaagt ccagactgac ctgtatctca ggttgctcca gcaggaacca 5658 0 

tcgaagagga tatgacaccg cgtgcatcat gtccaccgac actgtgaacc gctcatccag 5664 0 

gtggaagaaa tctttctggg tgaggctcgg gtcaaacttg gtcctccaga aacctgcagc 56700 

caggcagagg gcaggagcca tgtaacataa aatcagcctn ctgcctgtct tgcctagaac 56760 

ctatnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 56820 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnaaccaa ggcaggtctt 56880 

ggaaaaagga atcttaaatt agaagatgcc ttgataagat tggcatgtag gtatgtctca 5694 0 

ctaatgattg atgtggaaag tcacgaggga tggtgtcacc ctgggcagat ggcctggggt 57000 

atataaaaac acaggctgaa caaaccacaa agcagtagtc ctcaatggct tctgctttag 57060 

tttctgtctc aggttcctac cttgacttcc ctcagtgaag gcatgtcaca tgagagttgt 57120 

aagaggaaat aaaccctttc ctccccacat agtttttggt tatgatgtta tatgtcaaca 5718 0 

acagaaacta taactaatat agttggtttt ctttttttgt ttgttttgtt ttgttttgag 5724 0 

acagggtttc tctgtatggc cctggctgtc ctggaactca ctttgtagac caggctggcc 57300 

tcgaactcag aaatccacct gcctctgcct ctgcctccca agtggtggga ttaaaggcat 57360 

gcgccaccat tgcctggctg gttttctttt ttttttaata catttataat gcattttaga 5742 0 

tttaaaaaaa aaaatggcca tggcatataa tataaaaaga agtgcttaca aatcaccatg 574 8 0 

tgcccttgcc ataaattatg taaaaatttc catatggaca tcagtctcaa gcttacaatc 5754 0 

tcagcactca tgagcctgag gcagaggcag gaggatggtg agctcaaggc cagcttagtc 57 600 

tacataacaa gatcctgtcc aaataataac aacagtaata atttcataca tagaactaga 57660 

aggggccact gcaaagacag tatgacaaaa ccactggccc tgcctaattg tattttaaat 57720 

aactgtcctc ctctctgtaa ttttcagttt ctaattttta cataactacc atgtattctt 57780 

tttgtaattt taattagttt tttaataata gaaacaagct aagtgctaag aatattttca 57 84 0 

tatgaacatt ttcaaggcac ttgatacata cctcagattt gccctccagg tgagcagtac 57 900 

caattacgtg ccaccagcaa tgttagcttc cttttttccc taccatctga ttctgtttca 57960 

gtctattcgt agttctgatc ttgttatatc cctttttatt gtttccctgg gttccaacac 58020 

ctcccagttg agtgttctca ttgaatttca ttagcagctg tttcattaat ggcacagaag 58080 

aaggattaca gtgttaacta ggatagactt tgacaaagaa ctatgagaac atatcttatt 58140 

atctttgcat aaattctttt taatcaaagt tcctcaaaag cctctctctg ttcccatctc 5 8200 

agggagtagg tctggccact gatgagtgtc caggccacag tacaggtgtg cgtggttctg 58260 

tccctgtggg aagggcacat ctgtgttgta acaggattcc tgtcttaaca agccttgctc 58320 

aggctctaag tggtcctgag ctagctaact gcccttggct ttcccttgat taccagataa 58380 

ctattcactc ttctcatttt gcagagcact taccaggtag ctatgtcctg gaagtacgaa 58440 

tgagtccttc tattgttttt cttttactta aatcccattt gaaatgcgcc agggacactt 58500 

caatccaagg tacacttttg ctaaagaatc actcattttt atatgcaaaa tgtcacctat 58 560 

taactgcagc tgatatggta catacatatt ctctcttcct attatccact aataggtgac 58620 

taatgcgaaa tattgagtaa tttttaaaaa tcaatactca attttttaga aataattaga 58680 

gagacattca actctgacac cagcacccta ctcagttcct gagccttcct ctgccggagg 5874 0 

agaatctata aataactcac gaagctgaca ttactcactg tgttgcagtc atttttttct 58800 

gagaaaattt tagcaactgt tctaatagag cctgccagtt atcagtagtt gagaatgcaa 58 860 

gtcaactttt aattatgcag acgctgatta ttcagacgac aaattgttgg tgcctgcacg 5 8920 

gctccttcct gctgcctacc tttaaccgtt ctcagtgctc attagcacat gttccagaag 58980 

gtaggctttg gaggggcgga caggcactca aaccagctaa gcacttagag aagctctgat 59040 

gaaagatgtt aatgcagttt gtagaattat tgactaaaat tgagtcattt ggattccctg 59100 

tgaattgtat ttacatgccc tgtccctgtc ccccatagca acagataata ggattgtctg 59160 

cagagagaca acatagttct tatatttaat tttttccttt gtcgaacatt ttcacatgat 59220 

ggttcgtggt gtttcctttg ttcattacat ttgtatccag actagttact tctgataagc 59280 

ggttagttag gattcctggc acgcggacag tgacaccaca gttgtctgat cgtttcccac 5934 0 

ttttttacaa aaccgtttgc ctttaagagt cagtgttttg cacatttcac ccagattatt 59400 

ggaaatatta tttccctcct gcttaaaccg aagctgtgat cataatttaa gcctttctag 59460 

gtagccgatc ttacatgtat catacctatt tctggcatat gtttgtctat tacaaagacc 59520 

tcgtaggtat gcagttagaa gcctctagtt aaatgaaatg ttgcgtgtgt gatgaacctg 59580 

gagtggggat ggccttttgt gtgccccaag gctgttgtgt ttcacacagt tgttttctgc 59640 

ctcctctggt ctatcactat cctgccactg ccagaaaacc ctgctgtgtg ttccccgcgt 59700 

ggaggatctc tgcttctgaa cttctttggc ctgagaaact ccataaccaa atcagttagc 59760 

attttgttta aagagcaggt aggctgttag agcttgggtc ttacatgtct cccaggtcca 59820 

cttgccagcg ccttgaccac tgttaacttt tgttaaccaa ctcatctttt gctgcctgtt 59880 
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ttttgggggg tttttttggt tttgtttaag ccaagatcag ttatatggcc caggctgagc 59940 

ctctcttccc agcctctcaa atgttagaat tacaagcatg catccctcag catacctttc 60000 

ctttgctttt tttaaaatag agttttgcca tagcaacaga aatctaacct aactaagcat 60060 

agccgtgcac atggtatgag gaactcacat atgtgtgaat ggaagttcat agagaccggc 60120 

atcactgcct agaggcccct ttcttccttc cttgcagttg tcgtgctagc tgactgtact 60180 

acaaaagagg ttgtctgagg cataagacta ccttcaataa aacatgcaca gacagtttgc 60240 

ttctctgaga tttcagagca gtgactacct tcaataaaac atggacagac ggtttgctta 60300 

cctgagactg cagagcagtt tccaaaaatt ttagacaaag ggtaggatga agaaggctgc 60360 

ggggttttgc acacacttaa ggtgcgtaag taaataaact gagctacact gacaggatgc 60420 

tcgttctagt agccaaccaa agagcagttg aaccaaagca cctagacttc aaacatcgtg 60480 

gggagataat cttaggagtg ctatgcttct gcgtcctaca agtattatga aactgtctag 60540 

aaagcacccc actggtaatc cctttttgat tatttttttt ataaattcta gtcttggggt 60600 

tttgagtggc acacagacat aatggttagg cttcggtgtg tgctcattca ctttgcttcc 60660 

tggggaccag agtttgcgat gagtcatgtt ccatctgatt tctgtcggat ccggctgcag 60720 

agccatgact cagatgggct tcaggcccag ctgctcagtt catcttctgg ggaatagatg 607 80 

acaaggacgg gacaaatgtc ctgacgcaca tttccttctg ttcttgcact tccagggtct 60840 

aacgagagca tcattaccaa cagcaggcag atacgccttg ccacaggcat cttccctgtt 60900 

gtcagcctcc tgaaccactc ctgcaggccc aacaccagtg tgtccttcac tggcactgtc 60960 

gccaccgtcc gggcagcaca gaggatcgca aaaggacagg agattctgca ctgctatggt 61020 

gagccagcct ttctttccac taccctgctg tgcctcacac ctcacatgaa aaggataagg 61080 

ggacaggaat cagcagatat gggcccagtg cctctactca tcctctgagt ctttcctgga 6114 0 

aagggcaatg catccttggg ccaataaaaa aggtcttctg gctgtaataa aaaagcccgt 61200 

tgagggcagt gagccatatc cctccatgcc ttgtagacag cctatcctga aaatgagcga 61260 

ggagcacttt cttggcttct ttcttcctgc cccagcagct tggaaacgta tccactttca 61320 

cccgtgtttt gttgtttttt ctgagatgat agggcagagt acccaacctc atataggcta 61380 

ggctagtgtc tatcactgag ccaggacccc aacccagcac caccatgcca gtcacgtgat 61440 

gactaggcca gcccctcggt agagtaggca ttgactctct tggtgtgact aggaactgtg 61500 

ggtaatctct ctccagggcc tcacgagagc cggatgggcg ttgctgagag gcagcagagg 61560 

ctgagttctc agtacttctt tgactgccgc tgtggggcct gtcacgctga gacactgaga 61620 

gcagctgcag ctcccagatg ggaagccttc tgttgtaaga cttgcagagc gctcatgcag 61680 

gtaaatctct gctgttccca ggggcagggc tccagctaaa ggttgtcagt cgccaggaga 61740 

accattcctg cttcccttct tgtaactcct ccctacatgt cgcccggtcc tgcagaaaac 61800 

acaggttgta tttcctaata ttttccctat aagtgacaca aaatcttaaa ttacacaaag 61S60 

ggaccaaaaa aaaaaaaaaa aaaaaagccc tagaaattta cttgctcaaa taagtcatca 61920 

aaagttgtgc atcaggccta gcacttgggt actggtaacc ctagcactca ggaggctgag 61980 

gaagaaggat ctcaagtcgg aggccagtct caagtgacac cccatctaag agatcaccat 6204 0 

tccaaggagc tatttcagag atggtttaat ctggggaccc agattgtgga ttttctgtct 62100 

gttcaattcc atctctctgt gctggcctca tcagacacac tctgtagtaa ctgtgggaaa 62160 

atccgaccca catagttttc cctcagcctt tgacccagag ggaagagcca cagtggagag 62220 

catgagagca gacccttggg tgctactgcc aggtaatggt gtagacactg gagtcttcaa 62280 

cattcatgcc ccaatgcaaa atggtctcca caccagagca tggcattctc attagaaata 62340 

agtaaatgga attggctgtg ttgaaaattg taaagccaag ggtcaagaat gaagccttcc 624 00 

ccagcatgtt ttgttttgtt ttgtgtttta ggcagcgtct ctctgtgtag ccttggctcc 624 60 

tgccctctgc tacctctccc aggtgtgcca ccatgctggg cctaagcgcc ctgtgcatta 62520 

gtgctccctc gatcctgctc actcttgaga cagtcttcct tctactctgt atccccagat 62580 

aacctagagt tcacttcaga gcccaggctg gcctcaaact tgagatcctc gtgtcccagc 62640 

ttctcaaatg cagtgatatt tacaggccta cacctggctt tccctgatag attcctagta 62700 

agatgattat cctttgagcc atatctctct tctgcttctt cctctcttcc tgcagggttg 627 60 

atctagaatt tattctaaag ctgactggcc tcagaattgc catccttctg cctttagnnn 62820 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 62880 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaca tagtgtcaac tttcaaattc 62940 

tgccttaaga gttctttgtt tatgggaatt tatgggaatg ttccacagaa cccatccagc 63000 

ggagttctgg ctgttgtttt ttaatcttta ttcatcttgc gtgtgtgtgt gtgtgtgtgt 63060 

gtgtgtatgc gcgcgtgctc aacttgcaaa attgcaaaat tcagtctcct ctttccaccc 63120 

tgtaggtcct ggggatcaga ctctgttagg cttggtggta ggtgctttac tgagccatct 63180 

tacaggcccc ccatggacaa ctttttcttt gaaaacctgt ttctggcttg ggtgtgatag 63240 

ctcacacctg tgaccctacc accactcatg aggaagaggt aggaggacta acagaattgg 63300 

aagccagcct ggactacaca gtgagtaaag gctatctata tactcaccac atggcaagac 63360 

cccgttttaa aacactgggc aaggtgaaac aaaagtcaat taatttcaca taaagtcaat 63420 

agcttcatta acggcctagt tatctttaaa actgtatgca ggttagtact tggtttcaat 634 80 

tttattactt tttctctgga acatttaaaa gtactttagg ggctggagag tcagttaaga 63540 

acagtggctg ctgccaaagg actggagttc actcccaagc acccaggtgg caatcacaac 63600 

tgtctgtcat ctaattctag gggatctgac accctcacag actcacaggc agtggaacac 63660 

caatgtacat aaaataataa ttaaaaaaat gaaataaaat accaggcaag gtggcacacg 63720 
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cctttaaccc cagcactcag gaggcagagg caggcagatt tctgaattcg aaggcagcct 63780 

ggtctacaga gtgagttcca ggacagccag ggctatacag agaaaccctg tctcaaaaaa 6384 0 

aaaaaaaaaa aaggacttta aattgggctg gagagatgga ttaaaagcat tggctgctct 63900 

tcccagaggt cctgggttca attcccagca ctcaaatggt ggctcacaac tgtctatatc 63960 

aacgcaatct aacaccctct tcaggcatgc aggttacatg tagacaaaac atccatatgc 64020 

ataaaataca taagtaaatg agtcttttaa tgtatactag aagctgggtg gtggtgcatg 64080 

cctttaatcc cagcacttgg gaggcagagg caggtggatc tctgagttgg aggccagcct 64140 

ggtctgagta aatagagcct tgtacttcta cttatcacta cagttacatt ttataacttt 64200 

gggccctagt gcttccattt tccactgttt gcttaaccac tggggcctga agcttttgtg 64260 

ctgacacttt tgttcgctaa tcatcaggca accaatggtc tctacactcc atcaccatca 64320 

acacaaacaa aacaaaacac aacactacgg atcctggcat ggtggaacat ctttagcccc 64380 

agtacgtggg cttgagttca aggccggcct ggtctacata gcaagttcta ggatagtagg 64440 

gatagtcttt aaaacaaaac actattttat ttatgaacaa aacatgtaaa gaaagaaaaa 64500 

aaactgcaaa tttatctatg aatgaagtct aagtaatact tcaatattgg aaatagcttt 64560 

ctaaaatatt tttatttaaa gaaaactcag caaattattc aaacaacctt ataaacgttc 64 620 

gttataaaag taaagaatta tttgcaattg ccttaagggt ccaaggtggc agcctcttaa 64 68 0 

aattcagaac aatccaagct tcacattcca gttcaacatt tctacagccc taacgtattc 6474 0 

aaatacctcc attctgacaa ctgtttcccc tcttcttttc ttctaagctg cttagatgtc 64800 

tgtcccaggc ttttcatgat tttagtcatt cacacaacta gcaaacatta tctagggact 64860 

aaaacttgcc agatactggg atatcaccct aaagggggac tgaaagtagc tgcaggctac 6492 0 

agtctctaca atctcctgaa tgaaatacaa agtagctaat atttaccaaa taaacatgta 64 98 0 

cacctgtgat gattgctagc tgtactagca gaagctaaac actaaatcta gaaactcagt 6504 0 

cctccaacta gccccttgct cggcttcagc ctcattttta caaacaaggg aaagagtttg 65100 

gaatgttgcc caaagccata cataagtgaa caaaaaggag ttggagtctc caaatgcatg 65160 

gatttgggct agttactttg ccaaccaact cagtaacaac tgagctgaac aggaacactg 6522 0 

tggtagcaaa agaaactgga actatcaatg gcctctagag caaaaatata tttaaaaaga 65280 

aaaaaacaaa caaggcctgg caaggagact gtgagaagag tgtgctgact gaaattgact 6534 0 

agttcagcca acaaaagact attccagggc tggtgagatg gctcagtggg taagagcacc 65400 

cgactgctct tccgaaggtc aggagttcaa atcccagcaa ccacatggtg gctcacaacc 65460 

atccgtaaca agatctgact ccctcttctg gagtgtatga agacagctac agtgtactta 65520 

catataatca ataaataaat ctttaaaaaa aaaaaagact attccagtgg ggatggaaaa 6558 0 

gttaagtgtg gagttaaaat atacttcaac tggtgatgga ctaggtgtcc agagtcgggc 6564 0 

aaaaggatgc tctgtggtag aggtgcctgc tgtgtaagcc cagctacctg agctcaatcc ' 65700 

acagaatcca cagcggagtg ggaagagaaa caacgtccca gagttgtcct ctggcatccg 65760 

acgcacattc gccatcccca agatgtcata catatgtgta catactacac actggcgcac 65820 

gcgcacacac actctttttt aaaattcaga cttagaggga cataaaggat ttgctctgat 65880 

atatgttcaa ttgaaaatga ctttgaagat agagggcaga tcgaaggaag ctcagcagga 6594 0 

aagaattaat aacatgcagg tgaagggcta taaactagtc tgcagagggc cttggctcga 66000 

caaaaaaatc tatggggttt gccggtaaaa taaggaaaaa gttgtcaaca tgaaacacag 66060 

aacactagca agagaggagt gttagcagaa agaagccaac aagctcaaac aattaggtcg 66120 

gctgaaaaat tttaaaatgt cttctgattt ggctactggg aagccactgg tgacttcggt 6618 0 

cagcgttttc tctctcgtga ccagagagat gtctagtagc aataatgagt taggaggatg 66240 

taaaagaagt aaaacagccg aaaacaagtc caaaaagttt ggggtgatgg agaaagggag 66300 

gaaacagagg ccgccgaaga tagacagcgg catgtttatt tgtcttgttt tcttagatgt 66360 

aaacaaacta aaaaaactcg tgagttcttc tgccagtacc gggttgcctc cagcatcctc 66420 

tgatggtctt agagaccccg ggatgctccc ccgcggccgt ataatttcct ccctgacgct 6648 0 

ctcccgatcg acagcggctc cctccccggg tcctctttgc accgctccaa ggccgcgctg 66540 

ctagggccat cgagcccgct cagggtcgtc tccttacctc gatggccccc tcgctcaggt 66600 

gtcccaccat ggctgcaccg ctaactcccg cgctcgcgct cttgcaccgc ctgagcttct 66660 

ctgccggggt cccgcgggct gctcaacgat tggctagagc aactgtgcgt gccgatccgc 66720 

ccccagcgtg agcgcggtgc gaggggcggg cctagacgcc gatagccacc gcattggcta 66780 

ccgcgcggca ggcagagcac gtgactcttc cgaggccggg ttcgaggcct agtggcggga 66840 

tggcgggacg tgagggcggg gcgctgggtc gcagtgcgcc tgtgtcagcg cggtgctact 66900 

gagttgttcc cccgccagct gtcggaactt tgcccgccca gtcctttggc ggacagacag 66960 

aatggcaacc cagggaacag tcggagctct cccctggtaa ctgctgctaa atatagtcaa 67020 

agcagtgacc tgggtacttc ttcacgcagt gcgtgcccgg cgccggtgcc aggcccagag 67080 

cttggcactg tgggataaac aaggtaaatc agactcagtc tccgccctct tgagttccac 67140 

ctgagagttg tggccgcaag gaacccagcc tcaaggatgg tagacgcgat atgggccaca 67200 

catgtggagc tccagagtgg gggtcaaaaa tcaatcaggc tttcgagagg cgatgcggtt 67260 

tgaactgagt taaagtgtgt gtagaaattt gtcaggtgga ttccagtgag gatagtgatg 67320 

ttcctaaaag cccaaatggc ctatgcaaaa gtattggaga gcctggcgtg ctggctggct 67380 

ctgatctgtt tgtaatccca gcctttggga tgtagaagca gcaaaagttc aaggtcaccc 67440 

ttaacaccgt tgagttcgag gtcaacctga actaaatgag accctgaaaa atcaaaattt 67500 

gggacccagg cgtggtggca ttcgaggtaa aagcaggcag atctctgagt tcgaagccag 67560 
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ccaggctaac ataagatccg gtctcaaaaa aaaaaagtaa taaaaataaa aagggagaga 67620 

ggctatatga actgaaagaa agacctggag atcaaaacag aaaactgagc cgtctaagaa 67680 

atgaaaatat ttaacttcat agttgctgga gtaagaagtc tggaaaactt tgggcaacta 67740 

aggtaaacag gtctagaaag actggaatag tagccatcta ctggtatttt gatctctgtt 67800 

tgtacaacca caacctacta tagtttctca aacagttcca aagaatatgt ctgggtgaat 67860 

tggtaccaca ccacagatta actctccttc agcatatcaa cagctataga aaaccccaga 67920 

agaaatgatt ttggttgcgt gtcacttggt aggatgaaat ctcgattttc tagaactatg 67980 

cattaataga aagctgaatc ttcatgttct gactttacag agctgcggca gcatggatct 68040 

accggtggat gaatggaagt cctacctact taagaagtgg gcttcactcc cgaagtctgt 68100 

gcaggacaca atttdiacag cagagacttt gagcgacatc ttccttcctt cttcttccct 68160 

tcttcagtaa gtgaatggaa acttcaggga aattttggtc tggaaaatgt tctgccttgt 68220 

catttggtct gaatatctct tttttatagg agagagtagc tttatattct ttatagtatg 68280 

gggcatttag cagttactgt tggttttcac gtttctccct agtctgtgat tactagaatg 68340 

ggtaggcact aactgctttc ctcttttggc atgtgttata cttaaggaat gtagtatctt 68400 

gctgtcgtcc cagtgctgtc actcatagga tctggtgcag gttgtgtagc tgcccctaga 68460 

agctcattca gtcctaatgg ggagaaagaa ccctggcact tggttagttg agacccanaa 68520 

cttctcaagt tctnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 68580 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnntttagtt 68 640 

tccaagtgcc actttactgc aatgtgtcac cacatccaga gttctgtgtt tgtttatttg 68700 

tctgtttttg agacagggtt tctctgtgta gccctggctg tcctggaact cactctgtag 68760 

accaggctac cctcaaattc actgagatct gcctgcctct gcctccagag tgctgctgta 68820 

cactaccacc acccatccag ctttctatat tggttttcct atggcgtttt aaaatagcca 68880 

ttatacgtgt gtttatcatc taaagtcctg gtcccaaaag gagatgagag gggctgctaa 6894 0 

ggtgaaaagg attacaaacg cttatcaatt ctgttcaaaa attaaacctc agagtgggat 69000 

tagctgcttc tttcattaga attgctatca gaattcactc aggccttgtt tgcgtgtgtt 69060 

attgaagaag tctcttcctc atcaggtcag tgactcctta gctcaagtac atgcaatatg 69120 

cagtattgat aactgttctc gcttaggaat aaaaatagaa ctgacttcca cagggaaatg 69180 

atgtgctgag ctgtagcaac gaatcttgca caaactctgt cagcagggac cagctagtct 69240 

ctcgcctgca ggaccttcag caacaggtct gcatggccca gaagcttctc cgaaccggta 69300 

aaccaggtga gattggctcc ctcgccctag gcctcagccc ttcccttgtt tattttggta 69360 

tcaccttgcc ttactgagca gtcctcaata aatgactgag gacttgaatt taattatccc 69420 

agcaccagcc acaagatggc tatgtaggcc agtgagacca gactgtgacc agctgttact 694 80 

ctggtgccct tgaaagtctt cctgatggtt taagctgtgt ctgctgcgcc agatagttct 69540 

agcagctcga gcaccagaaa ggctgtctga cttccatggg ctttgtgtgg ctccagaggt 69600 

ccaatgccat catctgattc ccagcttaag gacctaagct ccgagaaggt tgctctgccc 69660 

tcagcagcag cagcaagtcc tgagtgctgc ctgggctcgt ggtgtgactc aggagtagag 69720 

ctcggtagct agcctgagct gagagctgag agaaagaaag gactcctctc ttttcagaaa 697 80 

gggatttgca gaactcgatg ttagaccctg acatggtagg aatctgtttt gactattcta 69840 

gcctagattc tgaagttgac ctttagccta gagtcaagaa aactaatgat tacaggagga 69900 

atgtagagtt ggttgttaaa tgttggttgg aaaatggatg ttagaagccc agggtaaatg 69960 

tgaggaagcc tcatctaaca cctcttttac tgaaagagaa aacataagca accaacagct 70020 

tccctggaat gcccggctgt tgactccgtg agataaagag gcattttcac tttgacctaa 70080 

ccgatagaga ccttgcaacg tggtctctcg tgtccaggac tagatctgta tctgttgtga 70140 

ggcatttttc ctttgaatcc atagagcaag ccattcagca gttgttgcgg tgccgggagg 70200 

ctgctgagag cttcttgtca gcagagcaca ccgtactggg ggaaattgaa gatggcctgg 7 0260 

cccaggccca tgctacctta ggtatgctac cttaggtata gccggagttc tccttccctg 70320 

ccgtgtgttc agtgcggccc ttgccttgtc tgtttggttc tctcttgcca tctgaattga 70380 

cgctcttctc cctcccattc tgcattcctt gcccccagag ccttaggcta atggtgtttc 70440 

ttttccggaa tgagacattt ctcttctcac agggaactgg ctaaagtctg ctgcccatgt 70500 

acagaagagt ctccaggtgg ttgaaactcg ccatgggcca tccagtgttg aaattggcca 70560 

tgagctcttc aaactggccc aagtcctatt caatgggtag gcctttcttt ttcctagtgt 70620 

ttggccaggg cacacagtgc tctgtgtttt cctaggtgct tctgtgtatg gctttttgct 70680 

acagtgcttt aaagcatgtt gaaactcttt tatttcctct ttaggacaga tatttgccct 70740 

ctgcttcact gatagacttt aagctttgaa ttccttcctg aggatgtgga gaaagccatt 70800 

aggtctgcat ggagcttccc agggaggatt tggaggcagc ctcacccgcc tctagcattc 70860 

ctgtctgctt aatcacacct cccttggctg cctcagtccc tgctctctca actccagggc 70920 

tcggcccttt ccctggtttg cctcttattc cttttaaagc agtggttttc aactagaagg 70980 

gattgcaaat ggcatttggc agtgtttaga gacagttttg attgttatgg ctgccagcat 71040 

ctagtaaagg ctaaacctac agtgcacagg accgcctcca cagtggagag acccaagtta 71100 

gctatgtgaa ggctgagaat ccctgctttg gagattaaaa aaggaagctg agggaaccac 71160 

tcagttggaa gcacccttgg tggcatgcac aaggccctgg ttctgtccct agctctgcac 71220 

aaaaaataga atacaaggaa gagtaaccct aatgagctgg tccctcaccc agtgtgccac 71280 

tgaggtcact tgaagggaag tctagcccca atttagtatt ttttgtggct gccatacctc 71340 

cagccttgat caaatctcat ggtatacatt ggtaagaaaa agggtttgaa acatagacct 71400 
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gatactcgga 

ggcaatatca 

ctggataagc 

ccagagtcta 

tgtctcttat 

gtgccccctg 

aggagtccac 

ggttggcagt 

actgtggccc 

actcgtcatt 

gagctctggt 

ccttcccacc 

cactcacctg 

atgcccagaa 

cgggcttggt 

caaggccagc 

tcttgaaaaa 

aaagggttgt 

tctcactaaa 

gtgctgtgtt 

aatatatagc 

atatcctgct 

ataaatacgt 

accagtgtcc 

agcactgatg 

acctgaaagc 

gtctagcctc 

ggccaccttg 

aagtccaggg 

atgagtctgt 

gggaagccgt 

cactccaggg 

ctcattggta 

tgctgctcat 

ggtagtaata 

gtacattatt 

catacactct 

tgtcatggca 

agaagctggt 

gagtctcctg 

tttggggtag 

tagtccaaac 

gtgctcttcc 

aagctttttt 

ggaagtcctg 

tgatgccctc 

caaagcactc 

cttgagaata 

ctattatctc 

aaaacacctg 

ctgcttctcc 

tgtaactcca 

attatacaga 

nnnnnnnnnn 

nnnnnnnnnn 

ctctctatta 

atccactact 

ttcaagatct 

aaaacagaaa 

caagttgtca 

gggctgtagc 

tggggtttga 

aatgacatat 

aggtctctct 



catggaaaca 

gagagagggc 

ttttataaat 

tcacaaacat 

ctatactttc 

taagaccctg 

tttcgttgag 

gcctgaagct 

tgagagtgag 

cgtccctgtg 

gatgactgag 

tgacctgagc 

ccatgctggc 

aagactaaat 

ggcacacgcc 

ctggtctaca 

ccaaaaaaat 

cactgatctg 

cgatgaatct 

ccactccagt 

aagaaatcag 

tccttaaatt 

cttactgttg 

taattcttac 

caaggcaacc 

agaccactct 

aaacaacatc 

cccagcttga 

cactccagga 

ctgcagggca 

gagtctgtct 

ggagccatga 

gactgtgagg 

ggtttccaac 

gtccagacca 

gctgatgacc 

gctttttaga 

ttacagtgag 

gttaaaagac 

ctgtctgtta 

agctttttca 

tcatgccttg 

tcaaaggata 

tagatgcttt 

ggtttgattc 

ttctggccta 

atacataaaa 

cctgtgctta 

agcactcaga 

tctcaaaagt 

agagggcctc 

ggtccatgag 

gagagagaga 

nnnnnnnnnn 

ntctagagtg 

tgctgcctgg 

tctgcctctc 

tagatttcca 

gaaatcaggt 

aaactttgct 

caacttggtc 

attgatttga 

tacaaggtgt 

gctcaagctc 



gtatgtttgg 

atcagtcggg 

gctggttaag 

acacagtttt 

agatggtaag 

ggggatgcag 

actaaggaag 

ctgagtgcca 

gaggtccggg 

gggcccttgg 

atgttctgtt 

ctagtgtctg 

tgctgcttgc 

ccagtatcta 

tttaatccca 

aagtgagttc 

aaaaaataaa 

caggcagctc 

gtttccctgg 

gagaacttca 

tatgactgct 

tttcattaag 

gcagtttctt 

tcagtggctt 

ctgtagcatg 

ggctgtttca 

ctcctgcctc 

gactgcttct 

ggagccgtga 

ctccaggagg 

gcagggcact 

gtctgtctgc 

ctacatctcc 

tgagacagag 

caagggacag 

agtttgtttg 

acttgctcat 

gaaataaaca 

agtattcaaa 

atttaccctg 

ggaaaaaagt 

tgttttattc 

tatttaacgt 

gtaagatggc 

tcagcagcta 

tgtgagcact 

ctaaattcaa 

tattataatc 

agattgagac 

aaaagggctg 

atttcagttc 

atccaacacc 

gagagagaga 

nnnnnnnnnn 

tttcaggttt 

aactttctat 

agtactggta 

aagaagccgt 

acacttagag 

cttaattcta 

gagcccttgc 

acccagtaga 

ggccttgttg 

tgcccagggt 



tcagagagag 

ttagacacga 

gttttgaatt 

ctcaaatctg 

accctgaggg 

tggacccgac 

ggaactgaca 

tctggaaggc 

agctccggga 

tgtagagcaa 

ggcttggagc 

gcacagagag 

aagagctaac 

aaggctgctt 

gcactcggga 

taggacagcc 

aataagtaaa 

atgctagcct 

aacattccta 

attcttagtc 

tacctcaaga 

actggtgata 

cctgggatgg 

tcattgagtg 

acttcatagc 

cttgcagaca 

agccttccaa 

tactgccatg 

gtctgtctgc 

agccgtgagt 

ccagggggag 

aaggcattcc 

agatgccccg 

ggaaggactt 

atagctatgg 

gatgagttag 

cctagccatg 

gtcctgaagg 

catctgcgga 

acaaggaggt 

ttagtcctac 

ctaaaagccc 

tttatatatc 

tcagtggtta 

acaccagctg 

gcatgtgcgt 

aaaactcttt 

tagtatgagc 

tagatggtca 

aaaaagtgtc 

ctaataccca 

tctggtcgtc 

gnnnnnnnnn 

nnnnnnnnnn 

tttttgtttt 

gtagaccagg 

ttgaaggcat 

agcctagaaa 

ctgtagaatg 

aagagagaag 

atgaagctgt 

ttcgtatatt 

gaggaattgt 

agaaagggag 



cgaaggacct 

gcattccaca 

tgcccaattt 

ctttgcagta 

cagaggaact 

tttgtgttct 

agcttccctt 

agaaaggatc 

aatgaggtcc 

tcatcctcac 

tctcatcaga 

cacttgaaaa 

tgccctctga 

taaagggttg 

ggcaggcgga 

agggctacag 

aaataaataa 

aggcttttgg 

tggtttctag 

ttgtattata 

gacatacaat 

tataatttgt 

caactctgta 

ttcttggcac 

acaggcctcc 

gaatcttact 

gttctaggtt 

tctcttcagg 

agggcactcc 

ctgtctgcag 

ccatgagtct 

aagagcagcc 

agtgctgtgg 

tgcccctttc 

ggttttctga 

tgggaaagaa 

cccaaggagc 

tgcctggcag 

ctgggaactg 

gacttgaaag 

agacaactct 

tgtcacactt 

aggccttatt 

agagcatgta 

ttattccagt 

agtgcacaga 

cattgtctca 

cagccacggt 

agagctagag 

tcagcagcta 

caccgagtga 

tgcataagct 

nnnnnnnnnn 

nnnnnnnnnn 

tttttttttt 

ctggactcaa 

gtgtcaccac 

aggttaataa 

tcagcatgtg 

ctgtcaaaag 

gtgtttactc 

tgaatgttta 

caatttgggg 

cctcctcctg 



gatagacgag 

gtgagcagct 

tgtcaggatc 

tgcccgtgaa 

cagacccttt 

ctgcacagaa 

tctggcttca 

ctgttggtgc 

tgcttactgg 

cctcaagaag 

gaggacggga 

cagattgaga 

tggaaacccc 

tcactgcagc 

tttctgagtt 

agaaaccctg 

ataaataaat 

ctcgatttca 

tagtaatgaa 

attgaaaaat 

tccacttaca 

gaatggagaa 

ttggtttcct 

tcactgtcca 

ttgttagcac 

ctgtaagcca 

tataggaaaa 

ctcacacatg 

agggggagcc 

ggcactccag 

gtctgcaggg 

atgggcgtca 

ttgtgagcac 

cctaaggatg 

ctcatcctta 

gacccaagtc 

agccgttgac 

cagcttttca 

ggcagcattt 

gtttgttttg 

atagttattc 

tgtaaaatag 

ctgtgcatgg 

ctgctcttct 

tcctgggatc 

caaatgcagg 

tgtgacctag 

agcaacacac 

tctgggttac 

agagcacaca 

ctcaaccacc 

cctacactca 

nnnnnnnnnn 

nnnnnnnnnn 

gagacaaggt 

acttatagtg 

accccactac 

gtactgattt 

acatttgtga 

acttgaactg 

cccagcactg 

cctcatgggg 

gtgagctttg 

gctgtctaca 



71460 

71520 

71580 

71640 

71700 

71760 

71820 

71880 

71940 

72000 

72060 

72120 

72180 

72240 

72300 

72360 

72420 

72480 

72540 

72600 

72660 

72720 

72780 

72840 

72900 

72960 

73020 

73080 

73140 

73200 

73260 

73320 

73380 

73440 

73500 

73560 

73620 

73680 

73740 

73800 

73860 

73920 

73980 

74040 

74100 

74160 

74220 

74280 

74340 

74400 

74460 

74520 

74580 

74640 

74700 

74760 

74820 

74880 

74940 

75000 

75060 

75120 

75180 

75240 
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gaggacatag 
ccaagtctgc 
aactgtaagc 
ctcttcataa 
cctttaatcc 
ggtctacaga 
aaaacaaaaa 
ggagtacaaa 
tcaaaacgaa 
tcttgctttg 
gcattctcca 
gcctgcctgc 
agacatgcct 
acaaatggtt 
agccagtgct 
tggttggttg 
tggaacttgc 
ggataaagtc 
tcattccagc 
tgatcaccct 
tctgagtgtc 
gagaatgagt 
gggggggggc 
acagcctgca 
ccacctccat 
gccttaggat 
tggagggtcg 
gagcatcctg 
aaagcaacct 
cacagaccat 
ataagtgggt 
taatatgctg 
gccccagggc 
aaagcaacgt 
aggggaggcc 
cagttcttga 
tgtattgtag 
gctcatgggg 
ccagccccac 
catatgctac 
tcaccaccat 
ggggaagccc 
catttccctc 
ctcctgaagg 
ctccagaggg 
atgagagctg 
catcttcctt 
tggattccac 
cctctttctg 
cctgagcaca 
tgaagcaact 
ttatgagttg 
ctactgactc 
aacagaggca 
gtagagagcc 
gatgccaagc 
tccttttgtc 
tcttcagggc 
aggttatcag 
ctaccctgtt 
taaatgacta 
agtggtaggg 
gcatggaaac 
cacccagaga 



tctcctggct 
ctgcacaatg 
cagccccaat 
caataaaagc 
cagcacttgg 
gtgagttcca 
caaacaagca 
gtcgtctcag 
agaaacgaat 
tttttggcgg 
ctgaggaaca 
ctgcttgtta 
ggtatctgca 
gtaagctgcc 
cttgttgttg 
gttggttggt 
tctgtaggct 
atgtgccacc 
ccttctttgt 
ccctcagcct 
agatcttgaa 
tctgataact 
aagccccaga 
catctcagtg 
ccctacagac 
ctgacatcat 
tctttctcga 
agccactgtc 
gagctgtcag 
aggctccatg 
gttctctctc 
gtgtgtggta 
tttcctctaa 
aaaatggaaa 
tgtggaaggg 
gcaggggtgc 
agtgtctggg 
gcagagccag 
gcagccattt 
aagcactgtg 
caccacacac 
tcacccttga 
gtggaccggg 
tctactcgag 
gcaacaaagc 
aatcagcacc 
tgacttgctg 
cgtttgcatt 
ttcctctgtg 
cacagcctat 
tcacgccagg 
ggggaccagc 
ctctcaaaca 
aggagctcct 
cggtgaatga 
ttcataactg 
ctctgagagc 
gtggaaagaa 
agctacagag 
ctgaggagga 
aaacacgata 
attgtgtgtg 
tctgagtggg 
aa gggcagag 



gccttcagat 
ccatgcttcc 
taaatgtttg 
ctaactaaaa 
gaggcagagg 
gaacagccag 
aacaaatgcc 
ctagtatgtt 
gaatagataa 
tgctgaggat 
ccctggcgag 
tgtgtatgag 
gaggccagaa 
atgtaggtat 
gttttattgg 
tttcttgaga 
caaactcaga 
aactccagac 
gttttgagat 
cctgactgct 
aacccattct 
atttctcact 
aacattgata 
gccatctgtg 
cttgcagttg 
ggggactctc 
gtcaaacttg 
cccagaaagg 
cagcaagaac 
caccacgggt 
tgacttggat 
cgtgcctgct 
gttaggatcc 
ttaatcaaac 
tctcctgagg 
gactccagga 
agggagctgt 
gattaaacct 
cactggtcaa 
gccttcagat 
acacacacac 
agtaagccac 
caatcttccc 
ggttgcccgt 
tggatttgac 
ctcactttga 
gggcatctgc 
ccccgtccca 
gtctcactgg 
tgtttagaca 
cataatgggg 
tgagactcta 
caattctgga 
tgtaggtgct 
cagactctgt 
cccgtgaggc 
ccgaggaggc 
ataatgaccg 
agaagagtgg 
gccagctctg 
ggaggccact 
tggcccccac 
ggaaattcag 
agggccgtgg 



caagatgtag 
taccatgatg 
tctttataag 
cacattcctg 
caggaggatt 
ggctacacag 
agcatttggg 
tgaggccagc 
acatttgagt 
taaacccagg 
tgcctagtct 
tggtaacctg 
gaggatgttg 
tcagaattga 
gggcaggagg 
cagggtttct 
gatctgcctg 
aagcagccag 
ggtcacaaag 
gggggttaca 
cgtgaccttg 
cctcttcaag 
acttgcccaa 
ccatatccac 
agatcagagt 
acggtcctgt 
gttacccact 
ccacaggtcc 
acaggagccg 
tctggctatc 
caccacgctc 
ctcctgtccc 
actcttgcta 
cattcctgcc 
ccttaccact 
gttgggggct 
gttatcctta 
ggtctgattc 
ctaattcaga 
ctccctctgg 
acacacacac 
gccaaggaga 
tttcatctgg 
gaggatccgg 
tcgaagttct 
aagcatgcca 
cggcttgccc 
ggccctcctg 
ctctttcctg 
tcatggaagc 
cgtgcctgcc 
tagactttga 
agcactcttg 
gtggatgtca 
tcgatgttca 
ccgcatcagg 
cgatattgca 
ccttgctagg 
gatgtggtcc 
ctcacggcag 
taaggaacca 
cgcccctttc 
gtccacctgc 
ccacgctgag 



aactcttggc 
ataatgaact 
agttgccttg 
ctgggcagtg 
tctgagttcg 
agaaaccctg 
aggtagagtt 
atggaccaca 
gtccagtttt 
accttgttca 
gtctgtctgc 
catgtctgtc 
gatcgcctgg 
acctggtgct 
tagttatttg 
ctatgtagcc 
cccctgcctc 
tactcttaac 
tacaactcag 
ggtgtgtcac 
atccttaaaa 
aaaaggaaag 
agttacacag 
cctgcccttc 
ccaagccgta 
cctcgtccaa 
gccctggaag 
acgctgtgcg 
ctgggtccca 
ctcctgtagt 
cttctgttta 
ctggggactt 
ctactccata 
acaagaataa 
gtctagaagg 
gctctgagag 
ccgtgaagat 
aaaaagccag 
aacacttggt 
cctggtacct 
acactcggcc 
aggatgctga 
gcctgcaccc 
ggcctggtcc 
tgagaggaaa 
gaccccagct 
agaccctggc 
ctgtctccca 
cccaccagtg 
atacagacaa 
cttcagaagc 
gaggtggggt 
aggttcttct 
gggttggtga 
gctcctgcca 
atagggacgt 
gatgttttag 
aagagctcta 
tgaagttctc 
ctgtacccct 
aggtcgagtg 
ctgttcctct 
agccttcttc 
tggagacagc 



tcctccagca 
gaacctctga 
gtcatggtgt 
gtggtgcacg 
aggccagcct 
tctcaaaaaa 
aagaagattg 
tgagacgttc 
ttcctttctt 
tactaggcaa 
ctgcctgcct 
tgtataccac 
aactgggatt 
ctgaaagagc 
gttggttggt 
ttggctgtcc 
ccgagtgctg 
cactgagcca 
actgagctct 
tgtcctcaat 
caaaccctgg 
ccagagaaag 
caaaattcag 
tctgacctcc 
tcgtaagatg 
atgaaaatcc 
gaaacagatg 
tccactggcc 
gcatgtgtgg 
aaactcagaa 
aagtggcctt 
ggagtaggaa 
agatggtcac 
aacagatctc 
aagttgacag 
caggacagca 
gaggacacgg 
agatctgtgc 
ctgatatgct 
gcattcaggt 
agagacaagt 
gggcactggg 
agttgttaat 
cataggactt 
cagatcaaag 
tcctgctcag 

tagggaacag 

gagcccactt 
ccaggcctcg 
cccaggccaa 
agaggcagct 
gggggtgggg 

caggggcagt 

tcaggtcgta 
gagagaaaag 
tagacatcaa 
ctggacaaga 
agacagggca 
ccatcgtaac 
agaacctggt 
ccacttacaa 
gacggcggca 
agttgacact 
aggacgttgc 



75300 
75360 
75420 
75480 
75540 
75600 
75660 
75720 
75780 
75840 
75900 
75960 
76020 
76080 
76140 
76200 
76260 
76320 
76380 
76440 
76500 
76560 
76620 
76680 
76740 
76800 
76860 
76920 
76980 
77040 
77100 
77160 
77220 
77280 
77340 
77400 
77460 
77520 
77580 
77640 
77700 
77760 
77820 
77880 
77940 
78000 
78060 
78120 
78180 
78240 
78300 
78360 
78420 
78480 
78540 
78600 
78660 
78720 
78780 
78840 
78900 
78960 
79020 
79080 
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ccgttgggct ggcactggat ctcaggcggt acagatcgta gccgaagttg gagacagctg 79140 

ctgccagctt gttcacaggg accttgaaga aggggtcctc ctcctccacg ggctcgcccg 7 9200 

tgctgtccgg gactggggag ccctgggtta gaatacaagg accagtaggg aggcacagtg 79260 

agtacatcac ctcctggttg ggttggtcct ctagtccctg gggccatgag tctgaggtca 79320 

gaatgagtgt gtgctctctg actccacaac ctgtgtgctg ggaggtgggg agtgggaagg 79380 

gcaacacaaa agggcttgcc agacctgaac tgtggtctga gaacctgaag cctggcccac 79440 

tttaaaataa aacttgtagg gctggggaga tagcacagta gataaagtac cagcatgcaa 79500 

gttcaaggac ctgggttcag tccccagagc tgggcacggg ggtgcatgct tataatccca 79560 

acactgggga ggcagagatg ggcaggtcct ggggctcatt ggccaatcag cctgaactaa 79620 

tcagcgtatt ccatctcagt gaggggtcct gtttcagagg gcctgaggaa tgactctggg 7968 0 

ttgactacta gcctactctg tgtctgtttc tgtctgtctg tctgtctgtc tgtctgtctc 7974 0 

cacccctctc tgtccccttc cctctgcagg gaacttcctc accaccacca acccccaaag 79800 

aaacccaccc tcagaccagt cttccctatt cagcttgctg gctggtccta gtctgcctag 79860 

gttctgctgt gacgccctcc ctgtctttcc tgacaagcca tcccctctga ctagacccga 7992 0 

gaggaatttg tcgttttctg acctgttttc agtgtcagcc tcttccttat gagactttct 79980 

gcttttttgt tttgttccca gggctttgga tcaaagctgg gctcttacat acgttaggca 8004 0 

aatgcttggc cacccagctg tacctcccgt ccctgttgct tttcggtttg gaggactttt 80100 

tttttagttt tctgtttggt ttttggtttt gattttttgt tgtttgtttg tttgttttga 80160 

gacaggattt cgctatgtga ctctagctgt cctgggactc actatgtaga ccaggctggc 8022 0 

cttagattca gagatccacc tgcctctgcc tcctaagtgc tgggattttt agattttaat 8028 0 

ctgtacctac caacctcaaa ggaagtgtcc atggatagag ttcagtacta catcatgtgt 8034 0 

gtaacatgtg tgagggcctg ggcttcaccc ccaacagaga agggggagtt agtaggtgag 8 04 00 

gaataagtga ctggctagtg gcaagacagt attgtctaag gtcactaagc cttaagccac 80460 

acttaaagcc cacaatccag gtctaatatg cccatctgcc ttgtccttgt gtgacatgac 80520 

cccaccccta cttcctccgt atagtggcag ctcctctgga tcctgaaagg agagggaaga 8058 0 

tattcttgtc tcgatgttaa agtaaccaag gcctagaaga gtgaaggcca aagcccaccc 8064 0 

tggatccagg gctgcctccc tgcactgtct cttctgctgt- cccacctacc caccctactg 80700 

acctcagagc tgctggggac gttctggctg ctgccgtgcc cgagcagggc tccagtccag 80760 

aggagtagca ccagggcctg catcccggaa ctacaagaga aacaagagag caaacgactc 8082 0 

ccctcaccca caccctcccc tgccactgca cattgcacac tgcacaggga caagagtcag 80880 

gcagagtgag cccttcccct ccctccagct ctcagcccca agtggaccct tgacttgagg 80940 

tcttccgtcc ctgacctgcc cctgcacttc tccttgagct gtgcccccat gttggttcct 81000 

atcaggagac ccaccttccc atctaagctc cagcacaggg aagacccagc agcaggctct 81060 

tcaggcccca agacaatgct ctagcacaaa cacacaccaa ggcttttccg tggaggcaca 81120 

cggccagctc ctttggtagg atttggaacc ctgtctcagg atgggagcag agcccaggtc 81180 

atagacttac agaacatctg gtctggtcct gctatccacc aatagttctc tgaccaaagc 8124 0 

ctatgttaaa gacacacaca cactttcttc taggtaggtt cttgtgtatg tagcccaggc 81300 

tagccttgaa gttgcagcta ccctacttca tttgcctcct gagtactaca atgccaggtg 81360 

tgagccatca tgcctgactt gactcccttc ttctatttca agagcaattc ttagttaaga 81420 

gggtatgaac cagggccaca ctgccnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 81480 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 81540 

nnnnntctta tttttagctt gtttgttttt cttacttgag acctggggag gggaggtgta 81600 

tgtgtcttcc atttgcttct tctacctaat aaagtttctt ggtgatgttt gggggggggg 81660 

gagggggtag gactcaagaa gggattctcc cataagctgt tccgtttggg tagtactatg 81720 

taaggaagtt acaggtgggc agagctcggc tctgcctgac tggcgtgctc tgaggtaaag 81780 

gtgagatggt gcaagatttg ggccctcagg agttggctct gttggccctg taccttctgg 8184 0 

tctgtgggta aggatgacca gtaggtgaga gatgagggaa ccagaacaga aggtgaaagt 81900 

tagtggggcg gagccccaga ctagtcaggt gggggtaaac tagatgactt tctggaaccc 81960 

caaggggctc ggagactagt ggtgttggag aagacctcta atgtgttgta aggcctctga 82020 

actcagtagc cgaacttgat gccagaaagc cccaaactgc taaacccaag caggagcggg 82080 

acgccatccg ttccatggct tcacccgagg tggccccatg gctgcgccaa tcaatgagca 82140 

gccgagagat aggggcgtgg acaagccagg aaaagttaca gcacgctgga aagataatac 82200 

aggccaggaa gccccaggca cagcagggtg gaaaagctag atcccgattc tgccggaggg 82260 

gggccccttc gaggtcccgg gcaccgggtg ccaggatcag agaaactgac tgaaacctag 82320 

ctgacctgcc cagaccatgg catcctgggg actccttgtg gctggcgctt ccttcacggc 82380 

gtttcgggga ctgcactggg ggctgcagct gctgcccacc ccgaaatctg ttcgggaccg 82440 

ctggatgtgg cggaacattt tcgtttcgct gatacacagc ctactctctg gagtaggggc 82500 

gctggtcggg tgcggaactt ggggactgac aaagcactga ggggcggggg tggaaaagag 82560 

ggcctggaag actgaagttg gaaccttttg gaatggaact ggtttgggtt gtggatgggt 82620 

gggagtaccc agtgggagaa tggatctagg tctgggagaa attgacctta gctctttgtc 82680 

ttctccaggc tgtggcagtt tcctcaaatg gtcaccgacc caattaatga tcacccaccg 82740 

tgggcacggg tcctagtagc agtgtcagtg ggtgagtgta cagaaaaggc tgaatcggga 82800 

aaggccttgt tggaccggga attctaggtt cctcccccat ctttggaatg gagcagatgt 82860 

tgctggaggt ttgctgtgag gaattaagga cctgagaaaa gtgggacttg agatatctag 82920 
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gctgtgcatc 
cgctgcagat 
ctgtcaccat 
aactcaaggg 
ctgaaaacct 
ctcctgggca 
aaaaaaaaaa 
gtaagagaac 
caggattgga 
aataaatact 
catgcgtggg 
ctgaaccttg 
cactatgtgg 
ctacggaagc 
tgggccagcc 
tggttgtccc 
ctggtcaccg 
ttgcagtctc 
gttgccagga 
ccccagggag 
caaggactag 
gagagagaga 
aactttaaaa 
atctctttta 
aacagtttct 
caatctccaa 
cttgggtcac 
tacctgtact 
aaacgctgca 
gtggcacatg 
ggccagcctg 
agagaaagaa 
gttcaatgcc 
tggctgactg 
aggagcttcg 
atgatttcat 
ttatcctggt 
ttaaggtttc 
tctctggtgt 
cggggggtgg 
aaggtcctga 
aatctgatgc 
taaataaata 
atatctctgg 
gcaccaggcc 
gatcagaaaa 
caggcctgat 
gctggcctgg 
tcacaggtgt 
tggtactgag 
catgggctct 
caaacccagg 
ctggtgataa 
gtatgtgtgc 
cccctttaat 
ctggtctaca 
taaccaaaaa 
ggtctgcaga 
acattaaagg 
tgttttgttt 
ggagactctg 
ccacaaagaa 
gacacacact 
tgtggtgcat 



agctctgagc 
ggagttgata 
ttggcggtaa 
cccaacctct 
atcctctcag 
ctgcccaagg 
aaagccagga 
ctctggtcca 
ggcagtccta 
aatttttaaa 
tgtccctagg 
aaccttgcct 
gcttctctat 
tactgctgct 
tggccaccct 
ggcagcacta 
tgggcagcat 
agccctaccc 
acacttccac 
gtggggctgg 
actgtatcat 
gaacattgta 
gtcccacagt 
aaattcaaag 
tccttcaaga 
ccgtccaatg 
ttctccagcc 
ccactgctgc 
tgaccccttc 
ctgataatgc 
ggctacaaga 
aaaactggac 
aagttggcag 
aacaccttac 
ctgaggagag 
cagtctcttg 
ctgcagtgtg 
tctggctctt 
gtgtgtgtgt 
agggggctgg 
gttcaaatcc 
cctcttctgg 
ttaaaaaaaa 
atccccctgc 
aggctagaac 
cagacccagg 
aacctgcggt 
aattcacaga 
gtgccaccac 
gatagaatag 
ccttccccaa 
tcttgtgtga 
cttatagcct 
ccacacaaac 
cccagcactt 
aagtgagttc 
aaaaccactt 
ccaggctggc 
atgtcccacc 
tgttttttct 
aagtcagggc 
accctctctc 
tgtaatctaa 
tcaagagcag 



gaggagcctc 
tgctgtggaa 
gactctgaag 
ctcttcagtg 
acttggattg 
agtttggttc 
atgaaagcta 
gggctgctca 
aaaaatgctt 
aagctgatac 
ataaggagct 
gcaggtagtg 
ggtatccctg 
ctcccataag 
ggtcctcttc 
ccagctgtct 
aagcatctcc 
gtttatcctc 
tctcagtctg 
gaagaggaga 
cagagagaga 
tgagatctcc 
ctttacatat 
tctttttaca 
gggaaaatat 
tctgggatac 
aggccctttg 
tgctgctctt 
agtcctgggc 
tagcacttgg 
gactctgttt 
atgacagccg 
tgcctagttt 
ccagcaactt 
aacattgaac 
tgaatgtgat 
gtgtggagtt 
ttctttcctc 
gtgtgtgtgt 
agagatggct 
caacaaccac 
agtgtctgaa 
aagaagttgg 
ctctacctct 
attctctgat 
gggccacaga 
tttttgtttg 
gatccacctg 
acttggccta 
actctcacca 
aggaaaaata 
tggctcagca 
tgttatgctc 
acacaatcaa 
gggaggcaga 
caggacagcc 
taaatattat 
cttgaactca 
actgcctggc 
aaaaaatttt 
cagccaggtc 
aacaaacaaa 
gcccttgaga 
cctgggtggg 



atagtcttct 
ccagacattg 
ggagaggcca 
ttctgtcctc 
agttggaggg 
tccccacaaa 
gggtatgcta 
tgccccctag 
gggtaatata 
ctggaaggat 
gggacttgtt 
agctgcctca 
cttctggagc 
gccccatcct 
cgccttctgc 
cttgctctgg 
acagggatcc 
atgcacaagg 
aaaggtgtgg 
tggtagccca 
gagagagaga 
attacagtca 
tcttaaaatt 
attaaaagtc 
cagggcacag 
aactcacgat 
tagcacacgc 
ggtggtcatc 
cttcaagaga 
gaatgacaag 
caacaaacaa 
gaacattatc 
gataagggtc 
gctgatagac 
cctattgtat 
tgctttgagt 
gtggaagctt 
ccggtttttg 
gtgtgtgtgt 
cagcggttaa 
atggtggctc 
aacagctaca 
catggatgat 
tagacactgt 
ctacaaatac 
aatgtcttag 
ttctgggggg 
cctctgcctc 
actgcctgag 
gctcttctga 
aatgtaagaa 
tcaaagctac 
tcctttgacc 
gaaataaatg 
ggcaggtgga 
agagctacac 
ttttattttg 
cagagatcca 
taaagattta 
tttaaaaaga 
tactgagtga 
caaaagagaa 
ggctgagaag 
ctacagaaaa 



ccggtgcctt 
gcccaggcct 
ggtagtaagg 
tgacttttcc 
aggttttgac 
cctccagctg 
tgcaaatagt 
ataagggtca 
aagtgaataa 
gaggcagaga 
gggcacaggt 
gcaccgctgt 
tgaactccat 
tggccttcag 
ctctgggatg 
ttctgctttg 
gaattctgac 
aaaccaagac 
aagttttctc 
ctgcatagtc 
gagagagaga 
ggaaatcagg 
tcaatctctt 
tcaactgtgg 
tcacaatcaa 
cttctgggct 
gtcatcctct 
tcatggtact 
gaagactaga 
cagaaggatc 
acaaacaaac 
tgacattcat 
tagccactct 
tctgccttcc 
atgaataaat 
catttttctt 
tgagttggga 
ttttgtttgc 
gtgtgtgtgt 
gagcgccaac 
acaaccatcc 
gtgtgcttac 
gtagtgaaga 
gagtatggaa 
ctagagtatt 
taggtaaaaa 
cgggagaggc 
ctgagtgtca 
tttgagcatc 
cttccacatg 
ttaaaaaaaa 
ctcccgccac 
tccacgggca 
cagccaggcg 
attctgagtt 
agagaaaccc 
ttttgtttat 
actgcttctg 
ttttttcttt 
accatccctc 
gctctagggc 
cagacccaac 
ttcaaggcta 
agaaagaggg 



caggttattt 
gggaccttct 
gagcatgtcc 
acaaagcccc 
tggctagcca 
atcataaaaa 
gtggcttggg 
gcagaaaggt 
ataaaaaata 
gtagaaaaaa 
catgcaaagc 
tgtgtctggc 
ctgtttgcat 
agtaagcagt 
gatgagtctg 
tgtggctggg 
caaggatatc 
acgtgagcct 
ttctgtcagc 
tactatgtag 
gagagagaga 
agatctaaat 
taaaatatcc 
gctccactaa 
aagcaaaagt 
cctccaaggg 
aggctccaga 
ggcatctcca 
gcctggcaaa 
agaagttcaa 
aaaccaaaga 
aaggtcctga 
ggtaatacca 
agcaaaaggg 
tgctgtgcaa 
ggctccagtg 
gggtttcctg 
ctggtggggt 
tagaagttgg 
tgctcttcca 
gtaacaaaaa 
atataataaa 
ctggcattag 
gtgtaccacc 
attcctctat 
cacttgcttt 
tggctggctg 
ggtaccagga 
agtactcaca 
tgccctgcag 
aaaaaaaaag 
agctgaccac 
tgctgtacac 
aggtggcaca 
cgaggccaac 
tgtttcgaaa 
cctggaactt 
cttcccaagc 
ctttttgttt 
ctagcactca 
agccagggct 
cagacctgag 
gccacaagtg 
agagagagaa 



82980 
83040 
83100 
83160 
83220 
83280 
83340 
83400 
83460 
83520 
83580 
83640 
83700 
83760 
83820 
83880 
83940 
84000 
84060 
84120 
84180 
84240 
84300 
84360 
84420 
84480 
84540 
84600 
84660 
84720 
84780 
84840 
84900 
84960 
85020 
85080 
85140 
85200 
85260 
85320 
85380 
85440 
85500 
85560 
85620 
85680 
85740 
85800 
85860 
85920 
85980 
86040 
86100 
86160 
86220 
86280 
86340 
86400 
86460 
86520 
86580 
86640 
86700 
86760 
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tggttaatga agatgactct ggaaaagtga aactcaagag aaagcccctc agatttgctt 86820 

aagacgagtt gagggtggag aaccgccaaa gcggacgagc cagacagaga ctgccaacaa 86880 

agttcaatcg gttcaggtac attacttcca aaacgccatt gccacatcag gatgcttcaa 86940 

tcagccaaac caacgcagcg actattgact tctgcatttc agagacttcc gtctctgtcc 87000 

agggcaatgt cactttagct ttcctttgca gaaaggaaaa gtccctgcct ctgatgtggt 87060 

agatcctcac acaccttctg ccagatccag acactggtat gactcagcct cggggagctc 87120 

tatctacaga gataagggta caaggcgtgt gtgtttaaag tatgtgttta aaagtacaaa 87180 

gtgagagtcc ctggaaaggg ctccctgccc tcaccatcac cgaaagcaca aaccttaggg 87240 

taatatctga cattcctgga aatgtatgta tgtattcatt atgtagccct gactgtcctg 87300 

gaatggggta taaaccagga tcgcttcaca tctcagagac ccatttgcct ctgcctccca 87360 

agaactaaga ttagaggcat gcactaccat acttggctca tgatttactt aactttattt 87420 

tatgttcacg aatgttagcc tgcatgtatg tgtgtgcacc atgtgcatgc ctggtgcccc 87480 

agaggccaga agaaggtgtt ggttggattt cctggagatg aagtcccaaa caactgtaag 87540 

cagtccaatg tgtgtgctgg agatgaaact tggttcatcc acaagagcag tatgtgctct 87600 

taactgtgga ggcatatctc cagcctcaga tttcccagtt aatgtttgct ttcgcaccca 87660 

ggcccatctg cgcatgcgct ggagacctcc tttaccgcct tgagcctcat tggccaattg 87720 

tggctgggag acttgcagat cccaagtggt acaagagaag aataaactgg tgtgctatga 87780 

actcacctct tctctgtagc cattggctga gcatactttg cctcaaccta ccgcccttcc 87840 

ttcccctaat cctaaatctt tgccctctcc aaatgtgctc ctcccccgca gtaatccagt 87900 

ggtcgctggg gctctagaga gatggggggg gggggagcaa cgggtacagc ttaaggcagc 87960 

tgcagcagaa cttttttgct gtatattgag tcttaaaaat tcatataaac tttgtgttct 88020 

gtttctaaat ataaccccat ctgtttcaac acaaaatgca acaacaaaat gtttcaaatt 88080 

gctatttgga ataattaaaa aatttcaata cttgatttaa aaatgcttta actttttaaa 88140 

taaattttaa atgttattat ttttaaaaag ttacaagttt aaaaaaaaga aagatagaaa 88200 

tcacataatg aaattaacca tacgcaagtg aggctcggtg cactggtaca cagttacagt 88260 

agccatttgg agtggaggcc atggcgcttc acattgaatt ttatactttc tttatagaat 88320 

attttttatg cacctatcta ctactgataa caaaacaccc- atgagagagt tagaattaga 88380 

catcaattag ctttgatcct ctgtcataac tcgtgtccac tccctgcctt agtcctacct 88440 

catccctgtc ctcttttcta catcttatac tgaatccaca cactcagttg tttacacaaa 88500 

cacatacatc actgtccann nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 88560 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnag 88620 

tgcctggctc tgttttgctg ttgttgtttt tgtttgtgtg tgtgtttaga tttggtttgg 88680 

tttggtttga ttttgttttt ttttagagaa tcttactatg tagctcaggc tgtccttgaa 8874 0 

ctcacagaga tctcttgtct ctgccttcca agtgctgaga ttaaaggtat acaccacctt 88800 

acctggcccc tttcatctat ctatctatct atctatctat ctatctatct atctatcatc 88860 

tatctatcta tctaaaattt atctgtgtgt gtctgtgtgt acattcccca gagcctgtgt 88920 

ggtagtcaat aagtaaccct cagaagttgg ctttctctaa tccttggatc aaacttgaat 88980 

tgttaggctt ggtagcaagc atgtttaccc actgagccat ttatgacccc atggcccagc 8904 0 

atcttccatg ggttctgggg acacaaatgt gtactttgat gtttacagga caagcgctta 89100 

accaaccaag tcattttccc agccccatcc tgactcccat taagtgttct ttcccccaac 89160 

ccaggaccaa atctagagga gtgtccatgc ccaacaaaca ctctgccaag cctctcccct 89220 

tactgctctt ctcccttccc ttccttcatt tcttcgttcc ttcttttctt tctttttgaa 89280 

acaggtcttt tctctgcatc ccaagctagc cttgaacttg tgatgtagct caggctggct 8934 0 

ttgaactcac agctgtcctc ctacttcagc ttcccaaaca ctgggattat agacctatgc 89400 

taccacacct ggctcatttt tcaaataaat aaaaagaaaa tcaaaaagtt cctagaacag 894 60 

tcacaggatt cacaaaaact ttggaaggag actaaaaatg gatttttaaa aaatgcttga 89520 

agcacaaaga gttgttgaaa gaagagagaa gaggaaaagt tagcttagta ggtagaagtc 89580 

aatcaagcct cacaccctga gttcaattcc tgaccctatg gtagaaggag aagagcaatg 89640 

ccggaaacat tatcctctga cctccagacc cgctgtggca cgtgcatgca cacacacaag 897 00 

caggagccct tggagggaag tcctagaaat gaatcttact gaagcaggtc tgccaggccc 897 60 

tgtgctcagc cattttattt ttcctttgtg tacccgacac gcttccattc tcaaagttgt 89820 

gagtctgaga ggaagtactc actgtgtccc cagtgagctt ctgtcttacc ctgggtcact 898 8 0 

tagatggggt cacttagtgg tagccttggt gtggagaaag agaacacagg tccgagtagc 89940 

cagtagacct gagtctttat atctgcaaag ggtgttgggg cataatcaaa tctccccccc 90000 

tccccggggt cctgatacca ggttgtatta agtgtatgtg catggtcctt ccaagtcttg 90060 

acacatcatc caactccaag tggctttctc atttttcctt gccagtagcc tcttggtgag 90120 

gaaatggctg aggaaaacag agttgcagaa agacagggcc atggcctggc tgcaggcttt 90180 

ctctgagtct gaagagggtc agcgactctg agaaatgaag ctatttctga gtgagagggg 90240 

ccaaagaagg aacacggcag agggagagcc cccgaggaga tggagacaga agccggagag 90300 

ggaccctgtg cgaggctgga ggggaggaag aggggggagg agtgagaccc actgtcatct 90360 

gttgggcaga gaggggctac attcatctgc agtatggtgt agaggggaca gagagtgatg 9042 0 

gtaacaggaa aaatttgggg ttgagggggg cagcctgtag ggctgggccc cagcagtgta 90480 

cagctaggta gagtacacag taactcccag aattctctgg ctccactaaa tccctgttcc 90540 

gctccgtgca gagtaaaacc cacacagggt ggatttcagt ctcctttgca cccccctcca 90600 
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ccccccctcc acccccagct ctggtcacag ccagtcagag ttgggggtgg ggcagatctt 90660 

gtaaaagagg ctggtgagga catcggaaag tctgtaccct ccactagcaa agtgccagac 90720 

gctccgtgac actttaaatg cctcagataa aacagtgaga gactctcctg gtggcaggca 90780 

agatgatggg tcagggacct cagcgcctct gaggctcaga caccaggata aagaataaaa 9084O 

acaccacgga gacccctgtg accccctcgt gcagagggag aatgccaatg tggcccagct 90900 

agctgctgga gcgcaagcct caaggcctgt gctagttatg agtctactgc tgctgccttg 90960 

tcccaataaa cccctttccg cccaggatta gtggacacgc cttgctcaag ccgagtccct 91020 

gatctgccac cttatcacac acatacaaaa atcccttgag gatggttacc atcctgggac 91080 

aaagcctcat tctctgctta acccaagtga cacctatatg gcagatccct gtgtccttct 91140 

cctgatgata acaacccttg caatccaata gaggggaact cgggtttcfcg tcagcttcct 91200 

ttatgctgat agaaatgtac tctgcatgtg gggagcctgc cttgctcacc ctgagacccc 91260 

atggggctgg ctggggcttt gcacatcatt gggactcaga gatgttgact acatgaacgt 91320 

cccacacttg gttgcacaag gcagaatgac aggatgttat gcctggtgtg tgagtgtgtg 91380 

tgtgtctctg tgtgtgtaaa acgcctctct ctggagccct cctgtctgtc tgcctcttgt 91440 

tcaatggctg cacaattgtc ctttctcttt ccaaggacct ctgtatgggt gtgtccttca 91500 

ttcagtgcct ttcctctgtg ggtttgtcct gctagccccc tgtcactgag aaagtcttct 91560 

gtctgtcctt gggttgtctg gctagaacac agacatcatt gtcttttttt tttttttttt 91620 

ttttttttaa agatttattt atatgtaagt acactgtagc tgtcttcaga cactccagaa 9168 0 

gagggagtca gatctcgtta ggatggttgt gagccaccat gtggttgctg ggatttgaac 91740 

tccagacctt cggaagagca gtcgggtgct cttactcact gagccatctc accagccccg 91800 

acatcattgt cttgcccacg actgctctcc agaatggtgg gcaggaggat gtgacccccc 91860 

acccccaggc accgggacac aacatcttct acacgtgtag gtcttgtgca ctggctttgc 91920 

tttcttcttc caagcaggtc tcccaggaaa tggcacttac agagattgaa gagtttaata 91980 

catgtctcgc tgcctctctt ttcgggaacc ccccagaggg agcagcagaa accagggctg 92040 

gcaggggctc taagctgcct gggcaaagga gcagggggta gcatggagcc ttagccaatt 92100 

tggaaagcac tgtgacccaa gcacattttg cagcagtaat gtcaaattct gccgttcagg 92160 

catgccattg atgtgcacgc tgccacacag aaaccagtga cacaaaggca cagccttctc 92220 

caccctcctg gtgcttagga actaacggct ctaatgagaa atgagagctg aaaggagaga 92280 

gacgggggcg ggccacagca gcgcaggctg gcactgcgtg ttggaggagg ctgacccact 92340 

tctcgtagag gtaaggggcc cactgaaatg tcacttaaat tagccaccac tcccaacact 92400 

agatctcctt tgtccccata cctcagcccc acgcttcttt ctttttttct tctttttctt 924 60 

ctcctctggg gcagcctcaa gcccagcacc cactttttag agctgtaaac caccctggtc 92520 

ctagaagccc tcttacgtta ggggatgaca ggaggtagag atcaggaagg agggagggag 9258 0 

gggaggagga aaggaaaagg gaggggagag agggaaagag atcgagagag catgcattca 92640 

tcacaaagag ccctcttttc tggctttttg actgcactgt gagttattta gccaacaata 92700 

gatgtttatg tattttttta gaacccgtat ttattaacag cctgaaagga gagagacgga 927 60 

gatttatata ggaagtgcag tgagttaagg ggggcaatta agagagcaga aagagatacg 92820 

gaacacagac ttgtaaaggg ttttgtaaca tccaatcaaa ggtgcttcag gtattttcca 92880 

aggaagcaga aggtaaaaaa aaaaaaaaat tgtcccatta gaagctgaca ctggatggag 92940 

caatggccca ggcggaactc ctgcttgaaa gaaggtgaga agggagggac acagaccagg 93000 

atccgatgag ccagagtgtg gccatagctg ggtcatgagg cccagggttg gaaggacccc 93060 

actaaagtgt gcactggcct ttccttgaca aaggatgcac ctatagctag gcgtggtggc 93120 

aagtggttgt tattctagta cttaggaggc tgaggcagga ggatcaccat gagtgtatgc 93180 

ccagcctgga ctgcatagca acacccagtt tcaaaataac aacaaaagga agtgggggtg 93240 

gggagggcaa catttggaat gtaaataaat aaaacatttt ttttaaaaaa agaaaggggc 93300 

tagtgagtta gttcagcggt taagagcgct gactgctctt ccgaaggttc tgagttcaaa 93360 

tcccaacaac cacatggtgg ctcacaacca tccataagga gatctacgcc ctcttctggt 93420 

gtgtttaaag tcagctacaa tgtacttaca tataataata aataaattct ggagtgaggg 93480 

ggccagagca agtagaggtc ctgagtttaa ttcccagcaa ccacatgatg gctcacaacc 93540 

atctgtacaa ttacagtgca ctcatataca taaaataaat aaataaatct ttaaaaaaag 93600 

aagaaagagg gtggggcagg ggagggaaga agaagaaagg taagaagcta aataaaaggc 93660 

acagagatga gcttcatgtg gaaacacagg cctgtagtcc tggcactcag gtggggttgg 93720 

ggggggctac agtgagagta tcatgagttc aaggtcaact tgggtgagac cttgtctcaa 93780 

aaaatacata ngcnaaaaaa aaaaaaaaaa acatagccag gcatgatggt atacatttat 9384 0 

agtcccagca cttagaggac tgaggcaggg cagaaagaaa aggaattcaa gatcaggctg 93900 

agctgtatgc agtcctgatc ctatcccctc cccccccccc ccagagacag acagacagac 93960 

agacagacag agagaaacac aaagaaaggg gccttcagat ggctcagcaa ttaaaggcgc 94020 

ttgctattca gaccccatga cctgagctca aagcctggga cccaaggtag aaggcaagag 94080 

ccaactccac agagctgttc tatgatctct atatgaatgc tggggcatgt gcctacacta 94140 

tgttgtgcac acatgcacag attagaaaaa gaggaggaag aaaaacataa gattgtttca 94200 

agaaaagaaa ggctggcttc ttccacgtca gtgtgagagg agggtctggc ccctttgtag 94260 

ccaggtcctt cccagtccag tgggggctga actgaggcag cggaggaggc aataacggag 94320 

ctttcccaac gcagtgtcca gcaaactcaa ctctacagcc tgtcctgatc cacagagaag 94380 

ccttcctggc tccctcacca atgcgggggc attggctccc aggctcctgg gccccccccc 94440 
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acacctgtgg 
ttggctcttt 
gggttgcccg 
cgcaaatggg 
agcttggaag 
actccagttt 
gaaacccgca 
aggctggggt 
agaggcagag 
accccagcaa 
tactgtgtgc 
tttcttctga 
gactctccaa 
cagttctaca 
tcattttcct 
tgccaggggc 
aagactgcct 
agtgctgagc 
ctagcatgtg 
ggttcatgct 
ccatcctcag 
acaaacaaca 
ccatctcaaa 
ctatcaagtc 
gatggctcag 
gcacccacat 
acacagatat 
aaattggaag 
gaagggggag 
tttggggttc 

gtgggggcc^ 

tattactatt 
ggatgtctgt 
tctggtagtc 
aaccttattt 
gaccctcctg 
tntccggatt 
ccgctggncc 
nnnnnnnnnn 
nnnnnnnnnn 
agctctaact 
caccagcaag 
tggaaacgga 
aggactctgc 
tgctttctga 
gcaatgaaac 
ttatttcctt 
aaaagaaatt 
tctctctctc 
atgaatgcac 
gcaaatggtg 
aatccctttg 
tcctcctcct 
tacccttaag 
tttgatgaag 
agaagggttt 
gcttttggct 
ggatgcccag 
taaaagcagg 
tgaccacagc 
taatcgcttc 
ttattccagg 
gtgccggcag 
ctggggggaa 



agtgctaggt 

ggtaatagac 

tggctccctg 

agagcatgga 

cagcaggtgt 

tttggcacaa 

gttgaggaac 

gtgaagaaga 

tccctgagtg 

acaaaatccc 

cactaaccat 

gactgtgata 

ccccagctga 

ctggttgaga 

gtccccaaag 

caccccatgt 

ggtgggaggt 

ggaaaatggc 

caaggaaggg 

ggtattctca 

ctacatagct 

acaaagcagc 

aagcaagtag 

atgtatatat 

cagttgagag 

ggcagctcac 

ccacgcacat 

taaaactctc 

ggcataggtg 

attatgcaat 

aaggtctcct 

agtgtgtggt 

ggtctgagga 

aggtttgcaa 

tttgcattga 

ttcctgcctc 

gctgcncacc 

ctcccccccc 

nnnnnnnnnn 

nnnnnnnnnn 

gctgttaaat 

gcccaggtcc 

cggaaagggg 

tccttccccc 

ccgcttagcg 

actaatgcag 

tatgtagaaa 

tctcttatga 

tctctctctc 

acaaagtagc 

gtgactggga 

cattcaaacc 

tacctgtctt 

gacaattata 

tcgatattaa 

gggttcgatg 

agggtagaaa 

agaaggtgga 

gatgagctgg 

tcctgccctc 

agaaagccag 

caaggactga 

ggcgggacct 

aggctaccgt 



gatttgctaa 

atgcctagca 

cagctggggt 

acagaggctg 

ctgggaacgg 

ggaacaagct 

agcgggtaag 

gcccatgcag 

gggctccctg 

atccafeaatt 

cagcctaacc 

gtggcgggga 

gccccttgtt 

acactcacgg 

cctagacaat 

ctccactagg 

gagataagga 

agaatgggct 

cctgggttcc 

acattcagga 

agcttgagac 

agaaggccaa 

catgtatctt 

gcatgtatgt 

cacttggatg 

aactgtctgt 

aaacaccaat 

taaggagaca 

tgtggtgtgt 

gtatacatgt 

cctcaatccc 

gtgatgtgtg 

tgctctctta 

agctagtgtc 

acatggaact 

ccccntgtcn 

tccccgccnc 

cccccccccc 

nnnnnnnnnn 

nnnnnnnnnn 

tcacactcct 

agcttgactt 

catggaagca 

catgcctgcc 

gaagacaagg 

tgggcttttc 

gtgcatcttt 

actaatagca 

tgtctcctgt 

ccccccccat 

agtctggcca 

cgggctgctt 

cccatccttt 

ctcttccatc 

agagatgctt 

tcaaagtttt 

agagatggag 

aggatggggg 

gcgctaggag 

ccctccccca 

attggaatgt 

cccaaaccga 

agctgtgcag 

tttatctctt 



tgttgggcaa 

ggagggcgga 

gtctggccag 

tggatgctaa 

gcctgtggcc 

ggctgagccc 

ggtgccatgg 

ggaagccaca 

ggtctcccct 

tgaggtttat 

ctcccccact 

cgggttggga 

ctgcagctca 

ctctctcagc 

ggctggtcca 

ctggcaatgt 

aagggatggt 

gggaggtaac 

atccccagca 

ggtacagtca 

cagcctgggc 

ctggcaagag 

ctttcatttt 

ttgtatgata 

ctctttcaaa 

aattccagtc 

gcacataaaa- 

aaagggactg 

ttaacatgca 

gtgggtgcaa 

tctctgcctt 

tgggtgtgtt 

ccatgttcgt 

tttatctgcc 

tcctgagttg 

nnntcacang 

ccnagcctcc 

cccccccccc 

nnnnnnnnnn 

nnnncagaca 

tcacatcccc 

agagcaaagc 

gaaccagaca 

gtgcaactgg 

ggagcctgtg 

attaagccag 

atatgttgtt 

ctgatacata 

gcatgtgtgt 

attatttctt 

tgcaattctt 

gctgtggcca 

gcagacaatt 

agcaaacacg 

tatttacata 

aaaatcttaa 

atacttattc 

agggagggtc 

tggaaccagt 

gtctctctgt 

gttgctcacc 

tcatggcatc 

aaacagctcc 

ggcagatggc 



catttgccca 

gcttggaggg 

ctgaagaagg 

gcaatatggg 

caggcagatt 

aagaggcaag 

gagcccatgt 

catcccctcg 

tacctaacca 

agagacctca 

gtcttctcta 

gtgtgtgtga 

gtaacacagt 

tccttagaga 

tttgtaagct 

tctctgtcac 

ctcccctggg 

tctgttgcta 

ctacagaacc 

ggaagagcag 

tatgtgagac 

gagtattacg 

tttttacatt 

tgaaagtagg 

gaacctgggt 

tcaggggatc 

taaataattt 

aggggaagtg 

gaatacactt 

gatgtaagct 

attttcattt 

aagccctcac 

gtgggttctg 

gagccacctt 

cctggacagc 

aggacacacn 

tgcctccccg 

cttccccccc 

nnnnnnnnnn 

cacacactca 

acgctaggac 

atcctccccc 

acagcagcct 

ggaggcaaag 

attatgattt 

atttattcaa 

ggaggagcag 

gtggtagcta 

gtgtgtgtgt 

ctgtgggatc 

gccttttctc 

gccctttcac 

atcctcaata 

ggtgttcttt 

aagtcaaata 

ctagaggatg 

tgatgttgct 

cctcctcaag 

cagaagtgtc 

gaaccgccag 

ctccactgct 

tgccaatcag 

gttatggcgc 

ttctctcttt 



cgtggggttc 

gggagtcctg 

agccatggca 

aggcagtcta 

tccagtgagc 

tggtgataat 

gctcatgaag 

agttccaggc 

gtctcccggc 

aaggctgagc 

gctgcccctc 

agccctctcc 

aacacaggct 

gcctgttttc 

tatctgagga 

tgtagtacag 

gttcccacac 

gagtacttgc 

caggcgtggt 

aagttcaagg 

tttgtctcca 

taaagtaaat 

ctataaaggc 

gggctggata 

tcaattccta 

tggcaccctc 

ttaaaaaaag 

ggaggggcat 

ctataaaagc 

gtgcatatgt 

aaattataat 

ggcaatcaga 

tggatggaac 

gctggccttc 

aagtccccaa 

gcttantggg 

cccctcgccc 

ccccnnnnnn 

nnnnnnnnnn 

aattaaatat 

tctaaggagg 

ctccacacaa 

agccaagccc 

ccccagccgg 

ctgctgattt 

tctaaagatt 

agatgtgata 

tgcctaggcc 

gtgtgtgtgt 

tccagactca 

ttgccagccc 

ctggagtcct 

actagccaat 

ccttgagtct 

gctccctttt 

ggtgtagagg 

ttaaaaggta 

ctaatgaatc 

tgcctttgac 

cattaggagc 

cagaaaacct 

gaggccaaag 

gcagaaaaag 

gatgctttgg 
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96300 
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96720 

96780 
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96960 
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gccttacctg 
gttctcgaag 
cccccgctgt 
tcggaaattt 
gtagagtggg 
gaaagaggtg 
ctggcttccc 
tattaggacc 
cctggcttct 
ccctaaaggt 
ggacgctcct 
tactttcttc 
ctgagtgaga 
aacaggaaag 
ttacctttcc 
gcttcaggct 
aacaagggga 
agccagacat 
atctgccatg 
atgctggcct 
gctccttgga 
tcttagctgc 
agaccctcca 
tccacaattg 
gcaaacccca 
tgagccctgc 
gatccatgct 
ctcatcctgg 
gattaaaatc 
acaaagccct 
ttctaagtgg 
cccccccccc 
ataattctcc 
gaagcacgct 
ctctcaagtc 
atttacactt 
ctgaggactc 
ccctggcctc 
ggtacaccgg 
aaaggcttcc 
aaggtaaatg 
ccttgttacc 
ctgctgagct 
gtctccccac 
ttcatttgca 
tataagaaaa 
ggccaaacct 
ggcccggcac 
gatgaatgaa 
tgtgtgaggg 
tcctggggcg 
tgtctgtaac 
acgtggtcca 
gcctctgaag 
aaaacaccca 
caaagtgttt 
ctcagagtga 
agattcacgg 
cctctctctc 
ccatgtagga 
tgaacgagtg 
aaggcaagag 
ggttctatct 
cccaaccaga 



ttactgcctg 
ttagaaggaa 
gtttctaaag 
ctttttaatc 
agttccgtgg 
ttgtgggcct 
actctcaaga 
agagggactg 
ccaggagccc 
atggggatga 
gtaccccaga 
tgttgttgtt 
aaaaagagga 
ggcaggcaac 
tgacccccag 
aagtaaacag 
ggaggcagtg 
ttcaggcacc 
cctgccatct 
ggaggaaatt 
aacccacgtg 
ctcaagcact 
gggagtcatt 
aggagacgcc 
gccactgagg 
ctcccagggg 
tttatgacat 
ggaccaggga 
attttttttt 
cctgtgctga 
agtgtgatgt 
aaacttactg 
ccatctccct 
gaagaggcca 
tacacggagc 
catgccgcag 
catcaagcac 
cttacagatc 
gggctgcagc 
taggaaatca 
gatgttattt 
atgcagggct 
gagggagggt 
ccccgccccc 
gcagcttaca 
agaggaatga 
ggccgagaga 
aacagggccc 
tggatcatct 
agatgtcaac 
tcagcttcag 
ccgcagaggg 
gacccctctt 
gtatgtcacc 
aacgaaagaa 
ttcagcatga 
gatgcggagg 
ctggcagaaa 
ttttcaaggt 
cctggggcct 
ataggaaccc 
caggaggcat 
gggaggcagg 
gagaattcaa 



cacttgactt 
aaaaaaaaag 
agctgttttc 
atagcaggag 
ccacagagag 
gtgcacacat 
tgaggtgtgt 
tgtgtgtgga 
acatgccaac 
gggaccaagt 
ctcagccgcc 
tttggaagtg 
gagggaggaa 
aagacaatga 
ggcttagcca 
ggaagagttg 
gccaggcagc 
tctccttccc 
gagagaggcc 
ggtctttagg 
acagagctcg 
gtaaggttta 
gcctaccctt 
agacctggca 
aacttgcata 
acaactgggc 
agaaaggcct 
aggcgagcat 
tctgctccat 
aattagaccc 
agtggcagga 
tctcttaacc 
cctctggtgg 
gactcgggat 
tgatttattt 
cttcctgcgt 
gctgccttgc 
atgacctcct 
ctctctatgg 
gacgctgcta 
aaatgttgcg 
gtggacgggt 
ggggacacac 
gccctcactc 
gggcttgttg 
aagagaaagc 
atccatgacg 
gccacaagag 
gtccttagag 
agaggttccc 
tcggaacagc 
aagggcgggg 
gggttgatct 
gctggctcag 
aaactttgtt 
caactcactg 
gaagtggatg 
atggctgaga 
ggtttttgga 
gtgcagatgg 
aggcactaaa 
ttgagggtaa 
agtcccaatg 
gatggcagag 



gacctaggca 

ccccaaacca 

ttcccaagct 

tcccaattag 

cagaggcaat 

gtgtgtcaac 

gcaccccagg 

ggggtgttgc 

aaacaggctg 

gctttgcaag 

acccagggcc 

ctgatgtcaa 

aagggggggg 

ccacaaggtc 

atatagctga 

gacatgggtc 

catgcccacc 

tgggtgccta 

actgggactt 

gacactgaag 

catgacaact 

ggagagcccc 

ctgaactcta 

ggggagcaag 

caagaaactg 

aacagatcct 

cagtctcagg- 

cttctgctcc 

gaactcatac 

cgaaaaatag 

gcgcaggatg 

tctcgagtcc 

gctggtggaa 

gccatgtaag 

acctcccgtg 

ggcacggcag 

caggatgaca 

gccccgtgag 

gggaggctga 

gtaattaagg 

tcatttaaag 

ggcaattagg 

cttccggtaa 

cccacccccc 

cccttaccca 

cagaaaagtt 

gtccgcgcgc 

cttcttgagt 

cacaagccag 

taaagaatga 

cggaccgtta 

tcacccagca 

gctcctgaaa 

ccagcttatc 

tcaagagcag 

cccgtgacaa 

acaacagctg 

gccttgggct 

aatgtccttc 

ccctgtgttc 

cacacaatgt 

aggagtgtaa 

ccagcaaata 

ggacaggctg 



aaaatagcag 
caacacaacc 
ctacagcgtg 
cgtgttgggt 
attcagcata 
gcacatgtgg 
ccttttgatt 
tcacagtgca 
cacactcttg 
acagcaggca 
agctctgatc 
tgcagaattc 
ggacgggacg 
cctgtaacta 
gacccagtct 
tccattctct 
gatgccatcc 
gaggtgctgt 
ggtagagagg 
gcagtttcct 
tgccggctct 
agatgtaagt 
acatggtcca 
cctttgtttc 
cctctgggcc 
tccaggtggc 
atttcacaca 
ccccaaacaa 
agttttcaga 
gttagctgac 
ggctgccagg 
ctgggtttct 
agcgtaatga 
tacacagcat 
aaagagacaa 
caccccctcc 
gcagcccact 
gtctgtcacc 
ggaaatgaat 
agcgaagcat 
agtgtcctgg 
ctggcacggg 
ttgctgctgg 
acttctttcc 
gaaaatcacg 
catattaggt 
atataaccct 
tgccacctgc 
gcctgattct 
tgcttctatt 
ccttagctct 
ttgccactcc 
aacagtgttg 
tccccggtgc 
agtgtggtgc 
ccagtacttg 
tatccaggtc 
gcatccctcc 
ctgtgggttg 
ctggtgctgc 
ggttgtatct 
ggactgtgta 
caatggactc 
tctgagtttc 



cgagatacag 
cggaagtgtg 
gtggctctaa 
aatctttcaa 
aagccctaga 
cttgtggagg 
ctcaaagctt 
gaaacccaaa 
ctagtacatc 
cagagttctg 
tggcttgacc 
agcagagtgg 
ggccgaggcc 
cactaactgc 
tggtgctgtg 
ctcctcatcc 
ttctgggagg 
gtctgcatcc 
ttctccacac 
ctgttcagtg 
caactcccat 
atggatggga 
gcttttccat 
tgacccattt 
tctcctggac 
tgcagtgaca 
ctgtatttcc 
gtgtgggaat 
taccgaggag 
aattacttgt 
gctgcagtct 
tgccgggatg 
atcaacgctt 
cgccagccac 
caatcatcat 
ctctccgctg 
ctcagcctct 
cgaaaaccac 
tccgtaggta 
aggtgcgtga 
tgcttcagtt 
tagagctcac 
gcagctctgg 
tgacagctct 
ttggaagaaa 
tcggatctgc 
gtggcaacag 
caggagacag 
ccaatattga 
tccatgctaa 
gctgttctcc 
ccccaccctc 
gctcaagttt 
tttcaagatc 
caactctgat 
gctgttgtgg 
caaacagagt 
tcccctcctg 
tgtgcctttt 
tgttgagatg 
gactagaagc 
aagagatgag 
tcctggtcga 
ctatggctgc 



98340 
98400 
98460 
98520 
98580 
98640 
98700 
98760 
98820 
98880 
98940 
99000 
99060 
99120 
99180 
99240 
99300 
99360 
99420 
99480 
99540 
99600 
99660 
99720 
99780 
99840 
99900 
99960 
100020 
100080 
100140 
100200 
100260 
100320 
100380 
100440 
100500 
100560 
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100740 
100800 
100860 
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101220 
101280 
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101880 
101940 
102000 
102060 
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accgataaat 

gactagaaat 

tcacactgtg 

gggacacatt 

gagaaatggc 

ctagcagcca 

ctaatacata 

cacacacaca 

ggctcagcaa 

catggtggct 

acagatacaa 

gaaagaaaga 

aaagaaagaa 

acttccaggg 

gaagctgtag 

gtagacaacc 

caacgcctca 

taggacactc 

agacctatct 

catccaactc 

atctgctctg 

gaggactttg 

ctctaccctg 

ctgtgcccac 

agactgagaa 

aaatgtctta 

tttgtgtgtt 

taactaagga 

caggtgtgtg 

tgcatattag 

tttgagacaa 

accagaagaa 

atttgaactc 

agtccttgag 

gcttcagcct 

cttccttccc 

cttcccttcc 

cttcctttcc 

taatcctggc 

nnnnnnnnnn 

nnnnnnnnnn 

cgacaacacc 

ccaattttta 

atagctcctt 

gtcctagaac 

atgatagact 

aaagtcatcc 

ctcaaaaaga 

gggaggcaga 

ccaggacagc 

agtgaacccc 

tgtttaccca 

tgtttgaatg 

ggcagtattg 

gagtttaaag 

tggcctgctc 

ctgtttcctc 

tatactacac 

tctagagatg 

cctccatctg 

atgagacctt 

tggtggtggc 

agaaaggctt 

aataatgatg 



ggtcataagc 

ctagctcaag 

acatgatgct 

catggaccgc 

ctggtagtta 

tgttgggcag 

catacataca 

cacacacaca 

ttaagagcac 

cacaaccatc 

tgtactcata 

aagaaagtgt 

agaaagaaag 

ctataggcaa 

acaacctgtg 

tgtggcctgg 

ccctctgtgc 

tacatcaaga 

ttacatgtga 

actatgtgac 

agagtcccct 

gatctgcaaa 

aaaccctttt 

aagaagtccc 

cattgtgtct 

ctgcctttat 

tgtttgtttt 

tgactttaaa 

ccaccgtacc 

gtaagcatgc 

tgtctttttt 

ggcattggat 

aggacctctg 

acaatgtctt 

cctaaatact 

tttcccttcc 

cttcccttcc 

ctctctctct 

tgtcttggaa 

nnnnnnnnnn 

nnnnnnnnnn 

ggcgccgctg 

aaaccagtgt 

actttgtttt 

tcaatgtgta 

gatggtgtaa 

ttggctatgt 

aataaatgta 

ggcaggcaga 

cagggctaca 

aacagtactg 

ggtgtaccca 

agagaaaagt 

gctggagagg 

cttcctccac 

ttggctccgg 

ccccacccct 

tggtgatctt 

tcctttcatg 

aactttagga 

ggagcctgca 

agtagtagaa 

tgggtgcaca 

acgatgaaaa 



agagtagagg 

gttgtgtgtg 

gtggtctgca 

aggttacaca 

agtatgcttg 

cttacaacta 

tacatacata 

cacacacact 

tgactgctct 

tgtaatggga 

tacattaaat 

aaacgaggaa 

aaggaaagga 

cctgtggcct 

gcctggggaa 

ggaagctgta 

ccaatttcct 

tgatcttgtc 

ggtcactcca 

agagtcatct 

gcctgcccgt 

gccacatctt 

ctctgaggtg 

ggggagaagg 

cacctaaaat 

tccttttcct 

cttctgagat 

cttctgatcc 

ccagctttat 

taccaacttg 

tttttaatta 

cctattagag 

gaagagcagt 

gctatatggc 

gggattacat 

cttttccctt 

cttcccctcc 

ctctccccct 

cttgctctgt 

nnnnnnnnnn 

nnnnnnaaga 

cctccactgc 

ttcaagccgg 

gctttgacgt 

aattaggctg 

aactccactt 

tgtgagtttg 

ttgccgggca 

ttttctgagt 

cagagaaacc 

ccggacagtc 

caaggtgtgt 

ttcccatcag 

tgatggggag 

gaagaccact 

cctcagttta 

ctccctgtct 

gactgtattt 

gctgggacct 

aacttcttgg 

ctttgttaag 

ctctgatgaa 

tggctataac 

taattctctg 



aaaaccacag 

gcagggttgg 

gatgtgttgg 

tgctatttaa 

ctgatcttcc 

actatgactt 

catacataca 

ttaaagaaaa 

tacagaggtg 

tctgatgccc 

aaataaataa 

aattcctaat 

attctgaggg 

ggggaagctg 

gctgtagaca 

gacaacctgt 

gttctctaag 

tcaagatgtt 

caggttctag 

agagatttgt 

gggaactccc 

ccaaaaccat 

gcttttagag 

gacccaaggg 

cggtggtcgc 

ccgctccatc 

gtagcccagg 

tttcttccct 

ttgagactat 

gctatattcc 

tatgagtaca 

atggttgtga 

cagtgctttt 

acatattggc 

gtgagccatg 

ccctttccct 

cctcccttcc 

ctttcttttc 

agaccaggct 

nnnnnnnnnn 

agaagaagaa 

catccacctg 

tacactgaag 

tttgtgacat 

gccttgaact 

aggaggcaga 

aggaccaacc 

tggtggcaca 

tcgaagccag 

ctgtctccaa 

tggtgtcttt 

gagcagctct 

ctcgggtgtg 

gtgtagcctt 

gggctgatct 

tctaaaattt 

gtattccctg 

cagtttacct 

ggctcagaaa 

cttaaaggtg 

caccctgggt 

cagttagtta 

tcagtggcga 

tgactgttct 



acagaaattc 

ttcctgggtg 

gctgcatcca 

aaactccaag 

agaagacctg 

ctgagctcca 

tacatacata 

aaattctggg 

ctgagttcaa 

tcttctggfcg 

gaaagaaaga 

taaaaaagaa 

agaatctgcc 

tagacaacct 

acctgtggcc 

ggcctgtggc 

gacacatgcc 

taacaaaatt 

acatattttt 

gtccaggaca 

cagtggtcct 

tttcctcttt 

aggcaggtct , 

ccagtgctga 

aaggaccaag 

ttactcctca 

ctggccttca 

ccacttccag 

gattcaggct 

cagcctttct 

ctgtatctgt 

gccaccatgt 

aaccgctgag 

ctcaaactca 

gtgtttggct 

tttccctttc 

cttcccttcc 

tttcagagag 

ggcttgnnnn 

nnnnnnnnnn 

gaagaagaag 

agacaggact 

tagtagtccc 

ggtgtgatgt 

tgcttctgcc 

ggtgggagga 

ttggctacat 

cgcctttaat 

cctggtctac 

aaacaaaaaa 

cctaagtctc 

atacccagag 

tgaatactcg 

ccggtggaga 

tgttaccaac 

acatgttacc 

ccctctagtg 

cttgttctct 

tttctaaggc 

tatttctgac 

ggtggtggtg 

ttcaaggccc 

gagacgtgct 

ccccacttcc 



atgccattga 

ttcaaccttt 

tagctaccct 

ggaagggcta 

agctctgttc 

aagctctctt 

cgtacacaca 

ttggagaggt 

ctctcaacca 

tgcgctgaag 

aagaaagaaa 

agaaagaaag 

ccttttccta 

gtggcctggg 

tggggaagct 

agcatcatgt 

atcaaatgca 

acatctgcaa 

gaggagccac 

gactggctgt 

taagggccct 

tggagagcta 

cagcagggca 

actatcgctg 

caggctctat 

tttttgtttg 

gctcactatg 

agtcctgggg 

ccatacttca 

ttctttcttc 

tttcagacac 

ggttgttggg 

ccatctcgcc 

gaatccttcc 

tctagccttt 

ctttccttcc 

cttcccttcc 

tttctctgtg 

nnnnnnnnnn 

nnnnnnnnnn 

acaacaacga 

caaatccaga 

acttgggatt 

agtcttggct 

tcctgctggg 

tcagaaattc 

gatatcctat 

cccagcactt 

agagtgagtt 

caaacaaaaa 

ctttcaactc 

gtgatacggt 

gtccccagtt 

tgggctttgg 

agaattggat 

tgatcaaaaa 

gtgctggctg 

ctctgctgac 

actgagcctt 

ttagtatgca 

gtggtggtgg 

atctagaaaa 

tcccatgaat 

ctctctctca 



102180 

102240 

102300 

102360 

102420 

102480 

102540 

102600 

102660 

102720 

102780 

102840 

102900 

102960 

103020 

103080 

103140 

103200 

103260 

103320 

103380 

103440 

103500 

103560 

103620 

103680 

103740 

103800 

103860 

103920 

103980 

104040 

104100 

104160 

104220 

104280 

104340 

104400 

104460 

104520 

104580 

104640 

104700 

104760 

104820 

104880 

104940 

105000 

105060 

105120 

105180 

105240 

105300 

105360 

105420 

105480 

105540 

105600 

105660 

105720 

105780 

105840 

105900 

105960 
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ccctagctct 

cggaacgtta 

tggctagtca 

tcagcaagaa 

gctaagtggt 

ctacactgtt 

aactggaatt 

gggatatttg 

gtagaggcca 

ctctcgatca 

ttggccttga 

agaggcctcg 

gcttgttgaa 

tctgccaagt 

cttggagctg 

taagtggcat 

atcagctgac 

ggattctgac 

tccctgcatc 

ctccccagca 

atctcctcag 

ttgatgaaga 

ccacccatgg 

gtaagaagga 

ctggggcgga 

tttgaagcaa 

ctatctcagg 

gctcttagaa 

ggagggagtg 

gccaagtcat 

cccctaggga 

ggcctgagtc 

ggtttctggt 

tctctaaaag 

ttcagcgact 

ctagacctcc 

ctcgagctag 

gaaaaaaaaa 

gtggctggct 

ttagctcagg 

tgtagccctt 

acagagcgcg 

gcactgcaaa 

gctggtctaa 

ttggttagtt 

ccaatcagag 

tgggtgtggt 

ctcgtgcgtc 

ggaagagaca 

agcaaccccc 

aaccgcggta 

ggggagggga 

ccgagtagtg 

ttggagaagt 

cgccctttgg 

gaggggtgcg 

cctggcaggg 

ttatgtaagg 

aaaggttagg 

ctggaaactc 

gtcacctttg 

ctgactgcct 

cctctctccc 

cgttcccacc 



tatctaccga 
gtcacttccc 
ctgaggaaaa 
agaagctgac 
ttcaagaagt 
ttgtagatga 
tgtttggcta 
gggacacttc 
gggtggtact 
aaacatcaac 
ggtgggtggg 
agtgaggagg 
acttttcttt 
cagtcccggg 
gggcaacaca 
ctgagaagtt 
ctaggaagga 
ccactaggct 
atccgcagga 
gtggcctccc 
aagtctgaat 
aaagatcatg 
cggggttgag 
ggatgtggtt 
aggcaggaag 
gcccagcagt 
cttccctctc 
atgactcaaa 
aagaggtggt 
tgggttcatt 
aggagccgcc 
agagaagcgt 
tgatactact 
tgagggcgaa 
aagacgagac 
agtgaagatt 
ctctccagtg 
aagtcctatt 
tccatcttta 
cctcctgtga 
taaggacttc 
ctgcattctg 
aatacacgcc 
gcaagtcgcc 
ggtaggcgtg 
cccagacgcc 
cacgttgccc 
acgacgccgc 
gagcggtcgg 
gcacccagca 
accgggtgat 
gagggggcgg 
tgtctggacg 
aactggggag 
tttgaggatg 
gaggtgcatt 
aaggggtcgc 
ggggagggga 
gtttcaccgc 
tacccgcacc 
cacctccccc 
gggaagccgg 
cacctcctgg 
tccctctggc 



atccctgcac 

ttcgcctact 

cggcagctgc 

agcaaatgtg 

ttcttcttaa 

agaaaacacc 

attcagtggt 

tggtcgtcat 

gaccttccta 

agtgctaagg 

aggaggttag 

gaacagggct 

cttccaagct 

acttcgcact 

tttacctggg 

aaaagtgggg 

agagagcaac 

tatcttccac 

gcccctggag 

cactaactga 

aagtaaggga 

agaagcatac 

ggggaagggc 

agggctgatt 

tctccttaaa 

caccttagtg 

atttctttgt 

agtcaagaac 

cccagtcttg 

tcggtttttg 

tcttcattgg 

ttcggagggc 

ttggtaaaga 

tcctttgtta 

attcactaga 

gggcctgtgc 

aagactgggg 

tgtggaaaaa 

gttctactca 

atcagttttg 

cccgcgaccg 

ggaagctgag 

tatacttcct 

taggccgagg 

gcttagagaa 

tgaatgggcg 

agcaatgggc 

cagctgatcg 

ccgtgcggac 

tcaggtgggt 

ggggaaagta 

cggggacagg 

tttaaagaga 

gaatatgaga 

cccgagctga 

gcctctaccg 

tgggcggtcc 

aaggagtagg 

gggttgggcg 

ccccctccca 

ctagtatgtc 

ctggctgggt 

ccaactcttg 

cctagtttca 



aagcacccat 

tcagtattgt 

ctgtgggcca 

ccaactgtgg 

ccccctacaa 

attccgaagc 

tctcaaccag 

aattgggatc 

ctatgaacag 

ttgagaaatc 

agagtggaat 

cctatgctag 

tggctagagc 

aagtttgtcc 

tcttcccagg 

tgagtgggta 

tgaagcagca 

agcctttctt 

cacccacatt 

cagtggtgac 

cagatgttgg 

attttacccg 

agaaaaaagg 

cgtcccctgg 

gacacaatat 

gagcccatac 

ctcagatgct 

cccaggccac 

tccctttaaa 

cccatccccc 

tctccacctt 

gggactgaat 

tcttccccta 

agaacgtgcc 

aaagatttca 

gggtgacatt 

ccgtgcgagt 

aaaaaagact 

ctcctgttgc 

aggctaaaag 

agtcagagat 

tgtcaccgta 

tctgctcttt 

gttagccaca 

gctcctccag 

ggagtaagca 

ggtgattggc 

gagactggag 

aggtcgcagt 

gtgatctggg 

gggtcctgac 

ggcgctcttg 

gagtcccgga 

ggccagaggg 

ccatttagcc 

gcgctgactg 

ggcccctcct 

gggcggcggt 

gaggttgggg 

gcctaactgg 

cggtagagag 

ggggcgcctg 

gcccctcccc 

gtctccaact 



ggggtttaca 

gtttccagaa 

ccagcccatg 

gtctgctgga 

gaaaccacaa 

tcactgccag 

ggtggatttg 

atgtgctact 

ggcagccatg 

tcaggctgaa 

aaaatcagaa 

ggataatgga 

cctgctcaat 

tggagtgacc 

agtgggttaa 

aaggcaggag 

aagagctggt 

gtttaggctt 

cagccggccg 

aggaaatatc 

ggagaggcgt 

ctatggttgg 

agatggagaa 

ggtcgaacaa 

tctgaacgtt 

ttaatttaac 

ctcacttaga 

aagtttctct 

taagcaattc 

gcctttcaga 

tgaaatcact 

gggtgttaat 

atttttaaaa 

ccttgagaag 

ctaaacccac 

tgtccctata 

gacagtggtc 

tcgggtgttc 

ttcgcgtgct 

aagttccaag 

cagtttaaaa 

agaacttcat 

aaactgtagt 

ccttttcagc 

gcaagggggt 

gaggtgctgg 

cctgggtggt 

ccggtgtgtg 

gattttgctc 

gacccggtca 

ggccacaccc 

ggagaggagc 

caggagtcgt 

ccgggggcgt 

tagggaggat 

ggtcagggcc 

ctcgttccct 

gcggaggcct 

ggggcggaca 

ctgtcttgga 

gcccctagcc 

ggttagtcat 

acggcctccg 

catttggcct 



gaatctgggg 

gtacccattt 

ccaagtgagg 

tttctactgt 

attttattat 

tcagcactgg 

gactccccag 

ggcacctagg 

tacataaatt 

accctgtcat _ 

gggccaccac 

gaatagggca 

ttcccccaac 

ctgactccag 

aagtcaaaga 

gaccccatca 

ccaggagact 

gggctcagtt 

ccaggacagg 

tcccattcca 

cactcttggg 

ggttccatta 

ggacagacac 

tcagctgtta 

gaactcagga 

acagaagcgg 

aagcttagat 

ttgggggtgg 

agcagctttt 

ctctgattgg 

tccctaagta 

cttagaaccg 

agacgcttcc 

ccgtgggctc 

gagggataga 

ccccgaagac 

cctatacccc 

tgctgcatcg 

ccaccttcgc 

aaggaggggc 

atgccaactc 

tgaccggaat 

ttgacgtaaa 

cattggccag 

ggcctccttg 

cgcccccgag 

tcattcgcag 

ctgggcgctg 

ctctgtccac 

tcccgggggg 

tgcccttctg 

ctggactctc 

ggcagaaggt 

ctaaccccga 

ctggacgagc 

agttcaagtc 

cccggggatg 

tatgcaaccc 

ggaggagtgc 

cagagagaag 

cgggcttggc 

cgctgggctc 

gttaggctaa 

gtcaccctgg 



106020 

106080 

106140 

106200 

106260 

106320 

106380 

106440 

106500 

106560 

106620 

106680 

106740 

106800 

106860 

106920 

106980 

107040 

107100 

107160 

107220 

107280 

107340 

107400 

107460 

107520 

107580 

107640 

107700 

107760 

107820 

107880 

107940 

108000 

108060 

108120 

108180 

108240 

108300 

108360 

108420 

108480 

108540 

108600 

108660 

108720 

108780 

108840 

108900 

108960 

109020 

109080 

109140 

109200 

109260 

109320 

109380 

109440 

109500 

109560 

109620 

109680 

109740 

109800 
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ctgttagagt 

gggtcagtgc 

tcagcagcca 

gagactgatc 

ggaatttgaa 

aggcaaagtc 

gaaccaccct 

ccatgctgta 

cttggaactt 

cagatgtacc 

cagccaccct 

tcctgagagc 

aaccctcctt 

atttctgaag 

atgccgaaga 

aggggcacaa 

gaaggtactg 

acccagcttt 

ccctcacctg 

aaagctgcca 

gcagcagctt 

tggggtgatg 

gcccagtgaa 

ctccccaaga 

tgtttactga 

cacacacaca 

cacacacttt 

actccgttct 

tcgagacagg 

tggcctcgaa 

tgccaccacc 

tgggtttagg 

ccatagataa 

tcttcttctt 

caccccctga 

gtcccaggca 

ccaaggcttt 

tgccatttat 

gcaagaaatt 

cctgattcat 

tcctaacaga 

ggagttctag 

ctaatccagc 

cgggctgggg 

cgcttgcttt 

atggctctgc 

caacagaggc 

acattggtca 

gcctcagtga 

tgaagggcct 

aggagcatga 

gtaacactgt 

aacatgcacc 

ggtctgcaat 

ctgctataca 

caaaataaaa 

tccctgatca 

tggcagtacc 

gtatcctaaa 

tggaacgcag 

tagggcttgc 

gtcttacaga 

gtatgagttt 

accatccaat 



aggctagaag 

cctgggccca 

gtgaaccctg 

acctaggaga 

agttgcccac 

tccttggccc 

ctcgggctcc 

gggccccggc 

ggaacaaagt 

cggaatgtgc 

acccatacct 

ccctgctgac 

gctgctgaat 

tacatttttt 

tgctgggctc 

agcaagtgca 

aaagtcagct 

cctgtttttt 

gctgagccgc 

gttgaagaac 

tgcctatcat 

gctttatggg 

agatttaggt 

ttagttgatg 

gacttccttg 

cacacacaca 

cctttccatg 

aagtagtcat 

gtttctctgt 

ctcagaaatc 

gcccggctgt 

ggtacttacc 

gtaggcagga 

ctcgtggatg 

ctactcaccc 

agctcgtggg 

ccctccacct 

tttctgcctt 

gtcaggggag 

tctcacgtat 

agatgggttc 

cctgttcccc 

cccactgcag 

aggctgagtc 

ctttcctcct 

tgtcctcatc 

aggcacaaga 

gagaggcacg 

aagacacgtg 

ccaggaagta 

gccacctctg 

gggaaaacat 

aagccctggt 

cccggcactc 

atgagttctg 

taaagcattg 

taatacaaca 

aaccattgtg 

ataaggattt 

ctgtttgttt 

ctaagtaaat 

agaaaaattg 

gtgactgttt 

tgagtggaaa 



ctgtcatggt 

ccccgccccg 

tggcctgaac 

aggaagatcc 

cctagtgtaa 

ttctatctgg 

cagccctcta 

ttttactgct 

tcagacgtgg 

aagcggaatg 

ccctcccctc 

cttccaactc 

tccctaagaa 

tttttaactt 

cttagcaggt 

tttagaagcc 

agagccaggt 

tttttcctcc 

agtagttctt 

tgttgccctc 

ccggaaggtg 

agggaaaccc 

taaaggcact 

tctgtgtggc 

taggcctgcc 

cacacacaca 

aggtccaaaa 

ttgtgtattt 

atagccctgg 

tgcctgcctc 

atttacattt 

cttacacctg 

gggcattaaa 

gaaacgaaac 

agctcagggg 

aggctagtcc 

aaagcaaaac 

tgctttccag 

atgggttgta 

gggcttggtc 

agggggtaca 

atttgtgaga 

tctaagctga 

ggctcacaca 

cccttccctg 

caacatggag 

cagtggagga 

tagaagcctg 

cttccagcca 

tggtcccatc 

aaaggaaata 

atctatgact 

tctgtcttct 

tggaggtggt 

agccagcctg 

gttagtaatt 

agcaaatgaa 

agagatgcct 

ggttataaaa 

ggaagtccat 

tatattcatc 

agtggaaata 

ttaaaatatg 

tataaatact 



gccagagagt 

cagccaaggg 

agagctatcc 

gacaaagttt 

tctttccact 

cagtggccat 

gcctgccacg 

gattcatgcg 

aggggccggc 

cctggcatct 

ctgcctttgg 

tagtgcccct 

caagtcattt 

gggacttggg 

tgccaagagt 

tcttgcttct 

ttggatggcc 

ccttcctttt 

cagtggcaag 

tgcccctggc 

acagaactgg 

tggtcctctg 

gtctataaat 

agtgggaaga 

tttcttatct 

catacacaca 

gtaaatgtac- 

actttttttg 

ctgtcctgga 

tgcctcccaa 

ctttatttat 

tggatttttc 

agtccaccag 

agctcttcac 

gattaggatg 

tctactggct 

tgtagctctt 

gatagtgaga 

tgatatgagt 

tctgattgtg 

ggaggctgtt 

aactgaaagt 

gggataggat 

ttgcgacaaa 

gccacagtgt 

cctcagaggt 

cctggcctgg 

gagaacacca 

tctcctctca 

tctaccctgc 

cacagcaaat 

ggggttgtag 

gcattgcata 

ggcaaaggag 

ggctatatga 

caaagaaagc 

agccagaaga 

ttggacctgg 

tgttcatctc 

tccttttctg 

tgatggtggt 

taaatattga 

aacacatact 

gaatactaaa 



tgatggagca 
cacctgcttg 
tgggcagaga 
atacttccca 
ctctgaaaat 
gtccttggac 
cccccagccc 
ttggaactgt 
agacagcctg 
ctagtcctga 
tcagctgtcc 
cccatttcta 
gagttgatca 
ttctacaccc 
tgccagctcc 
tattcaagaa 
tctgggtcgc 
aggaacctgt 
ctttatgtcc 
ttcgtggagg 
ggtgggaagg 
gggagccctt 

tggggaatag 

aatagaagga 
tcatcatcac 
catacacaca 
tcaggaaggg 
tttatttgtt 
actcactttg 
gtgctgggat 
ttttagtctg 
cacctgtata 
tggtgactca 
atgaactgtt 
gaaggaaagg 
tctcaccatg 
ggttgggttc 
ctctgctcaa 
cccttctgct 
gttcaccttt 
tgttgtattt 
cataggggag 
gtgtaaggga 
gattgccctt 
gtccctccag 
gagaaagggc 
aaccacaagg 
ggaaagagag 
ggacctgcct 
agtttctata 
tcaaaaagag 
ctcagttggt 
aaactgaaca 
cctacattca 
gactgtctca 
agatgtggct 
aggctcctgt 
tagtttgctg 
a gggttgtaa 
ctgtcatgaa 
gttctgtgca 
atactaaaaa 
tgtaatatat 
aagattatga 



gctggtcaga 

gcacaaactc 

gaagtggaca 

agaggctttt 

agaaatccca 

tgactgtgca 

cctccctgag 

gggggcgggg 

gaattcatac 

ggaagctgcc 

tccctcagac 

accctacaca 

cagagctcat 

tgccctttga 

tagtctgtaa 

cccctcatta 

tggccctgtc 

gcctcccaca 

tgacccagct 

aagaggagaa 

tctggacagc 

acccccactg 

gtgactccac 

aaagtctgtc 

catgccaaca 

cacacacaca 

ggacattgaa 

tgattgactt 

tagactaggc 

taaaggcgtg 

gcccagattt 

atggggaatc 

gagcctgggc- 

gtccttcccc 

ctatggttaa 

catgggtggt 

tagcaaccac 

tactgtgcag 

gcctctagct 

ggcccagtct 

gacaggagga 

actagatcat 

ctgtagcaga 

ccctcgacct 

cactgggtac 

agcctggaag 

gcctatccgg 

cagccagcca 

tcctgggaga 

aacagcctca 

attcaaatgt 

aggtttgctt 

ggttggccca 

aggtaatcct 

aaaaataaaa 

gaaaccgttt 

gccttgtgtg 

tcttagaaat 

tagagaaaaa 

aatgtatagc 

gccatccaaa 

gattataaaa 

ttttttaaaa 

aaagtatgag 



109860 

109920 

109980 

110040 

110100 

110160 

110220 

110280 

110340 

110400 

110460 

110520 

110580 

110640 

110700 

110760 

110820 

110880 

110940 

111000 

111060 

111120 

111180 

111240 

111300 

111360 

111420 

111480 

111540 

111600 

111660 

111720 

111780 

111840 

111900 

111960 

112020 

112080 

112140 

112200 

112260 

112320 

112380 

112440 

112500 

112560 

112620 

112680 

112740 

112800 

112860 

112920 

112980 

113040 

113100 

113160 

113220 

113280 

113340 

113400 

113460 

113520 

113580 

113640 
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tttgtgactg 

aaagtggatt 

attttcccct 

ttaaataaaa 

tgcctgtgag 

gccgtctctt 

ctttgtgcac 

ctggagctgc 

ctagaagagc 

catgtttaga 

aagaaaatgt 

tgtgtgtgta 

ctccaccaca 

tacacacagg 

agaataagaa 

nnnnnnnnnn 

nnnnnnnnnn 

aacatagctt 

ttaattttta 

gttaatgcaa 

taaaagtcat 

atgcacacat 

taaataattt 

tgctgctttt 

ttagaaaagt 

cgtcagcaca 

caactcactt 

cacctgaagg 

aagccttagc 

tatttctcga 

caaaaaaatc 

ggttaggggc 

actctgacta 

ggttgtatag 

aaagatttat 

gatcccatta 

ttggaagagc 

taaaagttta 

tgtgtttgaa 

ttgattcttt 

cagctgagct 

agaaggaatt 

gtatgatatg 

cagatcacta 

taatcaaaat 

cctgtaacct 

aggacaacag 

gaacaaaacc 

actctgggac 

agcttccacc 

ctagcggaga 

gggtgcatga 

gaagccatgg 

gtaatgaatg 

gaaacttgga 

ttaaaagaaa 

aagcatcaac 

tcgggctggt 

ttcaaatccc 

ttctggagtg 

aagtattaaa 

ttttgttcct 

tgctaaatag 

tggcctaggc 



tttttaaaat 

caaaatgtta 

tgttttttct 

tacaaaagac 

tggcaggaat 

cactccctct 

cagagtgtgg 

acgctgtttg 

agcaaatgct 

ttttggagaa 

ttgcttggga 

tgtgtgtgtg 

ctaaagtcaa 

aagcattcct 

agaagggaaa 

nnnnnnnnnn 

nnnnnnnnnn 

aaagaattct 

atgggaaaaa 

acaggacaga 

ctatttatac 

acaaacagag 

taagctctaa 

gccattcttt 

aaggagcaca 

cagcaatgtc 

ggactggtac 

ctgcagcagg 

tgcagtggaa 

agtgaatgta 

caacttttgt 

acaagctccc 

ctacacctgg 

caaattttta 

ttttatgagt 

cagatggttg 

agtcagtgct 

aaattaattc 

gtatatatgg 

cctttcactg 

atcttgtggg 

catgtaattg 

gtaagaaaaa 

tgttcaagga 

gatcccatcc 

caagtccccg 

gatatzctgag 

aacaacttca 
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ctggaccatt 

ggggtgaggc 

ccagttccgg 

cctatgcaaa 

caggcaccat 

gagacaggct 

aaggagtatc 

cttttaggtg 

ttcaccaagc 

acagtgtggc 

tgagccacca 

cttaaccgct 

tcattatgca 

ggatgtagtt 

tgggtcctcg 

ccaccaaact 

aaatagaaaa 

ccaagaaaaa 

tggtgtccca 

agacctgcta 

ggcagcacag 

aaggagtccc 

ccatgatgaa 

acactacagc 

tgtcacacac 

gtggagggca 

acaaagaacc 

cagcactctg 

agagatgata 

aaatgtaaca 

aggcacagaa 

caggaacagt 

agcggttaag 

tggtggctca 

gctacaagtg 

taattggaat 

ttccgagcct 

ccttgaactt 

gactcaggct 



acttgtaata 

tttttgtatg 

tttttaaata 

tttaaaattt 

cttctgaaag 

aaaaatatat 

tgcagaggcc 

tgtggatgct 

aaagccatct 

ggaggatggg 

ttcttgagtg 

ctaggaatca 

ttcattaatt 

gactgtcagc 

ctgtcactct 
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tattttacca 

aaaaaagtcc 

agaaatccat 

caaattaaac 

gtgaattagc 

cttaagccta 

atagtgaaac 

tcccaaaggg 

tggtgacctg 

tgtgaggagc 

tggcgatggg- 

caggttttcc 

cacacaggaa 

ggcacgcatc 

cccttcattg 

tgctctccta 

ggttctgagg 

cagacaaccg 

tgtcttcaga 

tgtggttgct 

gagccatctc 

cgtgccacag 

atgtgacggc 

gagcgtcaat 

tatttttaaa 

gcagagttga 

acctctacaa 

gaaaagggtc 

cagagaagct 

cagagctgtg 

agtgagcgtg 

catcatgaag 

ttccacctgg 

tacagcttcc 
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aataagttag 

gaggcaggca 

aagtcccatg 

aatgcatggt 

tgtaaggtga 

gtattttaac 

agcaccgact 
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tattctttgc 

gtttcatgta 

ctgattcccc 

cagtgaattt 



tattttttaa 

gtgggatagt 

ttgtgcattg 

ttactctttt 
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ttgtatgtta 

agaagagggc 
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ctccctctag 

ctacaccaag 

tgtgtgtgca 

tatccagggg 

gtctgaatcc 

cagctaagga 

ntggggnnnn 
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agcattgcag 

actaatgttc 

gtctacggca 

cacagtaaca 

gtgcgcgcgc 

ccaaataact 

cccttcaggc 

tgtcataatc 

agttttacac 

cccacgctgg 

caccactttg 

aaatggcttt 

aagggcagtc 

tggaggtcag 

tcttcccacc 

agggcactgc 

atttgaacgc 

ttttttaact 
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gggaattgaa 

tccagccctt 

ggaagtgtaa 
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agggtcagca 
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ctcaggacct 

atttaacttt 

cataagcaca 

ctctgtgaag 

ggcattttac 

ttggttaaag 

gaggaggtga 

tcgtgttcca 

gcccaggaag 
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gacccctaac 

ctggggcttg 

ctgctgctgc 

gtgcaaaccc 

tggctcttag 

tttttgtctg 
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tcatcaacag 
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gtttacaaag 
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gcagggagag 
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aattgcctac 

tggctctgaa 
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acaggcagga 

cctgctgggc 
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aatacattca 

acagagagaa 

agcccttcct 

cagtctatgt 

tgaaggaaag 

acctatcttc 

gcacctcacg 

tctccttcca 

cctgctgagc 

agcctcattc 

gactcaattt 

gacccgagtt 

gggagcagag 

gttcagtaag 

ccgctgcagt 
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acagtaaaca 

tttaatccca 
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aaattttcat 

aagtctccaa 

agcagtcgct 

aagaggacca 

tgggatgcgt 

ggaacaagag 

catctttaac 

agttcatgta 

ctctaattat 

agtaaccttt 

gaattccaga 

agctgcccac 

aaaaccattc 

tgcaaatcaa 

tggtcccaag 

agctatcatt 

caaacccacc 

agcctgccat 

ctctgccagg 

ggtgaggccc 

caatatttct 

tatgaataat 

acagcacaga 



ttccatctct 

ctggctggtg 

tgctgctgct 

agtcaagaga 

aggagcagac 

cgcatagtaa 

ttaatgaggt 

ctgggctcaa 

gacggtgtgt 

aagcagttca 

ggttctaccc 

tatgagttga 

atgagaacag 

ccgagcacgc 

cttcaccctt 

cactatttct 

ggctgaagcc 
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agtctccctt 

ctcctgaggt 
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ccatgaggga 

cttacagtct 

aacttcccaa 

ggtagtgcac 

agttccatgc 

tgtctcaact 

cagttcctac 

aagtatatta 

tacaggacac 



tgggtcccat 

gtgtcgctgc 

gctgctgccc 

tttcaacacc 

tgcaccacgc 

agactggttc 

tctccctgaa 

tctgtgatac 

tcgtggcagg 

attgtctctc 

tttcccccaa 

aagagtctca 

agttcagaag 

ttctaaagca 

tatacaaatg 

aaaataacga 

agcctcgttt 

aaaatgtccc 

gccactgagg 

cccaactcac 

gtagctcaag 

gagaagatta 

acctatttac 

gccttcccat 

taaaatcagc 

cacacagtga 

cttgccttct 

cagctatgtg 

agggatggca 

gcctctcctt 

ttctgctgcg 

ataactgagg 

atggggcgtg 

cccaggcctc 

accacagaag 

catagccaca 

cagcataggt 

tacaattaaa 

ggcagaggca 

acagccaggg 
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aattacagtt 

tgaggctggg 

agtcccttgc 
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agtaacagca 

tactcccgta 

gagaagagaa 

atgcagttaa 

agatgtggta 
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gaagccagat 

actagattac 

attatggcaa 

gcaagcattt 

acctgtaatc 

cagcctggtc 

cccaaaccac 

tgacagaaaa 

tacagtacct 

acacacagct 



cttacccatt 

agaggctggg 

atatcctgga 

agcagtaaat 

tgcacaccct 

catttccctt 

gccttgccca 

taccactatt 

atccattaga 

aagtgotttg 

acaaagcaac 

ggtaccgtga 

cagccgtgtc 

gcaaataaga 

tttaccaagt 

gtttatgggg 

aggctgctca 

agagctcctg 

ctctcccttc 

cgggcatgct 

gaatgagacg 

aatagaattg 
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tatcactcaa 

aagagaaagg 

cctgctgcat 

ttaaaaagaa 

gagggcagag 
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ttctacagtc 
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cctacagtat 
catttttgct 
aattcttact 
gcgctggact 
ctttgctgtg 
acaatactcc 
aaaaaagcac 
ctcacagaaa 
aacaagtcca 
gacagacctc 
taaagacaca 
gctggggtac 
cattcccagt 
gtgtacccag 
ttgggttcaa 
aaccacagag 
gggagggggg 
gtaaacaagt 
aattttttta 
ccatgtgggg 
taaagaaacc 
taggagccca 
aattaaaaca 
gttaagagca 
agctaacgcc 
gcccgcaaac 
gattctttca 
ctctgcagac 
ggaattaaag 
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acataattaa 
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atgaagactt 
tctttgtgtc 
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acatacacac 
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atgccaggga 
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gatatgggtc 
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agagaggaag 
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gctcattctt 
gactagagtg 
aactggagtt 
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ttacaaacag 



atgcagcatt 
caatatggca 
agcctgggct 
agcttccaga 
gtgaaaggtg 
tggcttggta 
agagctcaca 
tcaagaagaa 
cttcagaggc 
actcagaaca 
gaagatgggc 
gctgtaacaa 
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ctcttttggc 
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gactgtctag 
gcagtaataa 
gtgtttgctc 
ggatgtaaat 
ctgtttctga 
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caagctatag 
tgctggcatg 
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atcacttctg 
actgcggctt 
agcacacttt 
aaacaacaga 
gggagaagct 
gtcggggtat 
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gcatttgcat 
ttatctataa 
gccatagttt 
cttgttaaag 
atggaggctc 
cttcaagagc 
aaatacaaac 
acagggtttc 
tccaacccag 
cacacattat 
gttaagtttc 
gtgaaataaa 
atacttaaat 
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gtgagtgtct 
ttctcgctaa 
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agccttgtta 
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atcatccatc 

tgcatggtgg 

gacgtaagag 

cccctgaagc 

ggtttgagta 

accaaaacca 

ttttgaagaa 

taacaacaac 

catttacaag 

catcaattct 

ctcactcatt 

atcacaggca 

ccgcacactg 

cttgcagtgc 

aaaaggatct 

ggggggcggg 

gcagaggttg 

aaaaataaat 

tttatgtaaa 

cctggtagga 

atgcagtggg 

tctactcaaa 

tggctcagtg 

ttccagggga 

ttacctacat 

accatcttaa 

ccagaacttg 

ccacagtgct 

ttcttttttt 
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142680 

142740 

142800 

142860 

142920 

142980 

143040 

143100 

143160 

143220 

143280 

143340 

143400 

143460 

143520 

143580 

143640 

143700 

143760 

143820 

143880 

143940 

144000 

144060 

144120 

144180 

144240 

144300 

144360 
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ctccagaggt 

tttggttggt 

ttttgttttt 

agaaactggc 

cacgtgccac 

ccagcattac 

ttaagagttt 

ccagaggtcc 

gatctgatgc 

taaataattc 

ggatcataaa 

cactgctgta 

cagaaggttc 

ttatctgaga 

tctcattcaa 

ctcttccatc 

gcagtggaaa 

acactgacat 

cacaaacaat 

ttccaatagg 

gattccaaaa 

caaagacttg 

acatagcagc 

cctctacagg 

aagatattaa 

gtcatcatgg 

agctgactgt 

gcattatatc 

tagtgcacgc 

gagtttacag 

taaacaaatc 

aggtccttaa 

ctggagttaa 

agaagaacaa 

actgaggcat 

gagctgctac 

tttttaaaaa 

aaaaactgct 

taaattttgt 

tcattttaaa 

tcatttttcc 

ggtttgatta 

gactgtagat 

ccctatattg 

gagttccttt 

gtcatctctc 

tcacacctag 

aatctcacta 

actcacagag 

aagcccagca 

ctattgagcc 

gacaggccta 

agggttaaag 

taataagaca 

cactcaggag 

aatgggaggg 

tgtaatttaa 

tcatggtttt 

tccttttgtt 

gaaagactgc 

ctccaggagg 

ctggtgcttt 

caagaaaagg 

atagccctgg 



ttgtttttgt 

tggttggttt 

aatttaatgg 

tttgaacttc 

cacatctaac 

tttgactctt 

gatccagggc 

tgagttcaat 

cctcttctgg 

tttaaaaaaa 

ttcaaggtga 

ttcatttcct 

agtattcttt 

tgcagagtcc 

ggcctaacac 

tgggtcagta 

atgagctcct 

ccacccagac 

atgtagttga 

ccagcacttc 

tgtgtccaga 

aaagatggct 

tcacaatagg 

cagaaagcac 

ataatagccc 

gcagtgtaaa 

gagtggcttt 

cgtggatttt 

ctttaatccc 

agagtgtcta 

agatctgtat 

ggacaggaaa 

ggactgttgg 

gaatctatct 

gctatgtgtt 

taaaacctaa 

aagagtgggc 

gctgctgctg 

tgtatacttt 

ttggcactgt 

atttatacag 

cagaaactat 

ccaaaggaca 

atcatctctt 

catcacttta 

tttcagatga 

cctcttactc 

tgtagttctg 

atctgatagc 

aaatttacct 

agtcatggtg 

tgggttcggg 

ttttgagaca 

aatcagacat 

gtgagatcaa 

gaggaggaag 

ggcctaagct 

ctgtcttcgg 

ttcaggcatg 

agcccaagat 

caagacagac 

cctcttacca 

agactctccc 

gggaagagaa 



gggtttatta 

tgctttactt 

actggttcca 

tggtcttcct 

aatatctggg 

catagtttct 

tggagagatg 

tcccagcaac 

tgtgtctgaa 

aagagtttga 

gcctgggtga 

cattgatctt 

ttttcccaat 

cctttattcc 

tgaggacatt 

tgaagtgagt 

ctctgttctt 

ccattctctc 

caagaagtag 

tgtgtccata 

tattacctcc 

tgatggctct 

ctataacgtc 

agacatacat 

acgcttgagc 

agtaaaggtg 

cttctcatcc 

aagttaggaa 

agcactcagg 

ggcagccagg 

ttcaggaaga 

gtattccaca 

aatagttgaa 

gggcatggtg 

tgtggctaac 

aaaacaaaaa 

agaatgtctc 

ctgctgcata 

agctgtgctg 

acagaaatta 

aagcaatgta 

caaagtattc 

agtggagaaa 

ttacttctaa 

gatatataat 

gaaacaaaga 

actccatgaa 

gttgtcctag 

ctctgcctgc 

tcttaatttt 

agacaggctt 

gcagcctgat 

aaactttgtg 

aatattataa 

tccccagcat 

gaagagagaa 

ctcgactgaa 

tacatagtga 

tcttaaagac 

gatggtgcgt 

catcagtgat 

gattctgcaa 

agaacccatc 

cacacagatt 



gtgttttttg 

tttcttattc 

tgtggccaag 

tcatctacct 

tttctttatt 

tccagagtta 

gctcagtggc 

cacatggtgg 

gacagctaca 

tccatttaca 

actggcagaa 

cagaggtttg 

tctattcagt 

tgtgaccatc 

tcactgtgct 

attgaagaac 

ctgcacacat 

ttctgactca 

ctatgtcatt 

tcagctagat 

ttcgtttgtt 

tccagaggct 

atttccaagg 

gcaggcaaaa 

ttattcctct 

ccccgccctg 

cctaactgct 

atgacaaaga 

aagcagaagt 

gctatataga 

tgcctatgag 

tgtgtgcatt 

tgttcacact 

atgcacaact 

ctgggctaca 

tctaataatc 

tgattttgtt 

gcacacagct 

tcaactttgg 

acagccatat 

ctgtattaaa 

tctaaatgat 

aacttaggaa 

taatagtggc 

ttatctcatc 

cattgaaaaa 

tattcctttt 

aactcaatat 

cgagtgctag 

ctaagacgtt 

aatctttaat 

ctatagagag 

tcaaaaacaa 

acaagtatta 

aatgagggga 

aaaggaaaga 

gttgtccact 

agagtggaaa 

atagtggaag 

aagacagaat 

ctcgaaagca 

aggaatgtcc 

cctgcagctc 

tctggggcct 



tttggttggt 

atttttttat 

gatagcctca 

cccaagtgat 

ggagtttgaa 

ttacactttc 

taagagcacc 

ctcacaatcg 

gtgtaatcat 

ctggacttct 

gtggcagaag 

ctaacgggaa 

ctttagtagt 

tcgttgtttt 

atgccaatcg 

ctggtgtcct 

tgaaatcaac 

tgtcacctct 

gtccacagtg 

gtgctgctga 

tcttctaaat 

gggtttgatt 

ggtctgactt 

cacataaaca 

gatgatacag 

cccaggggca 

gcaacatcat 

ttagatctat 

aggtggatct 

gaaattctat 

ctgtttgaca 

ttacagaaat 

cagtgttctt 

gtattcctaa 

tagtcaatat 

tgggatttta 

cagatggcca 

aatatttgac 

aaaaaaagta 

tagtctagac 

tattcagtct 

gaaaaaagat 

gaattttccc 

catttattga 

ctaaaatgac 

tctaacttgc 

ttttttttcc 

atagaccagg 

ggttaaatgg 

tctcctctta 

ctcagcactt 

agttctatgg 

acaaacaaag 

gtgtcactca 

ggaggggaga 

tctcaaagca 

tttaatgacc 

ccaagggtca 

caggtatgca 

gcctggtgaa 

gcctcactgc 

agcatctgaa 

tcaccaatgc 

ccctctctgt 



tgtttgtttg 

ttctttattt 

actttgtagc 

gggattaagg 

agggattcct 

atttgttaca 

aactgctctt 

tctgtaatgg 

ataaataaaa 

tgaggcagca 

ctgtgtctca 

gtaagtggaa 

agatccctca 

tcagggagtg 

ctctgcagcg 

gcctgtggct 

tagcttgcaa 

cataggtgac 

catggatttg 

tagtatttta 

aagccaggca 

cccagcccca 

cctgttctgg 

taattgaaag 

ctctcctgaa 

tcagtgaggt 

caactgtgga 

ggccaggcac 

ctgtcagttt 

ctggaaagaa 

tgtgtgatag 

tggttataca 

caaatcaaat 

catgtagaag 

tggacagtca 

tattttcctt 

cagaacctag 

tatacgtata 

tcccagttta 

acattaaact 

tatctacagg 

tcaagaatct 

tttatcccct 

acatacccag 

ctgttgatga 

ctgcataaga 

tttgagacag 

ctagcctcaa 

atgtgtcacc 

aaaaatggaa 

aggaggcaaa 

gttaaagttt 

ccagactgct 

attaaaaagt 

gaaggaaatg 

gaacacagga 

cttttcatgc 

tcctggaatt 

tgggtatcct 

cctggggaga 

caagccaacc 

aataaagatc 

cttcgaggat 

cagcttatgc 



144420 

144480 

144540 

144600 

144660 

144720 

144780 

144840 

144900 

144960 

145020 

145080 

145140 

145200 

145260 

145320 

145380 

145440 

145500 

145560 

145620 

145680 

145740 

145800 

145860 

145920 

145980 

146040 

146100 

146160 

146220 

146280 

146340 

146400 

146460 

146520 

146580 

146640 

146700 

146760 

146820 

146880 

146940 

147000 

147060 

147120 

147180 

147240 

147300 

147360 

147420 

147480 

147540 

147600 

147660 

147720 

147780 

147840 

147900 

147960 

148020 

148080 

148140 

148200 
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acacaccctt 

ctggtgaagg 

cattgccttg 

tgtctgaagc 

agccaggaat 

gggctgctcc 

agatttgaag 

acctgtttac 

agtgaagaga 

tatggaagta 

gaacacagat 

agcgtgcagg 

gcagatttga 

gcccaggcaa 

attggtattt 

aataaatcag 

agctttgagc 

tctaagacag 

tctcttaaaa 

caagaggcaa 

caggctacat 

caatagcaac 

gcttttttca 

catgtatgtg 

gccactgagc 

atttattcat 

ctaccatgtg 

tcctgaacca 

tattatatgt 

tagagatgat 

gcagtcagtg 

tggtttccat 

gaacttggtg 

aatgatggtg 

catactacac 

tggagagtga 

aagaaaagtc 

tttctccccc 

gccggaggat 

aggacgacat 

atgttttgcc 

gggcatcaga 

ggaatcaaac 

tttattacat 

cccccccccc 

tctagttaat 

cagattaata 

tacactgtcc 

tgccaattta 

agcagtagga 

agttagtctc 

cctctcaggg 

gtgagtggaa 

atgggtgaga 

ctctgacaat 

ttattggcag 

ggacagaggt 

cccaggaatc 

tttcaccaag 

gagaaggaga 

tcaagacact 

tctgccttca 

tggctcggct 

ggtttggttt 



tgaaaggccg 

aagatgcttt 

agaacaagtg 

acactttggc 

gtatgcagca 

tcacactcgg 

atgttgatag 

ctgaaagcaa 

agagcaagat 

attataatgc 

tcatctgtgc 

cccaaacctt 

ctgtttgggg 

taacatccat 

caagagctaa 

tgacctaaaa 

acatttctgt 

ctttagctat 

acttgagcca 

aggcaggcag 

agtaagatcc 

aaattaattt 

gacacaccag 

gttgctggga 

catctctcca 

tttgtgtgtg 

tgtcttggag 

tctccctagc 

aagtacactg 

tgtgagtcac 

ctcttaactg 

tttatccaaa 

tgtgagcatg 

gcctgcctga 

actagctgag 

tcatctccac 

cttttaaggt 

tccctcattg 

ctttgtgaac 

agatataccc 

tgcatgtatg 

tctctcagaa 

caaggtcttc 

attttcctca 

ccccgagcag 

taagctgaaa 

ctctgttagc 

agattttgcc 

tacagtgttt 

tttgaaagct 

tgtggtacac 

tgtccagaag 

ttctgcagca 

gggccttctc 

cttagcctgt 

taaatcagag 

cattggctga 

tcaggtgctt 

gttatgtatg 

agccctgctg 

gatagacttt 

ttggctcttt 

agtaaaggtg 

ttgttttttt 



ccatctagtt 

tgtaagtgtc 

ggataccaga 

cacagtacct 

ggcatgggac 

ggtcttctgc 

agttgtaagg 

gaatctggtc 

aggtgaaccc 

tatcttcagc 

catcagtgtc 

aaagtcccct 

agcagccatg 

atgtcacaca 

tgtttaagga 

ggaaaacaca 

gaacccagag 

acagcacgtt 

gagccaggtg 

atccctgaat 

tgtttcaaat 

tttagatgta 

aagagggcat 

attgaactca 

gtccattaat 

gtatacatac 

atcaaactca 

catttctagt 

tagctgcctt 

catgtggttg 

ctgagctatc 

gatgccttta 

aagaccagag 

gattccagga 

ctgtgtgctc 

aggcaagcac 

ggtggtggtg 

tgatggcaca 

tggaggtcag 

tgtctcaaac 

tctatgcaca 

ctggaatttc 

tggaagagca 

attacatttc 

caagtattct 

gggagggagg 

ctaactacac 

atcctttttg 

ataggtattc 

gacatttatg 

attttggggt 

ctgctgcaca 

tcccggtcca 

caagggcctt 

gcatagtttt 

gttatccact 

aggataacgt 

gctgactggg 

gcagagggta 

ataagggaag 

gtccttaatt 

gagccactgc 

tttgcctcta 

gagacaaggt 



gccacaaaag 

cttatcccag 

gttaccagtg 

tgtggcagct 

ctctaccata 

catgttgccc 

atgctttgtg 

aaggcatttg 

ccaattcctg 

cttttgcccc 

tccgcactgt 

aagctgaaag 

ctgcgacaca 

ggtaagtcag 

gaaaaacact 

gccgtcttat 

cttgggaggt 

tgaggtcagc 

gtggtggcac 

ccagcctcgt 

aaataaatat 

tttatttatt 

tggatcccat 

acacctctgg 

taaaaattta 

tataatacag 

ggttcttagg 

attcttttct 

caaataccgg 

ctgggatttg 

tccagcccca 

agggcctggg 

ttcagatccc 

caacggagac 

aagagagcag 

acacctgagc 

ttgttgtttg 

ttcctttaat 

cctgttctac 

agacaaaaat 

ttatgtctgg 

agacacttat 

gcaagtagtc 

caatgctatc 

tcattgctgg 

tagatgttgc 

tatagaagct 

ngtgtatatg 

tttgtgacgt 

tggcctatgg 

ctatctgcgg 

ctgggctgga 

gctgggagtg 

cttagtgtta 

ttattcgaga 

cttagggaag 

actgagatgc 

tttcttaggg 

taggggtttc 

tcccccattt 

agcagggccc 

tacaaaatat 

agcctgaagg 

ttctcagcat 



acattctccc 

gagaaatgcc 

gagacctcta 

gcagctatgc 

gcacagagtg 

tgagaatgac 

acgaggttgg 

attacacaag 

gatgcaatgl: 

atactgaaaa 

gcagacaact 

cagtgacccc 

tgctacagct 

aaatggtttt 

ataaaggaag 

atatcattat 

ggagatagga 

ctgaactaca 

atgcatttaa 

ctatatagtg 

aacaaaaaca 

ttatgtatga 

tacagatggt 

aagagcagtc 

aaactagagt 

gttcataagt 

catgggagca 

tttgtcttga 

aaaggggaat 

aactcaggcc 

tttttagtat 

aagatggctc 

tagcacccag 

agacaggggc 

ccctggctta 

acacagacat 

ggggtctttt 

ctcacatctg 

atagcaagcc 

tatttatttt 

tgcccatgaa 

caagtactgc 

tttttttttt 

ccaaaagtcc 

gccatctccc 

ccaaacttag 

tattctttag 

tctacagatc 

ggatctttta 

tcctgttaaa 

ttccgcatct 

aggatgaagt 

aatgctgggg 

cagctctagg 

tgagcttgta 

gagataggaa 

tcattagcac 

ggttgagagg 

aggctcccca 

tgagccactt 

aacaagtgtc 

ccaaatctgg 

cctgagtttt 

agccctgggt 



aggagaactg 

acgacctcat 

ctgtcaccga 

caagtattgc 

ttctcttggg 

tcttttagcc 

tagcacagac 

tcagggagag 

caatggaaag 

gcatagccca 

caaagctgac 

agggctgtgt 

gcagtgtaat 

tacttacatt 

cctggcatca 

gctattgaga 

tgattaggag 

tgagaacttg 

ttctagtact 

agatccccac 

gcaataataa 

gtacaccatt 

tgtgagccac 

ggtgctctta 

atttttaaac 

caattctctt 

agtatttact 

aagatttatt 

caggtcttgt 

ctccagaaga 

tcttattaag 

agtgggcatt 

gcagatgctg 

cctagctaac 

ctgtgcagga 

gcacaaagga 

gttttgtttt 

ggacaaagag 

catttcagcc 

atatatttga 

agccagaaga 

ctgagtgcta 

ttaatatttt 

cccataccct 

catctccttt 

gatttattga 

actttcacat 

ttaattcagc 

cccatcttaa 

tcacatttca 

cacacttttc 

ggagtccaga 

tcaggaggag 

caaaggcctt 

tgcatacact 

tacccaaggt 

ggggaggcat 

gtagcagtga 

gacaaagaag 

cagaaggcta 

tgttttcttt 

gccaggaagc 

cacttgattt 

attctggaac 



148260 

148320 

148380 

148440 

148500 

148560 

148620 

148680 

148740 

148800 

148860 

148920 

148980 

149040 

149100 

149160 

149220 

149280 

149340 

149400 

149460 

149520 

149580 

149640 

149700 

149760 

149820 

149880 

149940 

150000 

150060 

150120 

150180 

150240 

150300 

150360 

150420 

150480 

150540 

150600 

150660 

150720 

150780 

150840 

150900 

150960 

151020 

151080 

151140 

151200 

151260 

151320 

151380 

151440 

151500 

151560 

151620 

151680 

151740 

151800 

151860 

151920 

151980 

152040 
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tcactctgta 
gcttagatta 
gatggctcag 
gcaaccacat 
ctgaagacag 
aagaacacta 
ttcattctag 
gaaaggccac 
ttctacctgt 
gatggttgtt 
gccaccacat 
agttacccat 
gcacatattt 
agcccagttt 
aaaaacaaac 
ttgggcgtgt 

ttgggaggca 

gaaaccctgt 
gttaatatga 
ggagacattc 
cnnnnnnnnn 
nnnnnnnnnn 
aaccgtcagg 
attaaagcac 
gagaggcaca 

ctgggagaag 

agcagctatt 
tctgcagggg 
ttgtgtctat 
gtgtgtgtgt 
gtcagtgctc 
ggccaggctc 
taggaaataa 
aattggattc 
tatgcagagc 
gactgggctt 
tgcccgtcct 
gctactgaaa 
cagggccctc 
ctcctgaggc 
ccatccaacc 
agaaacctct 
gctaatgtct 
tgtgttgcat 
tgattcttgc 
gcccagggac 
tttttttctt 
tagtttataa 
tttatatatt 
taagtggaca 
aaatgtcatc 
ccgggggctg 
gtggcatagg 
tctgagattg 
tccaaaacca 
agcttcttca 
cactgtgctg 
atacccttcc 
cggtggctga 
aaaaaaagaa 
agtgctgggt 
ggccaggagt 
acggatgggg 
gtcaaagctg 



gatcaggctc 

aagttgtgcg 

tgggtaagag 

ggtggctcac 

ctacgggtgt 

ttcacaaggt 

gggaaaaatc 

agcacgtgta 

ttttgaggca 

ctatttctgc 

aagcatggca 

caagccatct 

aatcccagca 

acaaatcaag 

atccttctgt 

caaaaatcac 

gaggcaggag 

ctcaaaaaaa 

aatagcagaa 

catactgaaa 

nnnnnnnnnn 

nnnnnnnnnn 

ctcagaggca 

caaatcttgt 

catagcttcc 

cttgaagctc 

cagcagctgc 

aaagacaagg 

gtgggtgagg 

gtgtgacaga 

cctttggctc 

gcccacccca 

aaaacagggt 

agcatttctt 

acaagagact 

gcctgcaggc 

ggggtcatcc 

caccctcatc 

agtaggtcac 

cagagctcag 

cattggacta 

tccccagctc 

gcagcttcta 

ataaaaacga 

tgcttctctt 

tattgttgct 

gctagaagtt 

acatgataag 

tatatatata 

taggtataaa 

cactggcgat 

cctgagacgt 

gttggacagg 

ctggttgccc 

gcagggtcct 

aggggcgcat 

gagtggatac 

cgtctacggc 

aggcttcagc 

agaaagaaag 

gacggagccc 

ggggccccaa 

gtcctgcgag 

aacttgtgct 



aacttgaatt 

ccaacactgc 

cacccgactg 

aaccacctgt 

acttacatat 

acaacaacag 

tttttaaact 

tccaggttag 

gtctcttgtt 

ctctcatgtt 

ccagcactga 

tgctagcccc 

ctcaggaggc 

tcccagaata 

gttccagtca 

aagtagcaag 

gatctctgtg 

ccaagcaaaa 

ctcaagatag 

tatatgcaga 

nnnnnnnnnn 

nnnnnnnnnn 

ggatggcagt 

actaaaaaca 

gggcccttgc 

aagaactggc 

ggctcttggg 

agatggcact 

gaagtgccgt 

gacagaaagg 

ctcctgccaa 

ccaaagccca 

tcttgtccaa 

actcctcagt 

gtgcttgcca 

tgccttctgg 

aggaggagca 

tgctaaggcc 

tggccaaaga 

agcctccctg 

acactgtaga 

cgcctagctt 

cagtaggggt 

ggtgatgttc 

ccttcccgac 

ggctggggtt 

accgatacag 

acatacagtt 

tatatattat 

actgcacctt 

ctctcctttg 

caggaatgag 

gcaggttaac 

cagctggctc 

tcagcccttc 

gtcccgacac 

ggtccagagg 

tgtacatttt 

agccttttcc 

aaaagaagaa 

tgggtcctca 

gggccgtgcc 

cacacttgcc 

gatagtcggg 



cagagatctg 

ccacctaaaa 

ctcttccgaa 

gatgagatct 

aataataaat 

tgtactagga 

ttagtgtgta 

gagtcaactt 

tctgccctac 

gccctatgca 

gccaagggca 

agaaaatgta 

agaggcaggc 

gccaaggcta 

caatgactgc 

gtgtaatggg 

agtttagcac 

atagaattac 

ctaatgaaac 

gcctgatgct 

nnnnnnnnnn 

nnnnnnnnnn 

gatgtgtttt 

gccctctgct 

cctctgctct 

cgctgttgcc 

caagcttgga 

gagctgggta" 

gtgtgtgtgt 

gagagtgcat 

aaagcatgct 

ggatctgcca 

gagaattttt 

atcctctctg 

gaaacccttg 

gctgacccct 

ggaccactgt 

actagatttt 

gcctagatgt 

gcctgcacaa 

cgctgccttg 

gcctccagca 

gacggggtgc 

taagtatcta 

tcttcccact 

ctgaacaaaa 

tccttaattg 

tggtaaggag 

gtgggtgtgt 

ctgtgtgact 

gtgattggtt 

aggcattacc 

taagtgatct 

tgggagagcc 

tctccaaatg 

cactaatgac 

cgctcggtca 

gggcttataa 

tgaaacccag 

aaaaattctg 

gcggagagtg 

tgaggatgcc 

cttcctcttg 

ggcatagtcc 



cctgcttcta 

aaaatatgag 

ggtccgaagt 

gatgccctct 

aaatcttaaa 

aactttaaaa 

agaaagagag 

ttcagaagcg 

actttgtaca 

tgctgagctt 

ggcatctggc 

tttttgacag 

agatctctgt 

catagagaaa 

tgtaacaata 

caggccttta 

agccagggct 

aagttaacca 

aaaggattct 

tgcctagcaa 

nnnnnnnnnn 

ngcctgggca 

ggggcacaag 

ctcgatctgc 

cccggtcagt 

tttgtgctga 

^gtagctcgt 

gccaggatgt 

gtgtgtgtgt 

gttcttgtgt 

gtgttctcag 

ccccattaag 

attattattt 

gtagggaatt 

ggccaacagc 

gagtctggcc 

gatacagggt 

tttcccactg 

taaaactgca 

agtgctaaga 

tctgccagct 

ccctgacatc 

tccggggtgc 

agaatgttgg 

gacacctttg 

ttgccgcagt 

agcaaatata 

tgggttgggg 

gggcagagtg 

ctcattgcga 

cttggacccc 

cccatggata 

cagacaagag 

ttgttctgag 

accaggcttc 

tcactttgcg 

ggacagccga 

tccaccagga 

cagatcttcc 

ttccttgctg 

actgccagcc 

tgctgcaccc 

ggtcgtgcgg 

tgcagttcgg 



catcccgagt 

gggctggtga 

tcaaatccca 

tctggtgcat 

aaaaaaaaaa 

tagcccatca 

aggggctgta 

gagtctcccc 

tgaacttcaa 

actgatgcca 

tcacatggag 

gtgtggtggt 

gtctgaagcc 

ccctgttttg 

atatgaggat 

gtctcagcac 

gttacacaga 

gggtattggt 

attttataaa 

ctgaactaca 

nnnnnnnnnn 

agagtaagca 

gccccctttc 

agagatacga 

tcctggactc 

cgccagaacc 

tccttctcct 

gggaagtaaa 

gtgtgtgtgt 

gttcctgtga 

agtcaagctt 

aatgcggggt 

tttctctctt 

caggtctgtc 

cctattccct 

ctctgacctc 

tcctgagatg 

ccagactact 

gctaaagcct 

aggacattgt 

tgaaaacccc 

cacttctctc 

cagacaggcg 

tcccctgaag 

cccccagcaa 

tttttgtttt 

ggttcccgta 

gacctgtgca 

aggatatata 

gtacagttct 

agcaggtctg 

gggactgagg 

ccaagagtgc 

tcctgctcct 

cgcagagccc 

tgcctttgac 

gtgagacgtg 

agggtccaga 

acttaggaaa 

gacatgtggc 

ccagtatcca 

cactgggggc 

tgggcatgat 

taagctcttt 
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cccagagctc 
cttcttgtaa 
ggccccatgg 
agacttgatc 
cagcttcaga 
gccacggaac 
attcccattg 
ctcagtgagg 
cttgtggtga 
gccccggaag 
gagattgacc 
gtagagatac 
acacttatag 
cgttcgcagc 
ggtgttggga 
gctgaagtgg 
gtcctctggg 
gggcgcaggg 
gagcagcaac 
ggggctgccc 
aatgactctg 
cagggcattt 
tctctaattc 
ttgtgaacac 
tagctgcctt 
tatttaaata 
aataagcgcc 
gtttcctcat 
tgaggggctg 
tgccagatcg 
attgccaggc 
tgtagcttcc 
attttcggat 
ctcattatgg 
ccagcaggcg 
ggccaggcag 
gttcagctgg 
ttcttgctgc 
cccacttggg 
gacttctgac 
gctgttggga 
ggcttcagct 
aattgggttc 
taggacccct 
cggcccttag 
tgtctagatg 
ccagcctctg 
ctgccctgtg 
tccctgcttg 
cggtcccctg 
tttatgctgg 
cttttcttcc 
accattcgac 
ttcccctgtg 
ggaagtccct 
gctctattta 
agtttataaa 
actggggttt 
tgcagaacca 
caatggacca 
ctggaagcat 
tccacatgcc 
tgaggggctc 
ctgagcaagc 



accttagaga 
cctgacctgg 
gacggatggt 
tggtgaggag 
tcctggcctt 
cttcgcagcc 
aggcgaagga 
ctgttgttga 
acccactgta 
atgccttggc 
aggtccacaa 
tgcaggctgt 
aggtagaggg 
tgtcggttgt 
gcaatgaaag 
ccctgctgga 
atgccctccg 
tagcacacac 
agcagcaatt 
agaggaggtg 
ttggctcaca 
ggggccaagg 
catccgaaca 
aggcgagtcg 
agcaaggaat 
attcctcaaa 
tctttcacac 
gagttaaaca 
tgagtgacac 
aggctgcatg 
tgtggctgta 
ctctaattgg 
gtgtccagcc 
acgattgtca 
cgtcaggaag 
ccacgaccta 
gactcttggg 
tagctgtgct 
gtcccaggcc 
gccccccccc 
gttatgggct 
agaatgtttc 
agttgtgatt 
ttctgggccg 
acagcatgca 
ttactctgtc 
tcctagggat 
ccaccagcac 
gcttgcaccc 
ggaaagcctg 
atgagacaca 
ccggcctgca 
atttgtacag 
ttctcccacc 
tggggagatt 
ggaaaaaaaa 
cataccgtgg 
tccctctaaa 
ggctcagttc 
caaggacctt 
actaatagga 
agggatgtta 
agccactctg 
tgtcctagtc 



tctggttccg 
agccaggcgg 
gctccttgcg 
acactggtcc 
gccgcagctc 
attcccacag 
actccaaggc 
agagaaagag 
gctggttctc 
ccaggctcca 
agatgtcatc 
gcaggccacc 
cgtgaagctt 
ctccaaggtc 
tgatgttgtt 
ggaaggtgat 
ggatggcagc 
agtctcgagg 
ccacacagca 
gagatagatg 
gaggttctgt 
cccacattat 
cttcttcaga 
gcctccggct 
gactctaaca 
ataaagaaag 
tgtggtacaa 
cagttcacag 
cagggctgac 
ctcctctccc 
aaaccgggcc 
gcattaatta 
tgtggtcggg 
ttacctgcct 
atggggacag 
tgtcgcctca 
agagccaggg 
cagagcaagt 
cagggactgc 
cattcccaac 
cccagagagt 
tagcacataa 
tcagagcaat 
ggccatgccc 
ggctataacc 
gatcttcctc 
cctgtgctgc 
tcacactggc 
tctcttccag 
gactctaggt 
gcagagcctt 
gctgcaggca 
ctcatcggag 
tactgtggct 
acccccctta 
aatcaaactg 
caagtggagg 
agggcaatgg 
ccctgttatc 
gtccttgtag 
tgtaggctca 
accttccaag 
catccatgtc 
agccctggcc 



gttcctgtgg 
atggccatgt 
ggcagccctg 
tgtgcagttc 
gggggtcgcg 
ggaacgtgcc 
caccaggggg 
ggtggttagc 
atgcagcagc 
tagcttgtta 
ttggaggtac 
aaagatgcct 
caccaggcct 
tagctcctcc 
ggagtagatc 
gcgattgttc 
aaagttgtgt 
acaaccacca 
ccctggtggg 
gggaacagag 
cctctgtatt 
ctcctcacct 
ctgtcagccc 
ctgggtccgg 
aagcaaatct 
cattgagtcc 
aaacttcaaa 
ggctgtgtgt 
gcacgaggct 
ccattcacac 
ttgcctcttg 
ggcattcgtt 
taataggcct 
ttttccaggg 
gctccaggcc 
gttggcctct 
cccctggggg 
ggaaggagca 
ccaggccccg 
agaaacagca 
ggcagctgct 
aaatcatcgc 
tatgacctca 
tgcaggccct 
atctctgcct 
ctcagcatcc 
agagggaggc 
tgcacagtcc 
gaaggccatt 
tcaaggacat 
cttcactggg 
ggtaggggtt 
tttacagagg 
gctctgtctc 
cataggggcc 
aaccaaacct 
aggcaggcgg 
ggggacaaag 
tcaaggctat 
gacagtcact 
ggacagctgg 
cctccatctc 
tttgaaagcc 
acttctggac 



ctggtgcagt 
gggtggcctt 
tcagaggtgg 
cggaagtcct 
caggggacag 
cggcagccac 
gccagacagt 
ctgtggaggt 
aaccggtcca 
ccatggagaa 
tcgatatggt 
gcgggcaggg 
tggaaggtct 
agatgcacaa 
cagagggtga 
tgcaggaaga 
gcctggcagc 
cccagaggta 
gagagacaga 
ggtggaatgg 
gtatgcaggg 
ctttagctct 
caccgcaggg 
ctctgccact 
ggagtcctga 
atggtaccaa 
cctacaactc 
acagacaagg 
tcccttgggg 
ccccccctcc 
actgtccaga 
aatggatcct 
atgctcatta 
tcacagctgc 
tacgggcgcc 
tgccccttct 
aatatgagct 
gggaccttct 
gcagagtagg 
tcgtaagttg 
tctcgtccct 
atataattta 
gcagcagggg 
ctgcagtgct 
caatttcctg 
tgggtgtggc 
acagtcggag 
acagacccac 
cttgccagaa 
tcatgatgct 
gggtcctgtg 
gcagtgggct 
gcttttgttg 
tgtgcacccc 
tccaaggaat 
caggtgtgga 
cagaacggca 
ggactctagc 
catcactagg 
tcctgcagtg 
tctctgtcct 
ttctaatggg 
agtggtatta 
tccttgcctg 



tcttgcctgc 
tgtccctgga 
taagcgtgtg 
ccaccctcag 
cagagctaga 
agtcccaagc 
caccctgcag 
catggaaagc 
ggttcaccag 
acaagtgact 
tgtcctgcaa 
cgctcagtcc 
cgggtgccag 
agccctcgaa 
ccatggcggg 
tgcgctcact 
tgacagtcat 
gctctccagc 
acagcagtga 
gggactggca 
gtcccttgga 
gtccctaaag 
tggaacacac 
cgctcactgt 
atgatcactt 
agcatgcctc 
ctccatggct 
cacatttctg 
ttcacagtac 
tgtgctggag 
gcatttcctc 
taaaataatt 
tggaagccgc 
cccaagtggc 
caccctgaag 
tttccagctt 
gagctgaatc 
gaccaggctt 
ttgccacctt 
acagctccca 
gtaatcaccc 
gtttttgcat 
tggcacagca 
ttctgcccac 
ctccaaaacg 
ctccagcctg 
ggaggggagc 
agctccaacc 
cctttcccaa 
tgccccacat 
aaaatgaaag 
tatcactaat 
taccctcaac 
aaaagaatct 
acaggctaca 
cttagtaacc 
tgaaggtagg 
caagacctga 
aggcctaagg 
aagtgctctt 
ttagtatttt 
gggggggtgt 
ctccaggacc 
agtcagtagg 
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tgccaatcct 

atggcgcctt 

attaatgtac 

aggaatccaa 

cttgggcagg 

cttcccctcc 

tcccacgcac 

acttgccccc 

atcatgcccc 

gaaattaaga 

cttggtcact 

tttgaactag 

gctcagacct 

tgggtcagct 

catggtccca 

atgtgcaaag 

agcctccaag 

accagaacca 

aggggccctg 

gtctttaatg 

agcatcagtg 

ccctgccttc 

tcgaaatggg 

agcacccaag 

ggtcagtggg 

ggccttcagt 

tgttaaccga 

ctggggctgc 

ctccttcctg 

ctcagcctcc 

cagggctgtg 

agcctcgttt 

cccacagata 

ccaaaccaag 

taaagatggt 

ctctggggaa 

actcgctatt 

gccctgcctc 

agtcaggaag 

ccccaggtct 

ccgtccctcc 

tacctggacc 

agcgatttgc 

acagcacgac 

tgctccctgc 

gtctctggag 

tcatagtagg 

ggagaacaga 

ccgtttgttg 

aaatagactg 

atgctaaaga 

cagaaatgtc 

aggtgcttgg 

aagtgagttc 

taaaaatctt 

acacacacac 

gggctgaact 

tgtctgtgta 

tggatggaga 

actcacatct 

cccaggccca 

gcccagcatc 

agcaccccag 

cccaggccca 



aggattgtca 

ctatgccccc 

caacaatgag 

gtctctgtgg 

gctgccttct 

aggaggtgcc 

gtgagctccc 

tagtctagga 

atttgcatat 

gaggggccta 

gatcccctct 

ggctgagaag 

cagcacacac 

ggcaagcccc 

tgggaccaga 

gcactggggt 

gaatactgta 

gcatcaacag 

tagcaccctg 

aaaagccctt 

ggctggttgc 

aggctggggc 

ctgggtcagc 

ggggctgccc 

tctctctgaa 

cctccctccc 

atgctctctt 

tggcgcctca 

gctcggccct 

actgctgtgc 

gagccagagg 

tcctcacctg 

ctggtaccca 

ctcttctctt 

ccctttacat 

attctgctgg 

gctaccattc 

ctcaccccta 

ggctgctcct 

ttctcccagc 

cacatgccag 

ggttcccagt 

aaggagcttc 

ataagtcact 

acacagtagg 

aattgaatga 

tgctcgataa 

cacctgaact 

gatgagcact 

ggactgggga 

aaaaaaaaag 

tgggtttatg 

gaggcaaaga 

taggacagcc 

agtgggggag 

acgttagtat 

cagagtgtta 

gatgtgtacc 

catggttgtg 

acctctggag 

gcaccccgac 

ccacgcccag 

acccagcacc 

gcaccccagg 



ccagcaggtt 

tctatgaggc 

gatggagcct 

aggggccctg 

acttgagggg 

cttcatcgtg 

aaggggctct 

gagccatgga 

gcggtggcaa 

aggaaagcct 

gtaacaccag 

ggtctgctct 

tcaacagaag 

aaactcttcc 

cctggtagga 

gggaaagggc 

acctcagctc 

ggcctgtctg 

gccaccagaa 

ccggaagcca 

agctggagaa 

tctagccagc 

tttttcaggc 

ctctacccct 

gctttgtagg 

tgcccccagc 

gctgtgggga 

ggccacttgg 

gaagacaaga 

agagcggccg 

cctaagttga 

caaaatgggg 

cctgcccttc 

cctctccggc 

cagctgcctg 

ctagatttta 

acccagtagc 

cacctccttc 

tagtctgcat 

ccagaggccc 

ccctcagcct 

gtgtctgcag 

tttcctggga 

tgtttccttt 

tgctcattaa 

tgacactcag 

atgttggttg 

tggctcacat 

ctgagaagga 

gtttcccaag 

ttagtcactc 

aagagaggaa 

cagatggagc 

aaggctacat 

cagtcaagga 

aatatcatac 

gtgtgtgcta 

tgaggtgttt 

tttgctaggc 

actgagagtg 

ccagcatccc 

catcccaggc 

ccaggcccag 

tccaagcact 



tcttcctagg 

ccttgggagc 

ttgccatgca 

cacctcttct 

gcggaaggga 

ttctgcttcg 

acagcctccg 

agacagtgtg 

taccctggtg 

ctgctatccc 

cccttctgca 

gttcagctgc 

gcatgcaagc 

tgccaagctg 

taggtggcaa 

ctgtgcttct 

agctgggctc 

ctaaacccag 

ctaacgagga 

catttgcaca 

cacagcaggg 

tccctgatgc 

tccaaagggt 

aagtagaggc 

ctgcttcttg 

ctggccagct 

tggcggcctc 

cacttgtgcc ' 

aagctggcca 

gcagcctcct 

atcctggctg 

cagtcattgg 

ccacatctcc 

ctcctccgtt 

ggaggaacac 

gaatgtatcc 

ctcctggatg 

aacaggaact 

cctccagagt 

tgcctctccc 

ccctcctgcc 

ggctgcttcc 

cattttactc 

atgtccgctg 

cagttctggg 

gaagccctag 

taattagtcc 

caagactcct 

cccaatacca 

gtatgggatt 

taaaaggggc 

gccagatatg 

tctgtgagtt 

agagaaaccc 

ggaaacacac 

tatggctctg 

gtggatattt 

atgtgtacac 

atttgcatgt 

acaacccagg 

aggcccagca 

ccagcacccc 

catcccaggc 

caatgcccag 



gaggcaagca 

cccgccccac 

ttttaacatt 

gccagactca 

gaagacccag 

ttactctcaa 

tcattccttc 

ggaagggctt 

ggtaccagga 

tgggctacca 

acctgccaga 

cttggctggg 

aagggagcta 

agcatgaaaa 

ggctaaggca 

caccccctct 

gggtggccag 

acctcacaag 

agatctgacg 

gaagaaaatg 

ggacaggtcc 

ctggagtagg 

caggacagct 

atccccatgg 

gccatgtagc 

gctcttctct 

aggccaggcc 

acttgtgttc 

gggaacagct 

atccatggct 

ctccttttaa 

aagctcagcc 

atctgtctaa 

cacaacttct 

ctcagaatca 

agtatctatc 

ccctcccccc 

agggtagtcc 

gcccatctaa 

tggctacaca 

tttgtccatg 

cagagaagcc 

tacagcacgc 

ccactttgtt 

ggagggaatt 

gctcaagcct 

tgggagactc 

taggcccatc 

gtcctttcct 

ttcagcacta 

caggaccaaa 

gtggggcaca 

tgaggccagc 

acttgacttg 

acacacacac 

tgcctgcagt 

gagctctgta 

aggtgttggc 

gtctgagttc 

gcccttttat 

ctccagaccc 

agacccagca 

ccagcacccc 

caccccgacc 



ctgtatcacc 
tgattgcctg 
gcaaattagc 
tcaagcgcct 
ttccactctc 
gcctccggcc 
ttccattcat 
gacaatgagc 
gagtataggg 
gtcagcattg 
gtttttgacc 

a 9gggaatct 

gcagtggcct 
gccacctcac 
gcggaatagc 
aatggtgcag 
agagcttggc 
ccagtttagt 
ctgggaatat 
aggtgcccag 
taccaagcta 
taaagcagcc 
gctgcagctt 
cccctgggca 
caatctctct 
gagcaatcga 
agctgcactc 
taaacacagg 
gggctcccat 
gtgagtagaa 
tgcttgcagg 
agtcctccag 
ctgaaaccat 
ccatcttggg 
ccgtggctca 
cacttatcat 
ccccccgcaa 
agggaaagtg 
ctagaagctg 
gaccttgctg 
ctgttccatc 
actcttgagc 
gacctcttgg 
ctgatgagtg 
accttctcaa 
ggggcctgga 
agagccttca 
aaggaagtga 
gggcaagggg 
aactggaata 
acctttcaaa 
tctttaatcc 
ctattctaca 
ccacccaaat 
acacacacac 
ccaggaatga 
tttatgtgca 
ctgttgcata 
atgcacataa 
cctgcagcac 
agcaccccag 
tcccaggccc 
gacccagcat 
cagcatccca 
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ggcccagcac 
cagcatccca 
atcccaggga 
cagggacagc 
tggaggcagc 
cctcattgct 
tcccagtacc 
gtctggggtt 
cagcacatcc 
tggagcctac 
tgggagtcat 
gtatgggaat 
ctgcacggga 
aactcaggat 
ccagggctaa 
tctgtcatcc 
ctacttctcc 
aatcttctgc 
tcccttttac 
tttctncncn 
nnnnnnnnnn 
aaattgaact 
ccttgtgatc 
tagtggttct 
aggcattgct 
tcttaagatg 
agcatgtgct 
acaaaaccaa 
attgggacac 
gaggatgaaa 
aaaatatata 
ttgactgtat 
cccaaaattc 
caggaatgct 
acacttccag 
atgtccttgg 
aagtatttat 
ctaggcctag 
cagcctttag 
tgattcagtc 
gtcttttctc 
tctaatctct 
ttctctcttc 
tgtgccgctc 
ctgtgctgca 
ctctcttctc 
tttgtctgcc 
tttaccttca 
tgagagtaaa 
tctccataaa 
ctgccatgtg 
aataagcatt 
cgactacatt 
agtgaatttt 
ttatattaag 
tttcttcata 
cttatggaaa 
cctcccatct 
tgaactcctc 
gagataaggg 
catgacccct 
ctagtgatgt 
agcctatgaa 
cgaccccaga 



cccaggccca 
ggcccagcat 
cagcacccca 
accccaggcc 
acatactgaa 
ttcagcacct 
aggattcccc 
ggcagcccca 
tgcccattag 
cctgtatagc 
ttcacctagg 
aggtaaggag 
gattgtgtca 
ggtccagaat 
gacccttcct 
ccatctgtgc 
cagcatccca 
taggtttgag 
caccaccact 
nnnnnnnnnn 
nnnnnnnnnn 
tacagaggaa 
agaaagctat 
ggggactgaa 
tatgctaagc 
cctataacca 
caaaaacaca 
acaacaacaa 
atttatacta 
agcccctgca 
gtgtatttat 
atctttgaaa 
ctgagttgga 
ctgatctaat 
gaggcagaaa 
agaggctacc 
ttgactgatg 
ccctggaagc 
gacttgctgc 
cccctgttcc 
ttggcctctg 
tgtctccttc 
catccttctc 
tactctccct 
ctctcttctc 
ctgagggttg 
actcaattag 
ttgtttcaaa 
gatgtgtgct 
taacagaatc 
gtgaagagaa 
ttcttgggag 
ttgccatgcc 
aaggcagcca 
aattgggaac 
tacaacttag 
ctgaagaata 
agagattgtt 
accctagagt 
cctcctaaaa 
tagttacgta 
aaacttgtac 
aaatttggct 
gctctggtct 



gcatccaagg 
cccagggaca 
ggcccagtat 
cagtatccca 
gatagggaat 
gctctcctca 
tcagctgagg 
tgctaattgg 
gcctaacctt 
tctgtattga 
caactccaga 
agcatagtga 
ctgtggttcc 
ccagagctct 
tggacggttt 
ctaacagctc 
tgcccgcacc 
ggccccaggt 
aatcctcttc 
nnnnnnnnnn 
nnnnnnnnnn 
acacaaatga 
agattaaacc 
cccaaggaag 
tgacaaaggc 
tgggtggagc 
cataaagatt 
aaagtactta 
ttctcgttct 
gaacaacgca 
gtaaaacata 
agaaccagca 
ggttcaatgc 
caatggattg 
caaggtgagg 
tcatccttga 
taactgtcat 
ttctagcttc 
tgagatcacc 
gggctcaaac 
aattgctctg 
acactctctt 
tgtaaagctc 
ctcagctcta 
cggtaccacc 
ggcagatcct 
acatcacttt 
ttaaaggtga 
aataaggctg 
ttagggttca 
tgcagagagt 
ttggattaga 
ttgagactgt 
aggctgtggt 
aaaaaagcag 
tttttatagt 
aaaattttta 
cccggaacac 
tcgaaccctc 
caacctcaaa 
gattcccttg 
tttccctgcc 
ggtcgtcgat 
atgttccatg 



cccagcatcc 

gcaccccagg 

cccagggaca 

gggacagcac 

gtctctgagg 

cactcggaat 

aatgggtagc 

actgacagtt 

gcctgcaggg 

ggcactcccc 

tgggcacaaa 

gcctggctgg 

agactgccaa 

taagagagca 

cttggctttc 

tctgtggaaa 

ccagctcact 

tgaggctgtg 

ctcctcttcc 

nnnnnnnnnn 

nnnnnnnnnn 

gctgtaatga 

ccagcccttt 

gacagctgct 

tttctgaatg 

aattctctaa 

gtttactgaa 

atagctcatc" 

atcacacgac 

cacaatgaaa 

cataaacaca 

gagtgctatg 

agagttcata 

accaactgat 

cctaactgga 

tccttccctt 

cttggaggct 

catacaatct 

gtttcctgtt 

tcctctcccc 

cttggtctca 

gcttgttctg 

tcccggtaaa 

ctgcactgct 

tgtgtctccc 

atcctgtcaa 

caagcaggag 

gtactaaggg 

agccaactct 

caatacgatc 

taaggatgtg 

gtcattcctg 

ggagggctga 

tggactgtta 

agtagaaagg 

tagcttttta 

ctgaactctt 

tcctgaactc 

ccaactaaaa 

atgaaccggg 

gcagaacccc 

cagctctccc 

tctcctctac 

tgctttcttg 



cagggacagc 
cccagcatcc 
gcaccccagg 
cccaggcaca 
cctcttatct 
caaacaccct 
taccattttg 
tctcctgaga 
gtgtgctgta 
caagctatga 
aatctctcca 
gcacctgaga 
gacatcttgg 
gatgctgaga 
tactctgtcc 
acatgaggcg 
gtgtgccctc 
ggtttccctc 
tcctcttcct 
nnnnnnnnnn 
nnnnnnnnna 
cagaaaagga 
ttgcctttgt 
tggaatgttg 
catatggtag 
gagtctgtct 
gcagtatttg 
agtaggaaaa 
tattaagaag 
ttgaatagag 
taaaaagaat 
gtttataaat 
ctcttaggaa 
agattaataa 
ggaagtttgt 
tctccgcttc 
tactggctcc 
aatccaagcc 
ctttctgaac 
gatgatttta 
aactaactct 
tctttactgt 
cctgcctcct 
ctccacagct 
ttacgtagct 
acctttctct 
tgcctcctct 
tgtgtctctt 
agctagaaat 
aaatatcctg 
ctcttgaggt 
tgacactgtg 
gcgatggatt 
ctagctacgt 
acagttttgg 
tgttgctatg 
cttcacccca 
ttcaccccag 
actgttccaa 
tacattgcca 
ttgtcccttg 
cccttgagtt 
accactaggt 
ctgttgttct 



accccaggcc 
caggtccagc 
cccagtatcc 
gtatcccaca 
tggtccttac 
gtgcaggttc 
gcttttgtct 
gcaatttggg 
ggggcaggga 
cccatgccag 
ataagggtag 
cctgagcagc 
ctttcacccc 
ggcacttaac 
tctgtcccag 
tatgagctct 
atgttactca 
catctgtccc 
cctcctcctc 
nnnnnnnnnn 
caaaggacgc 
atactatttc 
ttttcatatt 
gagacctctc 
tatctattca 
gaacttctta 
caacagtgaa 
agaacaacaa 
attaaggtgt 
aataaagaaa 
agaaaactct 
gaaaagttcc 
gtaactaaat 
tctgaaggca 
caccagaggc 
ctggctgtaa 
atcagctaac 
tagaatgttc 
tctagctggc 
ttcacaatct 
agcaatcttt 
gtctagtttg 
cctccccctc 
ctcctgtatc 
tccctttcct 
gattcttcac 
acaaaccaac 
tttcagccag 
agtttctttt 
agacagctgg 
ttcagatggg 
acaaaggact 
aatacactgg 
ttagctggat 
cagaaggagg 
accaaaatac 
gaacccgacc 
aatgctttcc 
gaacattttt 
aataatagga 
acagaacccc 
ttactatata 
gcatgagttt 
attaaatctt 
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gccttctaca 

tgagtgcttg 

acagcgcgac 

ggtctgatgt 

gctcagtgag 

tgtccaccgt 

cctggtggac 

tcccacctcg 

agaccgtggg 

ctcgtgcaga 

tgtggtcgac 

ctcttgttgt 

tgactctgga 

gaaagggtcc 

cacctgaggg 

aggaaggggg 

tcccacctcc 

cccagtctga 

ttgacgacct 

ctgcagctgc 

ctcggagtcg 

tgccactcag 

agtactggcc 

cagagaaccc 

cttgggatga 

tcctcataga 

acctcgtgga 

gtaggggacg 

gacggcccac 

cagtcttcct 

cagaggggca 

agaaaaagct 

aagcagaaaa 

aagaaataga 

ttttggccgc 

acagggcagt 

attgcaaaga 

tcctaaccct 

taactttgtc 

cagtactcac 

ctggtagtaa 

tgacccactc 

taaccaaact 

aggcccctgt 

aacccaaaaa 

aagcaggaat 

ccacccccat 

ctcatatcca 

caccacttct 

gggaggttaa 

taagctctct 

tttgcctgcg 

agggcggaca 

ccaccctgtt 

agcttaccct 

gtcaccaggg 

ctaaaaaggc 

gcaaaagatg 

ccccacggca 

gctttgcaac 

ggaaagaaga 

cgctagcatt 

tagccagggg 

caaaaaaatt 



ttttgagtac 

agtgagggtc 

cacccagagg 

ttgtgttctg 

accgcgctcc 

ccgttcaccc 

ccctttggag 

tgcccagttg 

ttcaagtccc 

gggtctcaat 

gcagagtcgc 

gtctactgtt 

gcattggaag 

gtggcagacc 

tgcttttgac 

tcaccctgat 

gtgggtcaag 

tgcagccgga 

cctctggata 

cccaccacag 

ccgaggccga 

agcacatgtg 

tttttcctct 

ctctggactt 

ttgtcagcag 

ggcgagaaaa 

cgaggctttc 

cctccttgtc 

caatttggct 

tgagcgtcta 

gagagccgct 

gcaaaggctg 

agtctatcat 

ggagagggaa 

agttgtgaat 

gaaaccgcaa 

gaaaggacac 

agaagatgat 

cgtggagggg 

taacccccta 

attttacccc 

ctttctggtg 

aaaggctcaa 

tgcctgcctt 

tgcagtctct 

ggggttggct 

ttcggtaaaa 

gaggttgctg 

gccggttcaa 

caaaagggtc 

cccacctgag 

tttgcaccct 

gactggtcaa 

tgacgaggcc 

actacagtat 

aactgagagg 

acaaatctgt 

gctcacagag 

ggtacgtgag 

cctagcagca 

acaccaaaga 

accagacttg 

agtattgaca 

agatcctgtt 



ggtctcagtg 

tcccttcggg 

tcctagaccc 

tttctaagtt 

gagagggaac 

tgggagacgt 

gccaagagac 

cgagatcgtg 

acctcgcgtt 

cggccggcct 

cgccgtttct 

tttctagaaa 

gaggtgcggg 

ttttgcgcct 

ttgtcactaa 

cagatcccct 

ccttggaccc 

aagtctggcc 

gactcccaac 

ggaccaatag 

agccccgggg 

agagggccag 

tctgatttat 

actgggctcc 

cttttgcagg 

aatgttctgg 

cccttgaacc 

tatcgccgga 

aaggtaagag 

atggaggcat 

gtagccatgg 

gaggggctcc 

aagagggaaa 

aatagacggg 

gatagacagt 

ggtggcaaaa 

tgggctagag 

tagggaagtc 

actcccgtca 

ggcaagctag 

tggacgacca 

atacctgagt 

gtccaattta 

gtcctcaaca 

tcaggttggc 

aaacaagtgc 

caatacccca 

ggccaaggag 

aaaccaggga 

ctggacattc 

agaacatggt 

aagagtcagc 

ctaacttgga 

ctccatcggg 

gtagatgatc 

ctcctcacag 

caaactgagg 

gcccggaaaa 

tttctgggga 

cctctatatc 

gcttttgagg 

actaagcctt 

caagcactgg 

gctagtggat 



tcttcttggg 

ggtctttcat 

acttagaggt 

tggtgcgatc 

gcggggtgga 

cccaggaaaa 

catttggggt 

ggttcgagtc 

tggtcacgag 

tagaaaggcc 

ggtttctttt 

tgggacaatc 

tcagagccca 

ccgagtggce 

tcgctgccgt 

acattgtgac 

caaactcttc 

catcagcacc 

ctccccctta 

cgagaggggc 

aggaaggggg 

caccaggacc 

ataattgaaa 

ttgagtcact 

ttctttttac 

gagaggacgg 

gccccaactg 

ctctagtggc 

aggtcttgca 

ataggagata 

ccttcattgg 

aagatcatac 

cagaagaaga 

atcgccgtca 

caggaaaagg 

agataccact 

attgccctaa 

ggggctcaga 

acttcctgat 

gctccaaaaa 

aacgagctct 

gccctgctcc 

cttcagaagg 

cagaaaaaga 

taactgcgtt 

ctccggttgt 

tgagcaagga 

ttttagtggc 

ccaatgacta 

accccacagt 

atacagtcct 

tcctgtttgc 

ctaggctacc 

atcttgcgcc 

tcttggtcgc 

aactgagtga 

taaccttcct 

agactgttat 

ctgctggctt 

ctttgactaa 

ctatcaagtc 

tcgtcctata 

gaccctgaaa 

ggcccacatg 



tccgcggctg 

tttggtgcat 

aaggttcttt 

gcagtttcgg 

taaggataga 

acaggggagg 

tgcgagatcg 

ccacctcgcg 

atcgtgggtt 

atctgattct 

ttgtcttagt 

tgtgtccact 

caaccagtcg 

aacgtttaga 

caggcgaatt 

ctggcagaat 

gaaactgacg 

ccccaagatc 

ccccctgccc 

tcagggaccg 

gccagactca 

taatgatctc 

aactaaccac 

tatgttctcc 

aacagaagaa 

cacacccact 

ggactacaac 

aggtctcaga 

ggggcagact 

cacccctttt 

tcagtccgct 

gctccaagat 

gaggcaggag 

ggagagaaat 

taaaataggg 

ggaaaaagac 

aaagcgggag 

ccccctccct 

agacaccgga 

gaccatggta 

tcagatagac 

cctcttgggg 

cccacaagta 

gtaccggttg 

ccccaatgtc 

ggtagaactt 

agctagaaaa 

ctgtcagtcc 

tcgcccggta 

cccgaacccg 

agacttaaaa 

ttttaaatgg 

acaggggttc 

ttttcgcgct 

ggcggcctcg 

cttggggtat 

ggggtatacc 

gatgatccca 

ttgtagactc 

ggaaggggtt 

gtctctaatg 

tgtggacgag 

aagacctgta 

tctgaaagct 



tcccggggct 

tggccgggaa 

gttctgtttt 

ttttgcggat 

cgtgtccagg 

accagggacg 

tgggtttgag 

ttttgttgcg 

cgagtcccac 

ttgagttgct 

ctcgtgtccg 

cccctttctc 

gtggaagtca 

gtaggctggc 

gtttttcagg 

ctcgtccaat 

gtcgcggttg 

tatccagaga 

caacagccac 

gcgggggaga 

acagttgcct 

attcctttac 

cctcccttct 

catcaaccca 

agaaaaagaa 

gccctcccta 

accgcggaag 

ggagccgcta 

gaaccaccct 

gaccccttgt 

cccgacatta 

ttagtaaaag 

agagagaaga 

ctgagtaaaa 

ctcctgggca 

caatgcgcct 

cgatccaagg 

gagcctaggg 

gcagaacatt 

attggagcca 

aaaaatatag 

cgcgatctgc 

agctggggaa 

catgaagaac 

tgggcagaac 

aaagctgatg 

ggcatccggc 

ccctggaata 

caagacctcc 

tacaatttat 

gatgccttct 

agggacccag 

aaaaattccc 

cgaaaccctc 

aaggagctgt 

cgagtttcgg 

ctccgagggg 

tcgccaacta 

tggattccag 

cctttcaagt 

actgccccca 

agagcgggtg 

gcctatttgt 

attgcagcag 
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tagccctgct 
cccctcatgc 
cgaatgacac 
atcctcaacc 
atggacatcc 
agagctccca 
ggagctgcgg 
tcggcacaaa 
atcgttaata 
atctacaggc 
attctggccc 
ggccacccaa 
aaaacaagtt 
tgaggatggg 
tttttgaaag 
accaattcac 
aagtccccaa 
tcactaatgc 
gagtgtattg 
tgttagtatt 
ctgcccagat 
aggtaatcgg 
ctcagttggg 
tagagaagat 
gaaaagactg 
actccttttg 
gtttccgact 
gtgattagga 
gtaccccacg 
cttgagccac 
gttgacggaa 
gaagaaaacc 
ctgcgccgca 
gtcctgggag 
cctctggatt 
caattgggac 
cccaaatggg 
agctgcacat 
cgggggggca 
ctggaagccc 
gtcaaacgaa 
cagagcaccc 
cgagaacgga 
atacattcca 
ggcagtgaca 
tggctccccc 
taccctccct 
taggaacaga 
ctaaccctaa 
aaggaatagc 
gaaaaaacag 
tacctcagga 
gtcagtatct 
gtgtgtctat 
ccagactctt 
gaaaaagaga 
gagtaggtac 
gcagttatgg 
ttaacttccc 
aaaaaaagag 
ggagtaatca 
agaaaaagcc 
ctcctctcca 
tgcatcctta 



gatcaaagat 

cttagaaagt 

actatcagag 

cagctaccct 

tcgctgaaga 

gttggtacac 

tggtagacgg 

aggctgaact 

tctacactga 

agcgagggct 

tgttagaagc 

aaaggagaaa 

gctcaagggg 

aggtataaca 

aaaaatagaa 

ccacctggga 

cctgaagtca 

aactaaagcc 

ggaggtagac 

tgtagacacc 

tgtggccaag 

gtccgacaat 

catcgattgg 

aaataggacc 

ggtggctctc 

aagttctgta 

ctgaccctgt 

cccagatttg 

ggttccgagt 

ggtggaaggg 

tcgcctcctg 

acgacgacaa 

ggcgccactc 

gtgcttaatg 

tggtggcctg 

ctctcagatc 

atagggagca 

ttttatgttt 

tcagattact 

tcctctaaat 

ggagaaagaa 

tcaggaccat 

aaacaacacc 

ctaagagatc 

ctcgtagggc 

aaaggtcccg 

aggaactaat 

ggaccgtctg 

tatgactcaa 

tcagatcagg 

aaagttgact 

taaagggcac 

ggtgcctccc 

gtctgttttt 

gtatcatgat 

acccgttacc 

aggaaccgct 

atattgatct 

tgtccgaagt 

gactctgtgc 

aagattctat 

aacaaagatg 

ctatagcagg 

ataagttagt 



gctgacaaac 
atcgtgcgac 
cctgctgcta 
tctccctcta 
aactgggacc 
ggacggcagc 
gaaaaaggta 
tatagcgctt 
cagccgatat 
attgacctcg 
catacatgca 
acttggtggc 
ccatgatctt 
taaaagagct 
ttaactcccg 
gttgaaaaaa 
gtggctcaaa 
tacagagaac 
tttactgaag 
ttttcaggat 
aagatccttg 
ggaccagcct 
aaattacact 
ttaaaagaga 
cttcctcttg 
tggaggacct 
cttaccctcc 
ggaccaactg 
tggagacaaa 
accctatttg 
gatccacgcc 
ttggacagtg 
tagacctagg 
aaaaggaaaa 
atctcacgcc 
atactgatct 
catatgggtg 
gccctggtca 
tttgtggtaa 
gggacctaat 
acccctataa 
gcaaagataa 
gtctaagttg 
ctgggttcat 
ccaacaaggt 
actgtaccag 
actctcctca 
gtcagtctag 
tcatgctggt 
acttataata 
ctagcagcag 
ctctgtaatc 
ctagacacag 
aatagttcca 
aatagttctt 
ttaactttgg 
gccttaatta 
tagaactata 
ggtgctgcaa 
tgccttaaaa 
ggctaaactt 
gtttgaaagc 
acctttaatt 
agcttttatt 



tgacaatggg 

agccacctga 

aatgagcgtg 

acaaatgatt 

agaagtgacc 

agtttcctga 

atttgggcaa 

atacaagccc 

gcttttgcta 

gctggtaaag 

cctaaaaagg 

caagggcaac 

aactgaaaaa 

atggtagacc 

aagaaggaat 

tgatgagact 

agattataga 

ctggaaagag 

ttaaacctga 

gggttgaggc 

aagaaatcct 

ttgttgccca 

gtgcttaccg 

ccttgactaa 

cgctcaaaca 

ccccccttaa 

tctttgctta 

aaagcagcct 

gtcttggtca 

gtgttactga 

tcccacgtca 

gcagtcactg 

gaaccttaac 

cattgtatgg 

tgacatctgt 

tagcaaccca 

ttcggggcag 

gggtcagagc 

atggacatgt 

cacggtaaaa 

atatcaagag 

atactgtaac 

gcttaaagga 

tttcacgatc 

ccttataaaa 

ctccaccaac 

taaagcctac 

tccaaggagc 

tatgctatgc 

ctacttcaga 

tttcaggaag 

agacccagaa 

tgtgggcttg 

aagatttctg 

ttttagataa 

cagttctatt 

agaccccccc 

gaacagtcta 

aatagaaggg 

gaagaatgtt 

agagaacgcc 

tggtttaata 

acacttatgc 

agaaaaagga 



acagcaggtg 

cagataagat 

taacctttgc 

ccgtcccagt 

tgactgacca 

tagaggggaa 

gcgctttgcc 

tccgagaggc 

ccgcacaca : t 

acattaaaaa 

tagccatcat 

cgaatggcag 

ggtgatccgc 

agtgatcccc 

aaaatttgta 

aattaaaaat 

ctcctgcaaa 

acaacgggga 

aatgtatggt 

atttcccact 

gccaagattt 

ggtaagtcag 

ccctcaaagc 

attagccatt 

cccctggtcg 

tggaagctgg 

ttcatttaaa 

ataccccagg 

gacggcatcg 

caacccctac 

agagggccgc 

acaatcctct 

cctcatgctc 

gcaaccactg 

aagttagcgg 

ccccctgagg 

ttctaccgag 

aaaaggcttc 

gaaacgacag 

cgaggtagtg 

agtgggtgcg 

cccctacgta 

aataggtggg 

agattgacag 

caggggcccc 

tccacagccc 

cttggcttcc 

ttttttagtt 

ctctagcccc 

tcattctcaa 

agggctttgt 

catccagtct 

caataccggt 

cattttggtt 

atttgaacat 

aggattggga 

aatactatga 

taaccaaatt 

aattagactt 

gtttttatgt 

tagatatacg 

agtccccttg 

ttttgcttac 

taaacgcagt 



accgttgtag 
gacaaatgcc 
gccccctgcc 
acatcaatgt 
accctggcct 
gcaaaaggct 
tgaaggaaca 
taaaggtaag 
ccatggggcc 
caaagaaaaa 
ccactgcccc 
acttagtggc 
ccaaaagccc 
tcccatactt 
aaaggactac 
tcccgatacc 
ccatgtgcat 
gaccatcctg 
aacaagtatc 
aaaacagaga 
gaaatcccta 
ggcttggcca 
tcaggacagg 
gagaccggca 
tttcgggctc 
tggaacatta 
ggccctaaaa 
gaccaccgca 
aaccggtagc 
tgcggtaaaa 
cagtcaagat 
taagcttcgt 
caattcaaca 
cagtccatcc 
caggatcccc 
agcggtgtgt 
ctaatcttag 
aacaaaaatg 
gagatgctta 
gctatgataa 
cttttaaaaa 
taaggttcac 
gttggcgagt 
tgagagaccc 
ccagtcgtac 
aacacagtgg 
ccaccgcccc 
ctaaatagaa 
ccttattata 
tgcctttggg 
ctgggccggg 
agcaaaagcg 
ctcactcctt 
cagcttattc 

c 99gtccgct 
gtagcagctg 
agaactacgt 
agaagaatct 
attattcctt 
tgaccattca 
taaaagagaa 
gctcaccact 
ttttgggccc 
ccaggttatg 
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gtactaaggc 
tatctcctaa 
cagaacccga 
agaatgcatt 
tagaacattt 
caaataatag 
tgacagaacc 
ttttactata 
gtgcatgagt 
ctattaaatc 
tgtcccgggg 
cagaggaaaa 
agacaagatg 
tgtgtggtgg 
tacctctgtc 
cagataggaa 
ctcaatagga 
gctggccagg 
ctctgtgagt 
aggagcctgt 
tttaatcccc 
attggagcta 
agcttatctc 
cctccacaca 
cagctgcaca 
tattagtttg 
ccttaaaaca 
aagagtactt 
agcaatagcc 
atgagcacca 
ttcagccttc 
aaggctaaac 
ttgtagttat 
ccagtcacct 
ataatggtat 
agccatcgca 
attataaatc 
ccatcctaga 
ggacattgag 
tgttggacta 
gggtgtgtgg 
atttcgaggc 
tgtttcaaca 
taatacatgt 
ttctgctagc 
ggtcctcctg 
ggttttggtc 
catctggcaa 
tgtgagattt 
ttttcattcc 
tgtcaatcca 
ctagggtggt 
agaaaatggc 
gaccttgctg 
ccttaggagt 
accaaatgtg 
aaaatagatt 
aactggacaa 
cataatgagt 
ctcatgtagt 
tgcacgctgg 
acagaatagc 
gaggaagagt 
gttaattgac 



aacaatatcg 

aagaagtggg 

cccctcccat 

cctgaactcc 

ttgagataag 

gacatgaccc 

ccctagtgat 

taagcctgta 

ttcgacccca 

ttgccttcta 

cttgagtgct 

atgtattctg 

gaacagctca 

acaggaaaca 

cgtcaggccc 

ccaagtatca 

gtgtgttcca 

agagctgcac 

tcaaggcaag 

ttcaaaacaa 

atgacctcaa 

gtgagatggc 

tggggacaca 

tatgcctccc 

tggtcatgca 

agatcgcata 

caaaaatcaa 

tgtagtaagg 

ttccgagtgc 

gggtacactg 

ctaactgctc 

tactgctcag 

gcagaattca 

cttaaaggtt 

tccaccctaa 

ccagcccatg 

tgttggtgag 

ctgaagacca 

cctcatggac 

cccagacaat 

cacacacctc 

ctggtctata 

aaaccaaaac 

gggtaccagg 

ctagtgaagc 

agctttcctt 

ttaagaggtt 

atgtcttgac 

tgctcctcac 

tgagaactgg 

tcacaccaga 

cactttgctg 

taacgactgc 

cagtctgaag 

attgcttgct 

aagaaatatg 

tattacagag 

atgttggtac 

tagtttgctt 

ctatggtctc 

caatgtaagc* 

tctgtgatga 

ctcagtgact 

atggaaagat 



ggtccttcag 
gaatgaagaa 
ctagagattg 
tcaccctaga 
ggcctcctaa 
cttagttacg 
gtaaacttgt 
aaaaatttgg 
gagctctggt 
cattttgagt 
tgagtgaggg 
cctcatgggt 
acctgctggc 
gagagctgcc 
tacatcttaa 
aaccaccagt 
ggatgtggac 
acctttaatt 
cctgggctat 
acaaaaccaa 
atagccaaac 
tcaacatatg 
cataatggga 
acaaataaga 
tgcctataat 
gcaagcagta 
ttaagcaaca 
acaataccaa 
tcagggatgt 
ctctcagagc 
aggcattcag 
catggctgtg 
agagttatag 
ggatgcacca 
tccccaaaga 
gccttgaact 
gtattccttt 
gcagctctcc 
tcagccgcta 
actatgtaag 
tactcccagc 
gagtgaattc 
cgtaaattca 
ggttgaactc 
tgaggcaggt 
tgagagccta 
ctggacttgg 
aaaggagatc 
cactgcagga 
ggaaaactga 
cctaaccaac 
ggctcatcct 
taagaagcag 
tctttaaaat 
gtcacctatc 
gcatcaatac 
tgcaagggaa 
agttcatgaa 
ttttttttct 
aaactcacca 
atgtgccacc 
ttaacttcaa 
gggtgggcta 
caagtccatt 



gaggttgaaa 
taaaaatttt 
ttcccggaac 
gttcgaaccc 
aacaaccgca 
tagattccct 
actttccctg 
ctggtcgtcg 
ctatgttcca 
acggtctcag 
tctcccttcg 
tcagggggtt 
aggagggtgt 
tacagtctta 
aggatctaca 
ttgtagggga 
aaggagaaca 
ccatcactcg 
acaattctag 
accataaaaa 
agctctcctc 
aaagtccttc 
gaggaccaat 
aaatatatat 
ccagcactcc 
ggctagacag 
ataacagtaa 
aaatgttcct 
ataaatactt 
tgcccagaaa 
aggtcacttc 
gtacttggca 
catcatgaaa 
agtgcctttg 
cttctggcca 
cttgatggtc 
gctgataata 
aggagtcctc 
ctagattcgc 
ccaatcccat 
acgcaagagg 
caggccagcc 
ttctatcagc 
aaagtcttca 
acagccggtt 
caaagaactc 
atctgtagct 
acttgtgttt 
ctgcagatct 
ctttgtattt 
tgccaagagt 
catccttggc 
tctttccttc 
gtgtttgtta 
atattctatc 
ccactgcaaa 
aagaaaaaaa 
gaggttctac 
ctttccttcc 
ctgtaaatca 
aggcctggct 
tcatcaactt 
agggcatgct 
gtgagcagca 



actcgctcta 
tactgaactc 
actcctgaac 
tcccaactaa 
aaatgaaccg 
tggcagaacc 
cccagctctc 
attctcctct 
tgtgctttqt 
tgtctt.cttg 
ggggtctttc 
tccctcagca 
gggaaaggac 
caggcctacc 
gtttattaaa 
taaaaataca 
cagttgttta 
taagagggaa 
attagccaga 
agcatttctg 
agtccaaacc 
ccaaaaatat 
ttctacaagt 
aataaaaaga 
agaggctgag 
ggctacatag 
caaccacaac 
ttaaggacag 
agaaaacttc 
gttgtttatc 
tgtagtagcc 
ttatcatttt 
gtttccacca 
gggtgataaa 
tctcataatg 
ctgtctcagc 
caagcaaatt 
aatgcagact 
aacctattca 
tttaatacac 
cagaggcagg 
agggctacac 
tctatttcct 
tgtttacgta 
ctttactgct 
tttttttctt 
gtcatatcac 
gctgcagtgt 
attctgcctt 
gggctttgaa 
tctgctgact 
ctgcagttta 
cagaaatttt 
tggtagaata 
aggaagcaga 
aagtgtaaat 
tcagccagtt 
aaaatggctg 
ctttcctttc 
ccttgaactt 
cacacatttg 
gacataacca 
cataagggat 
acacgccctg 



agattagagc 
ttcttcaccc 
tcttcacccc 
aaactgttcc 
ggtacattgc 
ccttgtcccc 
cccccttgag 
acaccactag 
tgctgttgtt 
ggtccgcggc 
aaaactactt 
aattcaggga 
aagtgttcat 
accactgacc 
agaacactac 
aggaacacat 
aaagcttaac 
gcaggttcat 
gctacatcgt 
aggctttggg 
aaactgcaaa 
tgacaactgt 
taccctctga 
aagaagtcta 
gcaggaggat 
tgtaaacctg 
aaaaacccaa 
ttctggaatc 
ccctggagaa 
ctggattcat 
aatgtctaaa 
gtgactggtt 
agttcctgat 
ttatattcaa 
taaaatgctg 
ctgtgtttgg 
cttcaagctt 
ggcccagctg 
gacaagccac 
atattcatct 
cagatctctg 
agagaaaacc 
tagagaattc 
gcaagtttcc 
ccttgcaaat 
taggtctcca 
agacattcaa 
cccttctggc 
tttagttgac 
tttgtccatt 
tttgttttct 
tccccaggaa 
agtctatcta 
ttttgagttg 
cgtcccattt 
aaataataaa 
tcagaattgt 
ggggtgggaa 
cttacaaggt 
ctgatccttc 
gtttttcaat 
agaatcgtct 
tatcctgatt 
aacagaagtc 
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ttctgaagta 
cgtgtttatt 
aacatgaact 
tatgctaaga 
agctttggaa 
atctgatttt 
agactcagaa 
tgtcattgtt 
ccaggctaga 
tttgagacag 
ctggcctcga 
gtgccaccac 
tgttcctgcc 
tagtaaacca 
cccaaccatt 
tttttcagct 
ggttctgatt 
tgcggtaagc 
ccaggtttgt 
cattggggca 
gatttggcaa 
caacacaaca 
aagagcaccg 
tcacaaccat 
tgtactcaca 
atgtacctca 
cattttctta 
cttgacacag 
ctctggtggc 
agaagcaaag 
tacatgcatg 
tcattaaaaa 
gctttgaaag 
aaccttttga 
taatggcagc 
tagtgttttt 
acctttgtgt 
ggcttacaca 
ccagctcact 
gggttcatgg 
ccatccttag 
ttagtttgat 
aacgagtttt 
agttatgatc 
tattgcctga 
ttgactggtt 
cctccagact 
tgtgatcatg 
gggtggaaag 
agaaatgtca 
tttatatagg 
aactctaagc 
acccttcaag 
tacgcacaag 
acacaactta 
acatgggaag 
nnnnnnnnnn 
nagagagaga 
gctgagtcag 
cgctgtccca 
gctgcctcat 
cagctgggcc 
gtgctttttt 
aatccttgca 



taagaggaga 
ctctgtctgt 
gcaccctgga 
tattttatcg 
ctgaacctaa 
gatggtaaaa 
aagaagttag 
ctgaatgtaa 
atctgaagga 
ggtttctctg 
actaagaaat 
accaggctag 
aatgtaccaa 
gaataactta 
tttggaattc 
tttttcaagt 
agcatttcaa 
agccacacac 
ttctctatcc 
cgtgcaactc 
tgtcaccctc 
cctctcatct 
actgctcttc 
ctgtaatggg 
tatattaaat 
gagtccaaat 
ttgaggttac 
cagacaagca 
acttcctgcc 
gacctactct 
cacatggtac 
gaaaaatcag 
ccaagaagtg 
caaaactaaa 
agtgggatta 
gtgtttattc 
gggtcctaga 
gtaggtggtc 
ggccttcttt 
ccagtgctct 
ctgtggtcct 
catctctctt 
ctacttctgt 
ctcacagtcc 
ccctgctcgg 
ttgttccaat 
gcagggatta 
tggtgctcag 
attccaagaa 
gggacatttc 
ttaacctgaa 
aactaaggaa 
ccccacccaa 
taacatcata 
aaaagagagg 
gttggaggca 
nnnnnnnnnn 
gcctattatg 
aactagccta 
gatttgcctt 
gctaatcagt 
agagccacta 
ttttttttta 
ataggctgca 



aagcttgatg 
tcttgaccgt 
attgcaaggc 
cagcaataga 
ggctgttata 
tcatgtgttt 
gacttgaatt 
gtgcctcagc 
actcctctat 
tgtagccctg 
ctgcctgcct 
gaactcgtct 
atagtcaatt 
tctagccaaa 
tgtgagggga 
tgcaggtcat 
aatgaggcaa 
tcttgcttgg 
taggcctctc 
aaacacggag 
ctcccttctc 
caagaattgc 
tgaaggttct 
atctgattac 
aaataaatct 
gcttcttcct 
tgcttctctg 
gttctttaaa 
atgactgatg 
ccaaagccgt 
acacacatac 
ggctcagcaa 
gatattaatt 
aaatataaca 
gtttcttttt 
atattccaca 
gattgaactc 
aggcttgtat 
gtttaaaaaa 
ttatcttaaa 
aggagaaaaa 
tgaagatata 
tcaacaaact 
ggtgggtcag 
tgcatccaca 
ttttagtata 
atgttcttga 
cactcttgca 
ccagaggaag 
tagacctctg 

aggggaaaaa 

tgttgggggg 
gctggttatc 
tggatgggca 
ccatgaattt 
gnnnnnnnnn 
nnnnnnnnnn 
tcgttggttg 
gctgagacac 

tgggggccct 

tgaggttgtg 

a gggg a gtct 

aatgttccct 
agacaggcaa 



agagcaagca 
agatgtgatg 
aaacaaacct 
aatgaaactt 
attcactagg 
gtttctggat 
ccgttttaga 
tgaaaataca 
tttttgtttg 
actgtcctgg 
ctgcttccca 
attacacatt 
gattcctgtt 
gtctgcctat 
atccacagat 
gattcagtgg 
gcagagggca 
aggcttagtc 
tcttaaattc 
tgtgactcct 
cctagggtca 
cattagggct 
gagttcaaat 
ctcttctggt 
aaaaaaaaaa 
cccctgacta 
cttctaccct 
gcagggctca 
atctaaggtt 
cctctgacca 
acagaagtaa 
actttccgtg 
atagttattc 
gagtgtattt 
tagattattg 
gcatgtgtgt 
aagtcatcaa 
ctttggaagg 
agaaaaaaaa 
atcaactgca 
catacagttg 
atcagctcac 
acccaagctg 
aagcctaagc 
ttcacatcct 
tgtgctgctg 
ctgccacaga 
cccaaggctc 
aagaaagtca 
aagtctcaag 
attcttacag 
ggggtagtct 
caaaacaaac 
gattgcattt 
aagagagagc 
nnnnnnnnnn 
nnnnnnnnnn 
cttctaatca 
tgtccaccca 
tgaaatgaaa 
aaaatagccc 
ggtcctttgg 
gctttaggct 
gaatatcatc 



ggcaagcaag 
gctgtcttgg 
tttcctcttc 
agaacaggcc 
atagggacca 
atgatagatt 
gacattccag 
aagagtacag 
cttgtttgtt 
aactcacttt 
agtgctggga 
aacacccctc 
tatttaccac 
tagccatatt 
gctgtagacc 
gtcatgaaat 
tattgtcaca 
agtttctggc 
caaacatagt 
ttccccatct 
gttttacctc 
ggtgagatgg 
cccagcaacc 
gtgtctgaag 
aaaaaaagaa 
cactcacgct 
ggctccttct 
ggaccagtga 
aagcctagaa 
ccatgtgtaa 
aaagagattt 
tagaaaacta 
attatagcag 
tttttgtaat 
ttattatttt 
ggaattggat 
gcttgcacag 
caagcatttt 
agtccttttt 
aacttttatc 
gatggcttta 
atcacactca 
agcagctcca 
gggcgtggct 
ttcctggtga 
aaacaatctt 
gcactaatat 
ggggaacatg 
gaggtgagac 
aacaaggcct 
gggtccaacg 
tccccaggga 
tggtcagtcc 
aggaatacac 
aaagcaaagt 
nnnnnnnnnn 
nnnnnnnnnn 
ttagaaaacc 
ctgtcccaga 
gtcaccagca 
tgcagtggtt 
agcagagtta 
cagtgctgtg 
cctgttttgc 



ccaggatcca 
cttcctggga 
caagttgctt 
cataactgcc 
ctggaagtga 
tatcaatttg 
agaaaactga 
ggaagaaagc 
tggttggttt 
gtagaccagg 
ttaaaggcgt 
tttaattaac 
atgtttctgt 
ttcatcagtt 
gctttagaca 
taatttagtg 
gcacagcaca 
tctaaacgcc 
tagacattac 
gcggttccca 
tcacactcca 
ctcagaggtt 
acatggtggc 
acagctacag 
ttgccattaa 
ggcctgagtc 
gctgcctatc 
gactgatcgg 
cccacgaggt 
actgcacatg 
aaattgaaaa 
gagtacttag 
agatttctaa 
gtaagtttac 
tattaattat 
ttctgcttcc 
taggtggtca 
acttcctgtg 
tgtttaatta 
tggtaaaaag 
tcctgcaggc 
agcctctgcc 
aacaacagcc 
acctcgctgc 
gtgtggttct 
tttgcctctg 
ttactgaaca 
gaggaagagg 
tgcatctcct 
gaaagtctta 
ttagacaaag 
acactcctct 
tgaagccata 
acatacacac 
gggaaggggt 
nnnnnnnnnn 
nnnnnnnnnn 
actctcttag 
gcaaggccat 
ggctccggaa 
cctgggcctg 
acagtcatca 
cgctacttct 
cctcaggcaa 
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attctgaagt 

attctcactc 

ccacctccca 

gagaatgaag 

tgaacataaa 

gagaaatgtc 

ccccaaacag 

actcatgttc 

gcaccaggag 

tcccacagtg 

cctgggcttg 

gggacgtaga 

aatgccagca 

acagactgag 

agatagtggg 

acaaaaatgg 

tttgtcacct 

gttcccagca 

ctacaaagta 

caatcaatcc 

cttccagagg 

ctccaatccc 

agcatacaat 

gtacagaaag 

tttcctgtct 

tggtgtagtc 

aacaggatca 

ccccggctgt 

caactgagga 

tctggaattt 

gcagaactgc 

gggtctgtgc 

gaaaaaaata 

gaggcaggac 

cctctgatca 

tccattagag 

agtatggtag 

agaagttcag 

gttttgtctt 

tttgctgcca 

atgacgcttg 

actaagtgga 

tcgttgggaa 

gaaagagcca 

ccaaagctgc 

cacaggaggc 

atgaaacgtc 

gggctggcag 

tgcttttcca 

gctggacttt 

ctggctctgc 

gagaggtttg 

ctcaaaggca 

ccaagcccta 

agggggctgg 

attccgttcc 

gaatctgttg 

cgcaggcaaa 

tcatgccagc 

cagcctgagc 

aaaacaaaaa 

caatggatcc 

ttctccatag 

agagttgagg 



ctggcaaatg 
tacttctgcc 
tgctcagggg 
tagaagccag 
gttcagttac 
aagtgctaca 
atacaaacct 
cagcccagtt 
gaaatataat 
actgtggata 
agatctagca 
ggcagaagga 
tttgagaggc 
ttccaagaca 
agagaattca 
agcatgaatg 
acataccaat 
tctgggaggt 
agttctagga 
atcagtcaat 
acctgggttt 
aagggacatg 
atatatatgg 
agatcaacta 
gacatctgac 
cctgtgaccc 
aatcccacag 
ctgtgcagca 
gtgcctcagt 
gtcatttgat 
agaggaaaag 
tttctcttac 
gatgcatcct 
tcatcatgca 
aatcagtttc 
gcctgcatgc 
tatctgttta 
agttatcctt 
aacaaaacga 
agtttgatga 
aactccacgc 
taaatgttaa 
gggaggggaa 
gtgtgccacc 
ccagtgtgcc 
cagcaggagc 
tttccctttc 
ggctcatcca 
aggtcttagt 
actctgctgt 
caaccaccat 
gtctggtcct 
agtgctggtc 
gaggacagac 
agaaatagct 
taacattttc 
ccctctctgg 
acactcatac 
gatgctctca 
tatagaatga 
caaatctgtg 
ctagagctct 
accacatttc 
tttacatggt 



aaagatgtgg 
tgtgccacct 
ctaggaagtg 
cttcaaatct 
atttggttta 
attcagtgga 
tcatcaaagt 
ggctccttct 
agatgtcctt 
cagcccagtt 
ccctaaggca 
tcagttcaag 
agaaacagat 
gccaaggcta 
aggttatctt 
cttgcaactg 
tatacaaatc 
agagatgggc 
cagccaaccc 
catgggctgg 
gattcccagc 
acaacttcta 
gtaaaatgct 
taatgaaaca 
ttcttgcgtg 
aggattaggc 
tgctccactg 
ccgcccagat 
gctcctcagg 
atttccagac 
gggcactact 
aaatcattgc 
ca cagtacag 
gcctctttcc 
aggcctggaa 
acgaagctct 
gcatggtgtg 
gcacttacag 
aacaagaggg 
cctgagtttg 
acatgccaca 
aaataaataa 
aaggcaggcg 
aaagctgccc 
accaaagctg 
agccaggttc 
ttcagaccac 
ctctagtgtg 
gtctaattaa 
aaggactttg 
gtcagtgggt 
tgtctaatgc 
cagcatgtgt 
atgctttaaa 
cagcagttaa 
atggtggctc 
ttttctcagg 
atatgaaaca 
gcactcagat 
gatgctgtct 
ggcttaatca 
gctttgcccc 
tattttgggg 
catttttttt 



gatttgaaca 

tcctctcatg 

gggagaagat 

aaactaagca 

aaaaaaaaaa 

tgtggatgaa 

ctcttccaaa 

catgtgagct 

tttaaggggg 

agtagagttc 

tagtggtaca 

gtcagatgag 

gaatttgtga 

cacagagaaa 

ggactgcata 

tggacaatat 

agattcagct 

agagctctat 

tacagagaga 

agagatagct 

acccacatgg 

ctggtctctt 

atatatataa 

accataacac 

acccagtgcc 

aaccactgcc 

taccccgcca 

gcttggtgtc 

gcagtgtgca 

tccagctgac 

gtgactgtta 

ggcctctctt 

gatggcaggt 

ctcaactaca 

agaaacggcc 

ggatttgatc 

gaggtatacc 

taagttcaaa 

gccggggaga 

gtcccttgag 

gcccacgcat 

atcaaaggaa 

agggagatag 

agtgtgccac 

cccagacttg 

cacctcagag 

agcagtgaca 

cctgtggcca 

ggttagcagc 

ggcattgttc 

cttcagaggt 

aaaatgcctg 

ggaatcctaa 

aaaaagtcat 

gagcactagc 

aaagctgtct 

caccaagaac 

ttttttataa 

ggcagaggca 

caaaaagaaa 

ttcctagcca 

ctggtctgga 

accactctgg 

gttgttacag 



cagacttgtt 
cacggggagg 
ggatgtcctc 
atgttttatt 
tccctacaca 
acaatcaaaa 
ggctgggtct 
ccgacttcca 
gtggggtctg 
tcgcctagca 
tgcccatgac 
gggtggggag 
gttcaaggcc 
ccctgtcttg 
agactttgat 
tgggttcata 
gggcccactg 
gactttaagg 
tacactactt 
cagtgatcaa 
cagctcacag 
tggtcaacag 
aaatcagatt 
atactgtttt 
tccacaatga 
cctgggaagc 
gaggacgtga 
actgagtgac 
attgaaactt 
cttggataac 
tcccctgcat 
cacttccctc 
acgaggcggg 
ccgccgcccc 
actcaggctg 
ttcagcacgg 
tatctctgca 
gctagcttgg 
tgcttcgctg 
cttatggtgg 
gaatgtgtat 
tggccactca 
ataaccctga 
caaagctgcc 
attacagatt 
gtggagccac 
gctgtcctgc 
gaacaggcct 
caaattggag 
cattccgtga 
agaagaactc 
gggcaccagg 
gttcaatacc 
gctttaaaaa 
tgctctttca 
ataattcaag 
acatgtggtg 
acctgctcgg 

gggggatttt 

aaaaaaacaa 
gaaggagctg 
gagcttgctc 
gcctgcacat 
taatgagagc 



tggccaaaga 

ggaggggagc 

aaagcagggt 

tccatttccc 

actggttctt 

tgttgaacac 

gaaaagagcg 

aagactgctt 

tctgacaacc 

agcgtgcggc 

cacagcactt 

gcattccttt 

agcctggtct 

tcaggaaaaa 

tccaaaataa 

catattctgt 

gtgcatgttt 

ctagcctggt 

ctaaatcaat 

tagtgccatt 

atgtctgtaa 

gcatgcacgc 

cacaaatcaa 

aaaagctacc 

gacaagcaga 

cattcctagt 

agcctccctt 

catcggagcc 

gcatgtgatt 

agcacagagg 

gattatagaa 

ctttgaagtc 

ttccgggact 

tgcagagttc 

gggatgtggt 

gcataagccc 

cagggagacc 

gcaacatgaa 

tgctgagtca 

aaggagagaa 

acacacacac 

aaatctacca 

tatgaacacg 

cagtgtgcca 

tggccaggga 

aaacctggaa 

agagtctgga 

cagtcacagg 

agagaagggt 

tcaaatacca 

atcctttttt 

ttaatgtcaa 

catcagagcc 

aattctgttg 

gaggacccag 

tcctagaggg 

caaacataca 

atgtggtggc 

gtgttgagcc 

aaaaaaacaa 

gctccagcaa 

tagaaaggaa 

ggaaaaatgg 

tttgaaatga 
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tcacaaaagg 
aaccgaatcc 
ttatggcaac 
tgcaaagtct 
gtcataaata 
ctaatgagga 
ttctaatata 
taaaatgata 
atccaaagca 
tctcatccgc 
taggacagac 
tattaggctt 
aaaaagcaac 
ttgcataatg 
gaacccggca 
ttgcagggag 
gacctgactt 
ggattgctct 
agcatggatc 
tgcataccag 
aaccacccaa 
ggattcataa 
tacccaaaat 
tttggaaaac 
gcaccaagac 
gccgctcacc 
cacagtcgca 
tttgtgaaac 
gtaattcaaa 
tgcctactga 
taacttccag 
gacccccatc 

agggtcttgt 

gcattattta 
catttcctct 
tatccttgat 
tttctgggtt 
accaacatga 
acaggcatgc 
cagagtaatt 
aatgagttct 
gggtgatgtg 
tggggagctg 
ggtacactcc 
catggcgggt 
aagtttgaca 
ccacaccacc 
caacaagaga 
ttgtgaggaa 
ttagtcggtg 
gaacaccagc 
ccgaattcca 
tggcgctcag 
tatcccctgt 
ctcccctcct 
gaccccctcc 
tttatcaagt 
gtttctattt 
actgagggtc 
accacgcacc 
ctcgcccaaa 
aaaagctacc 
acagggcggg 
tcaatattta 



aaaataggaa 
taaaaggaat 
tgacatttaa 
tgcaggaact 
tagcttattc 
aatatttcca 
atttatcaat 
aatagcaata 
gtgtagctat 
agatacagtc 
gaaggggggc 
tgttataaga 
tacttggggt 
aacaattttc 
caggctcata 
agaaagcagt 
taaacagttt 
caaagactct 
tccggccctt 
gaggactgaa 
actcatttcc 
aataaggcag 
ggaactgcat 
tggcacagct 
ccagggctga 
aatcccggct 
ctatgctctc 
tgttttcgaa 
actaattacc 
ttgtgcgctg 
gctaaatcag 
cctgacttcc 
agttctggct 
atttaggcta 
tcttcaattc 
cgtcccaccc 
tgcaaggtgg 
gcggcgcaca 
acacaccggc 
aggcctaggc 
cttcctttgg 
gctgaaacca 
cagtcggaag 
cagaaataga 
ttttagacat 
gagtgatatt 
gccacccaag 
gggagccggt 
ttgagtgtac 
aagcaggggt 
tcgaggaaca 
cagggcttaa 
caatctcatt 
atctgactca 
cctgcctatg 
actggactct 
gttctgttgg 
cctttacatt 
ctggtagttt 
aagcaaaaat 
gccagattta 
acttggcaga 
atggggcggg 
acctaactgg 



aatatgcctc 
gttcaagtaa 
aagtaacagg 
tcatgctaaa 
ccaaatcctt 
caaaaaccca 
ttaaacacat 
gtggtgaggg 
atttaaatac 
tagactcaga 
aataagaaat 
ttacatcaaa 
ttgcgcctgg 
attcaatcag 
gtttttatct 
cgggaggcat 
ctctctggag 
cggactcctg 
ctcactgctt 
gtttggctcc 
gacttggttg 
aggcagagaa 
tttcatgcac 
gacattggct 
ggcctaatcc 
ggagccgcag 
ccgtggggag 
aacctgggat 
ctgagcacat 
cagctcactc 
gcagccacag 
aggcgaggag 
gccgaattat 
gaatgacctc 
gtggccccca 
cacttggcta 
tcaggtccac 
gccgccatca 
agacatgtgc 
agttcctgaa 
gggtgctaaa 
tttctgcagc 
aggcacaatt 
cagatgaggg 
ttaggaatat 
accgggttta 
ttatcacatg 
ctatctttgt 
aacactccaa 
aactgcgctc 
ccaaaaaggc 
gtattaagcc 
tattatttct 
cattctcctg 
catgctttgc 
ctcaccccac 
tgagccttct 
tcttgtagcg 
tagagttgga 
tgcacccacc 
ttgctcctgc 
agatcaagtc 
tagggtggag 
taatttgctg 



ctaaaaagag 

aaaaaaaaaa 

atttgggcaa 

attatgcttt 

agaatttcta 

cttaccatta 

tgcataatgt 

gtggggggca 

ctcaatccat 

gcagcatatc 

gacaggaaac 

gcagttcata 

gggctgcctc 

gatctcagca 

cccagctcca 

tgtcctagtg 

gttgaaaggg 

ttacaggcaa 

tcttgaatca 

cggtgaccag 

gtcaaatttt 

acggagggtg 

aatagaaaac 

agaggaagga 

gcctttatcc 

aagagctctc 

gccgctccgg ■ 

gatcatttaa 

ttgaaacatt 

tggtgtttct 

gcgctgcctc 

gcggcccgtt 

tgctcttatc 

tttccctccc 

ttttctgatt 

catcttcatc 

cagtcatcct 

ctgagagagc 

acatgcgctt 

gcaaattcat 

ccagcatgcc 

caagcctgtg 

ctgggatcaa 

ctgcctcctt 

aaaagtaggt 

actagagcca 

agccataatg 

ccaaaggaaa 

taacatcccc 

gagcagtctg 

cgattaatga 

ccaaagaaat 

ctacaaagat 

ctcagtaagc 

gtcttcacaa 

atacttggaa 

cttgcattaa 

tctcacatgg 

gttagatgac 

cttccctcag 

tgtaaagtgt 

agaagtgcag 

cgaataattg 

tgacaattac 



ccgaggcaaa 
atatggctca 
gtttctttct 
aattttaatt 
ccctgcaagg 
agaaccccca 
gccactctgt 
aaaaccagga 
tgtagaggaa 
cttgac.tgta 
ttcagaagaa 
tgttttaatc 
tgtgtactga 
gagatagctc 
cctgctggag 
gctgtgtacc 
gctctgtaag 
gtaaggtcct 
gggatttaga 
aggacaaggt 
caagtttccc 
tgtgtgtgtg 
ttaaagactg 
acggccaggg 
gagggtttag 
ttcacttggc 
gagggggagc 
atgtttaaaa 
tatgccatca 
ataaactgct 
cagccctggg 
tctccagaga 
cgtgttcata 
gagtcttcct 
ggtccaaata 
tgggagccaa 
aaggtgtgtg 
acgtgccctg 
tccccagcaa 
ttcccccttt 
agtggctaga 
ggcagaagct 
gaaatgagca 
attagcgctt 
tggattccca 
ttaagagact 
caagagaatt 
tgagcagccc 
tgcaggattg 
cctgtgtacc 
caaaggacac 
caaggtctag 
ccaacactca 
catcctggtt 
cgacagctgg 
ctactccttg 
agctgtgaga 
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ccagcaacat 
atgttcctgg 
atcttctcta 
gctagcaggt 
aagctccaag 
gccatgaagg 



ttattatagc 
tgcgaagttg 
ccaccctctc 
agccaaatag 
agctgaacta 
tccgatattt 
agcattccat 
gattaaacat 
aacacactgt 
agggtattaa 
ataaaatttc 
tggggaggaa 
accagacagt 
ctactcaaag 
aaaccttgta 
taaagttaca 
ataccagagt 
agcagatggt 
aattgctatt 
cattgtttaa 
agcagtctaa 
tgtgtgtgtg 
aaccaatcat 
cgagccagct 
tgaggctccc 
tcagtcccag 
gacatcaagc 
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tcttggatcc 
tcagcgattt 
ttggtggaga 
gccgtttgtc 
attctcatct 
ccctcattcc 
tatagacaaa 
tgtggtgagt 
agagaggtag 
cagctcaggc 
accctgcttg 
tccagaataa 
agcctgagat 
aaccttgggc 
ctggtttata 
tgaagatgcc 
cagtcagctg 
cttcattatc 
ttcattccat 
agcgtgaagc 
cctctgcgat 
tggcttgcaa 
tcatagaggc 
gccattctcc 
atttcccagg 
tgaaacgggc 
taatttgcaa 
gaactacttg 
aggaaccaca 
ggttagatat 
gccttccccc 
catcttataa 
agcctcactt 
gacggtgagg 
agttaccagc 
gaacgctgcg 
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actatgcaag 

ttaataacac 

cagcatatgg 

ttcaattcgt 

catttgactg 

gcttcccaca 

aactttaata 

gagcgtgaga 

tgagacaaaa 

atgggagttt 

ttagtggttc 

tatggagatg 

gtagaagtgg 

tgcaccccaa 

aaattttagg 

tcgagttgag 

ttttttttta 

tggctccctg 

aaccaccgaa 

tacaagctct 

aattaccgac 

ttaaaagttt 

attagggact 

aattatacat 

aggttatgaa 

cagctccaag 

ggatcactta 

ccatcttccc 

gggtctctcc 

tttttttttt 

aactcacttt 

ctcctgagtg 

tgaagttccc 

tccggctcag 

aacccccaca 

gcaaggctac 

tgtcggaggg 

tgtcctttgc 

gccttggtgg 

atgcctccct 

atttacctgc 

agagaccctc 

gcccatcaga 

gtgattcagc 

gccactttta 

gggtatataa 

cctcacgcct 

actaagtgtg 

ggtgcctggt 

gttatgtcac 

tgctctcgta 

gaactgtaag 

aaagtaactt 

ttgtgtctgt 

ccactgacat 

gtcattcaga 

cctagggtac 

ttgtggatta 

acatcaacag 

cttgaacaaa 

tccggccttt 

ttatttcatt 

ctaaggttac 

tgagatgaag 



aatgttgctc 

acagaatgag 

ctgtgcacat 

cctcagatag 

acaggcatct 

gcccggcttt 

aggattttct 

caggaatggg 

gaatattaat 

tgaggcatgg 

atttatggag 

tagcttataa 

tgggggtggg 

gcctatgttt 

ccaatagaaa 

cagtggtgtc 

aagtccactt 

cagcgagagg 

agggaactgc 

gagcaagcta 

tgggtaatcc 

acacaatcac 

ttttgggtgg 

tgagatttgt 

atggccaggt 

tcataagctc 

ggctacccac 

aacatggctt 

aatgtggact 

tntttggttt 

gtagaccagg 

ctggaattaa 

agtgacctgg 

aattttgggg 

tfcgggagagg 

acagcaagaa 

tgtgaggcct 

tccattactt 

ctgcaatctg 

ccccaccctc 

cagcctttgg 

acagaagcac 

agaacccagg 

cccataaatt 

ttgaaataga 

gagataatcc 

gcctgcaagg 

gaggtatgga 

tggtggcacc 

tgggtatggg 

aggttcctgc 

ccacttactc 

gtgtggcagc 

ctgtctgtct 

gtctcagtcc 

ctggttttga 

tcgaccagac 

tttgtgtata 

tatttagcat 

tgaaaatcaa 

taatacagtt 

gcttaaccag 

ccaaccacgc 

aacaatcaac 



tctaattaag 

ccttggctcc 

aggccgtgag 

agcccagaga 

tcccggaaag 

caaccaggaa 

cacagaaaga 

gacacaagcc 

atcttggcta 

ctaagctggg 

ggcttgctgc 

aactactcag 

gtggggcaag 

agggttgatc 

tgacctctgc 

aacggagaga 

gcttgggact 

ctccagtttt 

acagcacaca 

tctgaagaag 

actagggagc 

ttttgagttg 

tttactcgag 

caactgctgg 

aaggttgggt 

aaacaagtct 

ggacaagcac 

ctatgttgct 

caaatcatga 

ttcgagacag 

ctggcctcgg 

aggtgtatgc 

atgtcaactg 

ggacaaggta 

cagaggcaga 

gctgtctcaa 

tgggggccag 

aatcagaatc 

catttaagag 

cctgcatccc 

tgggacacag 

ctgaatgtac 

cttatgctaa 

tacacatccg 

aaagatgcct 

catgttgtct 

tagccaccaa 

taaaaaaaaa 

gtttggggag 

ctttgagagt 

caccatgttt 

tcaggtctct 

cagctaagca 

gtcattccat 

tcttggtatc 

gcttgcaatc 

tgggagtagc 

aagatctgat 

caacttaata 

acagcagtac 

cctcgtgttg 

agttaactgg 

taggaaggag 

ataacaggga 



agggctctgc 

gggagctaaa 

tgatgcagcc 

gcgcggctca 

cctgcgcgtg 

ggcttggcgt 

aaagtccatg 

aacaccccaa 

tgaaggagga 

caaaccattt 

ccggagagcc 

attttaaaca 

agaacaattg 

agcttcccga 

accacggctg 

gcagaggaaa 

cacctgaagg 

cccaggcacc 

agttaaatat 

ctgtcatttt 

aggtagtttt 

actataagta 

gctgcaacca 

ggagtagtgg 

gcttccaagt 

tcaaggggcc 

ttctcataca 

tcaacaacca 

ctacagcntg 

ggtttctcca 

actcagaaac 

taccacgccc 

aagttggatt 

ccttgattta 

ggcagaggca 

aaccaaagac 

aacaaggtgg 

gccatcacag 

ctaagtggga 

tccatttacc 

gcttagtggc 

tttcagcgct 

ggagccagaa 

tacagccaaa 

attctggagt 

ttgatgtggt 

gccacaccca 

tgtcccaaga 

gcagtggacc 

ttgtagcttt 

cctctgccat 

ttcagtcctg 

agggctgtgg 

ttatatagtc 

atatattcac 

ctcctgcctc 

aggttctgtt 

ggcccgaccg 

aactcatttg 

ctgttctcct 

cggtgacccc 

aagggttaat 

aggaaagggc 

cagagcagtc 



atttcctagt 

ggttccatta 

cagttaagcc 

ggccctcacg 

cctacactgc 

gggtctgatc 

ggaacaaatc 

ttgctaggct 

tggtgccatc 

tctttttttt 

catcagaaga 

aacagtgcag 

catctgcaga 

ggcaagccca 

actgaagcta 

gtccaatcag 

cagggcattg 

agcccatcgg 

aggctgggtt 

taatgacggc 

ggaagaacag 

tttcacacga 

acaatgagtg 

agggtcctgg 

ctcaaatata 

tggagagtta 

aggaccggct 

gggcagggtg 

gggttttttt 

tatagccctg 

ccgcctgcct 

ggccgagtcc 

ttactgtgat 

actgggcact 

gaaagttggt 

atctttcttg 

tcaaggaaga 

atatagctag 

taaactcagg 

tgtttccagg 

ttagcgctgc 

gcagagcacg 

agtagaagca 

cccacttgaa 

gctaagtggt 

gctagggaga 

caacctctat 

cctcacgaat 

atttggtctt 

gctccccttc 

tatggacacc 

gagtcttatc 

ccgactgctt 

ctgagaattg 

ttaagacaag 

tgcctcaagg 

ctcttagctt 

actcccttcc 

gtaaagccat 

agagcagcgg 

cccccccccc 

aataaaacca 

cactcgcaca 

cttgtaacaa 



cacccgcact 
ggagcacggg 
cgctaacacc 
ccacgagccc 
aaatggacct 
cttcaagagt 
ctcctcttaa 
aactctgata 
ttctgaattg 
ctcttcttaa 
gagctcgctt 
gaggccagag 
aggctagccc 
gaagcctcta 
taaataagcc 
agcttcattt 
agtagagcct 
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ttttctcaag 
taatgcagaa 
ctcctaaggc 
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tttttttttt 
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acacgactgt 
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ccactgactc 
gagattttaa 
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gatctgctta 
tcggggcacc 
cacggctcag 
gctggcaaga 
gtgatccaga 
acaggagggt 
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cagttaactc 
tggtcctcta 
atagcaatga 
gggattatgg 
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atctcattaa 
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gagagagagg 

aacacaagtc 

attaattatg 

tatttatact 

caatttgacc 

aatggaaggc 

tgccttttac 

tagagcccct 

attgagctct 

gtccaagaga 

aatccctcac 

tttttgagta 

ccacgggttt 

ggattggttc 

tgaccgaatg 

ccatctctaa 

aacgctttgt 

gaatcaagtg 

tactaaacca 

aaagatgcca 

ggagacagaa 

aagctaccca 

cttgtatttc 

acatagggaa 

agggtgcatc 

gtgagggccg 

tgggtggccc 

gtttccaagt 

ccagtatcaa 

acacacagag 

ggctgccaca 

cagcagcagc 

agcaactgaa 

tgcacacatc 

gcagagatgt 

ttaggaacac 

gttttcttcc 

taaacatcaa 

ccttttgtgt 

atctccaggc 

tatttattat 

aagtaggatc 

tcttaaattc 

ggtaactgat 

tgtctgcagg 

caagtaactg 

ttccccaagg 

taatatcaga 

cagatattaa 

caaccatgca 

cacctgggtt 

cactggaaaa 

acttgtgaca 

aggcaggagg 

acacacagcc 

tgggttgcca 

aagccattct 

tcacatgttc 

cttcacgtcc 

tttgtgagtg 

tcaatgagtc 

tcctggagtt 

tcagacctag 

agcagacagg 



ctgagtttct 
aatataaacc 
tagtcaatag 
tttcgaagac 
ctgtagttca 
ataaaacact 
aaagagactg 
ccctcccctc 
aagtgataga 
attcaaagaa 
aggcgttaac 
gggaaatctg 
ctacatgcag 
agcgacctca 
ctgtgagcag 
cctgcaattt 
aagcaaggtc 
taactcaatc 
gtgagcctca 
acagctaagg 
acaggaggat 
cttgaagact 
tagtgtatga 
gatggtgcct 
gatctaaatc 
ttccagagat 

aggggtccag 

attaggacca 
gcttcaggcc 
cactagcatt 
gtgagagtcc 
catttctcca 
acatggggtg 
acaggccagg 
gcagcagatg 
actgagcagg 
tcgatcacgg 
tccaaccccg 
tttcattcat 
ctcagtctaa 
gaaggctaat 
cacaaagata 
tatttaacat 
gcaaactgaa 
ctttgtcgga 
accccactat 
ggacagcaag 
gtcatctgac 
cgataggcct 
gtaactctgc 
ctcccctact 
gcacactcca 
taatttctgt 

aggggactgc 

ctctgagctc 
gcctccccta 
atttctgttc 
ttcctgatgt 
atcacagtgg 
gcagagcctg 
cagcaggaag 
gggccagatc 
catcctcccg 
taactcaccc 



acttctataa 
tttagacatg 
ccatgggttt 
aagccactca 
aaaaaagtca 
ggaaatatgg 
gagggaattc 
agctgcttaa 
tgagaatcag 
ctgtgggcca 
agtggaacca 
ttaaacatcg 
ctgtgctaga 
gaggacattc 
gagatatcaa 
gtcaatctca 
agatagaggc 
cctatctcct 
agcaaagcct 
gccagggatg 
caggacttca 
gtccccaaca 
gacgaaggaa 
caaaaacagg 
cacatacctg 
ttaataaaga 
actgaataaa 
cagcctggcc 
agcggcaggg 
cacctcctgg 
cctaccttcc 
gtaacttgct 
ctgtgacccc 
cctgaagatg 
ccaagaggtg 
catgctcttg 
tctccacgtt 

aggggccaga 

ttaaagatgc 
tctctgagtc 
taattgcttt 
tactgtttat 
agctttaata 
gccatgctct 
tttactaaga 
gtggataatg 
ccagtgctta 
tcagtctatt 
ttgattaata 
aaaacagacc 
cttctccccc 
ggttacataa 
ttgaagataa 
tgccacacta 
actgtctttg 
gaggatcttc 
ttctctcttc 
tcgcgatcag 
ccttaattcg 
gcgtgcagcg 
aaggaaagtt 
ctcttccaca 
gagaagcctc 
taaagcatca 



ataaaccctt 

gggctgccaa 

cattagcgta 

gggaaaaaat 

gaacagcaca 

acagaaatca 

agaaactatt 

cactggggtt 

aacaggaaca 

gaatctactt 

atttccaagg 

tcccacgagg 

tctgctgaag 

ttgttcacta 

aggccggcta 

gacagcaatg 

tccataaaaa 

gagattaggg 

gtctgttctc 

tagtgcaggg 

aggtagtttg 

aaacaaacaa 

gaaagctcaa 

acagccgagg 

gatttaaagt 

cccaccctga 

gggaaaatgg 
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cactggacag 

tctcttcttg 

tcaccgttaa 

ttccacagat 
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ctgggggact 

ggctgcagca 

ccgaatgaaa 

tcagagttgg 

tcatcggtgt 

attccagggt 

tgtaatgagt 

ccagttacaa 
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gtagcagcct 

ttctgttatc 

aagtaaatta 

tcagccgtcc 

aaacctatca 

attctacctt 
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cttatcagtg 
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cgttctggcc 
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tgtctttcta 

ctttcatcta 



ggcaggcgga 

acttcacttt 

ttaaatacca 

ggtgggggga 
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gtggtggacg 

gtgtttttga 
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gagcccagct 

tggccggtga 
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ctggactgaa 
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ttgttcaggg 
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agcagacaga 

aatatctggg 
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gaggaagttc 

gctggccagc 
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ggatggtaca 

actgttatag 

aaagccatgg 
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tcctgggctc 
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taacatcttt 
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ggactgaggc 
tccatcccag 
ccagcctacc 
ggtgataacg 
cacttgtctc 
cacaagtcta 
ccgctgttta 
ctccatggct 
aagcttgcaa 
ctagatagat 
ctggacatca 
tttgtttaat 
annnnnnnnn 
nnnnnnnnnn 
ctcctgctgg 
cacagtgtca 
gaactcgagt 
gattccctgg 
tgtaccgatg 
gagaacagca 
gagggggggt 
aagttccaca 
agaaacagat 
ttttctcctt 
tatatctaag 
cattatggat 
agaacagtca 
tgagttaggt 
caatgatact 
ttgctggtct 
gttatgtacc 
gtttctctgg 
ctcagaaatc 
cccagtttaa 
ctggatgttc 
cctctgcctc 
ttttttaatg 
cccagagcgg 
ttttaaaggc 
cacacccgta 
tgctggtctg 
cagtttgagc 
gcagagaagc 
aacaggaaat 
cttctctact 
aaattaaact 
gcctggttct 
gcctcactag 
tggcctgttc 
cagaaccaaa 
ctcttcagca 
acgtgcggtt 
tggagctcag 
tgcggagggg 
ctgcacccta 
cactgcgagg 
aatggcccag 
cccttcctca 
tgatgcccgg 
gcgttggctg 
gggtgaagca 
gactcgagtg 
cgtgcccttt 
ggctgcccag 



ttctgctcca 
gaagacagaa 
tatgttcatt 
cagaatgcac 
cttgaactaa 
gctgggggcc 
caagtatggt 
ttaggttcag 
caacagctcc 
ggggcaggac 
ggattagttt 
attctgagtt 
nnnnnnnnnn 
nnnnnnnnnn 
ggttacaggc 
agctggatat 
tggtgaaata 
tcacatgctt 
catgcttgca 
agtgaacaaa 
gtaaagctct 
tgtaaggtca 
gcagtctgtc 
cctttctttt 
tacactgtag 
ggttgtgagc 
gtgctcttaa 
tttgtgtagc 
ctgcctgatc 
ggaactaaca 
accacacctg 
gtagtcctgg 
cgcctgcctc 
gggttttttt 
tggaactcac 
ccaagtgctg 
tgtgtatttg 
aagtgttctt 
accaaactct 
cacacccgta 
gcctgtgtat 
acaggctctc 
atctgcaagg 
ctcaggagag 
tcccagaaca 
ccttggtttg 
gctgctggca 
atccgcgtct 
aggatgatct 
cgagcctcgg 
ggggtgggga 
gtttccgtgt 
cgctggtgct 
taagaaaacc 
ccccccgctc 
agggccgtgg 
gggagcagaa 
ggacctgaca 
cagctggccc 
gctagtctcc 
gaagtcagga 
gaggtgcatg 
gaggaggggt 
aattaccgga 



gctctgggca 
aatccctaca 
taatacaaca 
aaactctgcg 
acagtgtctt 
ttcatgcttc 
taggagcccg 
agtcattctc 
aacacagaaa 
ggactgtcca 
aaacacttgt 
ttagaaccta 
nnnnnnnnnn 
nnnnnnnnnn 
ttgagccacc 
gatggcacat 
gtccaccagg 
taaggaaaga 
gccacccaca 
caaacaagcc 
tggcaggtgg 
tgtgatcgag 
atatctaaac 
ttctctccct 
ctgtcttcaa 
catcatgtgg 
ccgctgagcc 
cctgggtggc 
tgtcttaatc 
gagatctgcc 
gagtttaagg 
ctgtcctgga 
tgcctcccaa 
ttgtttgttt 
tctgtagagc 
ggattaaagg 
tatgggtgtg 
aaccgctgaa 
taggtaggag 
cacaccacat 
gagctggaac 
tggtttctgt 
ctgtgaccac 
tatttccttt 
ttgtgtgcca 
cagagtagca 
gagctgacac 
cagagtctgc 
tcctctgcat 
acgaagcccc 
caagagcaag 
ctgggagatc 
tcgcgctccc 
cccaggcttt 
ctcgttcatc 
ccaggctcag 
ggcggaaagt 
tggaggagct 
tggcgccggg 
gggagcgccg 
ctttactgca 
ggctgcggaa 
cctgtagccc 
acctgtggcg 



aggttacttc 
ctctcccttg 
actaaaatat 
gctgcagggg 
tgaggcagaa 
catgtgctta 
actgcctggg 
tgcatgcctc 
gtgctgtgag 
gtaagcaggg 
caggtggggc 
aactgtggga 
nnnnnnnnnn 
nnnnnnnnnn 
atgacagctt 
gcttgcaata 
taaaagagct 
acttgccgaa 
cacccacaca 

gggggggggg 

ctggctcctg 
tacatctggg 
cattgttgtc 
ttctctttta 
acacaccaga 
ttgctgggat 
atctctccat 
gttaccttaa 
attttgagat 
tgtttctgcc 
gttttttgtt 
actcgctctg 
gtgctgggat 
tttcctgaga 
aggttagcct 
cgtgtgccat 
tgggtgcttg 
ccatctctct 
actatacaca 
gaccatgcct 
caaaaccttt 
tctctgtcct 
gctggctggt 
taagaacgcc 
agtggcaagt 
tgggagcatt 
ttggctaaag 
aggagaaatc 
ttaagggcgg 
ctaaaggcag 
aggcgggatc 
acatgacccg 
cgccctgctg 
ctttcctttg 
ccagtcttcc 
ccttgcgctg 
ggttcttaca 
gctccggagc 
gggcactcat 
actgggaccc 
acgttcggta 
acggagactg 
tgaaactctc 
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tctgctcttc 
atctacccga 
ctattcacag 
agacggcaga 
cagggtgaca 
gtaattaatt 
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agacttttgt 
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gagagggtgt 
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gtgtcagatc 
ctaactctga 
cacacacaca 
gtggttttat 
ccgcagtctg 
ttgactagag 
ctacattttc 
gcctgggcca 
gtgctttgga 
atgcctaaag 
tctgagatga 
gaaggtccag 
cacgtagccc 
tgagccttgg 
ctttagagac 
cccgtcacgg 
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gcgcagtcac 
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cagccacagc 

gagggtgtat 

cagggatggg 

ggctttgctg 

gcatgatgta 

tcttctcttc 

ggctctgtct 

cctgagcgct 

tgggacaggc 

gaaagaactt 

cctcatgcag 

gctgccctgg 

caagttccga 

ggcatttgtt 

tgacgtgctc 

gctctgtgga 

gatgcagacc 

ttgctctatc 

ggaatttgtg 

tcactggatt 

gaatgtgtgt 

gctatttgat 

tccactcatc 

ttcaggacaa 

tggctggact 

tctggattcc 

agcatcacct 

caaagctgcc 

cttcagtccc 

cctagggagc 

gagcctcttc 

caccagggtc 

gggtctctgc 

actcctacag 

actgtttgag 

cagccccttc 

tcttttatat 

tctgtggcag 

cctgcctttc 

cctatttgaa 

gcctctcatc 

cgactgtttt 

gctgccccat 

cctggtcggg 

tgcctttggg 

cccagactac 

cttccaagct 

cctgggccag 

tggggatgat 

ggacttgaag 

ggaggaggac 

gctgtccatg 

gccgctgaca 

ctgagggggc 

accagatcca 

ctgtgtgtga 

aaagccatac 

tactcgaggg 

aagtggtggt 

aagagtgttc 

gagaggcagg 

ggtgcaggcc 

gccagcctgg 

tcgaaaaaaa 



actgccccct 

ggatgcacct 

ccctgcccct 

gagtcgcccg 

gttaccttca 

cgtgttctga 

ttgcaccaca 

tacgagatgc 

attaagtctg 

cggggccttg 

ctgaatcggt 

gtggtggact 

ctcaacaagg 

gcaggtggtg 

tctgacatca 

catgtccgag 

tggacaccgg 

caccctgaca 

gctgcccatc 

gatcttacct 

ctgcacctgg 

cagccacacc 

ccccggctgt 

cttataaatg 

agagataggc 

atctccctcc 

ggcctgttgt 

gggttggacc 

atacaggcct 

cagttggagg 

catcgagaca 

cggatactgc 

atacgccact 

ctgagcggac 

tataggccgg 

agctccgtgg 

caggcccggc 

cagctgggtg 

gtgctgtcgc 

cccgttgcca 

ggtgcctatg 

gtggcccagt 

gtcctccagg 

accactgagg 

gaagagattc 

aggtcgggcg 

ggactctacg 

ctgagtgata 

ggcggtgccc 

cagagcgaag 

caggaggatg 

gagacggtgg 

gagcagacag 

atgggtcagg 

ctgtagcacg 

gtggaaagtg 

cttgggctga 

ccctgggtgt 

tgagatgtga 

cagacaggga 

gaaaggctaa 

tttaatccca 

tctacaaagt 

aaaaaaaaaa 



cagctctacc 

tcttgccagt 

ctcggggcag 

agatgctcta 

gccctgccaa 

gggccatgga 

ttgctgtaga 

cttccgagga 

aaaaagaggg 

tgctagactg 

tggcaggtcg 

ttaccacacc 

gagataagca 

caggaagtgg 

cgtactatgt 

cgcagtggga 

atgagtgcat 

tgcccgacct 

gagccctcct 

ttggctacaa 

tggacgctca 

cccaacgcct 

tggtccagcc 

gtgcgggcag 

ctgggacagg 

ccgggaaagc 

ctttttctgc 

ctggggaggg 

tggaagagct 

agcctgaaaa 
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AT t a a o 

211320 


ggatggagtg 


All ~> ft /*\ 

211380 


agggtgggtg 


211440 


gagttccgac 


AT T C 

211500 


agtggccgga 


211560 


acgggcctgg 


211620 


actgactgcc 


211680 


ggtttatgct 


211740 


ctcgtaggcc 


211800 


tagaactgac 


211860 


cttgcttgga 


at t r\n n 

211920 


aggaggctgt 


at t c\ a r\ 

211980 


ctgatggggg 


212040 


tcatagacct 


on at r\r\ 

212100 


atttctccct 


AT AT Ct\ 

212160 


actgttgctt 


AT A A A A 

212220 


gggtctcagt 


212280 


tcctgggtgt 


212340 


aagtaaggac 


212400 


gctctggctc 


AT A A ^ A 

212460 


gcctgcacgc 


212520 


ttgtacctca 


^ t ^ C ft o 

212580 


tatttattta 


212640 


cacaccatta 


212700 


agcctggtct 


212760 


aaaaataaaa 


212820 


tattagccct 


O t *"» ft ft /\ 

212 880 


aagatcctcc 


AT A C\ A A 

212940 


actattttct 


AT A A A A 

213000 


cagtgctaat 


213060 


ttgggagtca 


213120 


ctccagcccc 


AT AT OA 
21olOU 


ttgttttgtt 


213240 


tgtagaccag 


213300 


aattaaaggt 


213360 


tacaggtttt 


213420 


gttttatttg 


213480 
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ttttgagaca 
ctgactttaa 
tctgatggtt 
atatcaccca 
cacagagatc 
ggtgagtacc 
gggcagcgtg 
acagaagccc 
cggcagcgag 
gccaccctct 
tctggctttg 
cctcctggca 
ggcccgctcc 
tacccacatg 
gcttcatgaa 
tgtgatcccc 
aggaaatgaa 
ctctgtttct 
ccaggctcag 
attcattcag 
gggaaggggt 
cctaaccaca 
ggacaggagg 
tctgccatct 
actagttgca 
tgaggctgtg 
ccaaggagct 
tggcttggct 
ccttcctgct 
ttttttaaag 
caccatactt 
cgaagttaat 
gcaagagcag 
ttgatacaag 
cctacctacc 
catgcccacc 
aaacacagca 
tgggttgggt 
gctgaagagg 
tggaacctag 
gctcctgcag 
tggtggtggt 



ggtctcactc 
acttgttttc 
gttaataaca 
gacagtggct 
tgcctcccga 
caagcctcca 
ctcatcagct 
acgcaccact 
gtggtcaccg 
caggccacca 
ctgcccacga 
tagggccagc 
tcattcttga 
gcctgctgtc 
actcgggaga 
attgctcccc 
aggagttggg 
cccttgggta 
gagggctggc 
ccaggtcttc 
tatatgggcc 
gcgtttgtgc 
cagcgggaat 
cttgagtgtt 
gtctatgtaa 
ttgtgttttg 
ggggaactgg 
ttttttcaga 
tgaaacagag 
gca ttgattt 
gtgtctggtc 
gaatggttat 
ccagtgttct 
gtctggtagc 
ctcttctctc 
agtttgcctt 
agagggtaga 
ggccttgggt 
tgggaccctg 
gctggaggga 
cctccccttc 
ggtggtggtg 



tgaaccccta 
tgagtgctgg 
atattattta 
ggtctggaaa 
gttcagaaat 
ttgatgctat 
cctcttccga 
acaagtcagc 
gcactgtagc 
caaagctcag 
aacgccacct 
caggagttgg 
tctcgaagcc 
tgggattgca 
tactgttcct 
agttctgcct 
cctttgctca 
ggaaactgct 
caggacctac 
agtatgggac 
cagctggtcc 
cttcagccgg 
catctctgta 
cgtttggaga 
ataaggtcaa 
ctttgcctag 
tatatgtcac 
tggtctcaag 
tagtgggctt 
ttatacgtgt 
cctgcggagg 
gagccacatc 
tacccactga 
tcaaacttga 
taccttccaa 
gatggcacca 
ggcagccatt 
cagttttccg 
ggagggccgg 
gccaaccagg 
ccttgggtag 
gtgggggggg 



ggtggcctgg 
atttatgggc 
gcttttcttt 
tcactgtgta 
taaaagcact 
ctgtccctcc 
ccattccttg 
gtccgaccca 
caacaagatt 
ttccgagaac 
cctgctgggc 
ctgagggcag 
gattcttcta 
cagctcctga 
aaccagcaag 
tctggattca 
gccagcttcc 
cctgtctagg 
gactttcagg 
cagaaaaaag 
aggaatgaat 
ggaggcagcc 
tctcgggttc 
agcctgagat 
taaattcttt 
gctgggctca 
tgtatatgtc 
tgcctcaggt 
ataggcatga 
atggttgttt 
tcagaagagg 
tcgatgctga 
gccatctcta 
tatgcagccg 
gtgctgatat 
gagtcaggaa 
gtcctctggc 
aatggttcac 
cctccagctg 
atcctggact 
ctgtggtggt 
gggngnngnt 



agcttgctat 
ttgtgctatt 
tggagatagg 
ggccaggctg 
ctgggatggt 
cgctctctgc 
actgtttgga 
atccacacct 
ggtgtctgtt 
ttccgt.ggca 
tcggacaatg 
ggcgagatga 
ggcaagcccc 
atctccaaag 
aattggggca 
catggggaca 
tctagccacg 
gttctgatgg 
gcttggtctg 
gggatgtgag 
ccatgccttg 
cttgggacca 
tgcccagggg 
gtggcccctg 
ggaagagcca 
ggcagctctg 
actgtgcctg 
tggccttgat 
cccaccaggt 
tgcccacttg 
gcatcgggat 
agattgaacc 
agccccacac 
aggaggttga 
tatacatagg 
agtccaaacc 
tgccttggat 
ccttggggaa 
ggtctctcca 
tgctacagtt 
ggtggtggtg 



gtagaacaca 
ttgcccagcc 
ctctcactgt 
accttgaatt 
tttggagttt 
aggctgtaga 
aggagctgga 
ttgacctgta 
ccctgcttga 
cgctcactag 
gcatcatccg 
catctctcag 
aggctctggc 
ccttgaagtg 
aggaaagcac 
gggcagctcc 
ctctccttag 
tactgggact 
gggttttagc 
aacagggcta 
ccttggtacc 
gcatccctag 
atgggcagac 
ctgccttctc 
cggagctgag 
cctcagcctc 
gcttatggct 
cttgggatga 
ccaattttta 
tacatatgca 
cacctggaac 
tggatccttt 
ccagcttctt 
cctggtattc 
catggatagt 
tggtagttgc 
acagagcttc 
agggaacact 
gccctcgcct 
aggtgaacag 
gtggtggtgg 



213540 
213600 
213660 
213720 
213780 
213840 
213900 
213960 
214020 
214080 
214140 
214200 
214260 
214320 
214380 
214440 
214500 
214560 
214620 
214680 
214740 
214800 
214860 
214920 
214980 
215040 
215100 
215160 
215220 
215280 
215340 
215400 
215460 
215520 
215580 
215640 
215700 
215760 
215820 
215880 
215940 
215980 



<210> 17 
<211> 473 
<212> PRT 
<213> Mus sp. 

<400> 17 

Met Lys Arg Ala Ser Ser Gly Gly Ser Arg Leu Leu Ala Trp Val Leu 
1 ~ 5 10 15 

Trp Leu Gin Ala Trp Arg Val Ala Thr Pro Cys Pro Gly Ala Cys Val 
20 ~ 25 30 

Cys Tyr Asn Glu Pro Lys Val Thr Thr Ser Cys Pro Gin Gin Gly Leu 
35 40 45 

Gin Ala Val Pro Thr Gly lie Pro Ala Ser Ser Gin Arg lie Phe Leu 
50 55 60 

His Gly Asn Arg lie Ser His Val Pro Ala Ala Ser Phe Gin Ser Cys 
65 70 75 80 
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Arg Asn Leu Thr lie Leu Trp Leu His Ser Asn Ala Leu Ala Arg lie 
85 90 95 

Asp Ala Ala Ala Phe Thr Gly Leu Thr Leu Leu Glu Gin Leu Asp Leu 
100 105 110 

Ser Asp Asn Ala Gin Leu His Val Val Asp Pro Thr Thr Phe His Gly 
115 120 125 

Leu Gly His Leu His Thr Leu His Leu Asp Arg Cys Gly Leu Arg Glu 
130 135 140 

Leu Gly Pro Gly Leu Phe Arg Gly Leu Ala Ala Leu Gin Tyr Leu Tyr 
145 150 155 " 160 

Leu Gin Asp Asn Asn Leu Gin Ala Leu Pro Asp Asn Thr Phe Arg Asp 
165 170 175 

Leu Gly Asn Leu Thr His Leu Phe Leu His Gly Asn Arg lie Pro Ser 
180 185 190 

Val Pro Glu His Ala Phe Arg Gly Leu His Ser Leu Asp Arg Leu Leu 
195 200 205 

Leu His Gin Asn His Val Ala Arg Val His Pro His Ala Phe Arg Asp 
210 215 220 

Leu Gly Arg Leu Met Thr Leu Tyr Leu Phe Ala Asn Asn Leu Ser Met 
225 230 235 240 

Leu Pro Ala Glu Val Leu Met Pro Leu Arg Ser Leu Gin Tyr Leu Arg 
245 250 255 

Leu Asn Asp Asn Pro Trp Val Cys Asp Cys Arg Ala Arg Pro Leu Trp 
260 265 ^ ~ 270 

Ala Trp Leu Gin Lys Phe Arg Gly Ser Ser Ser Glu Val Pro Cys Asn 
275 280 285 

Leu Pro Gin Arg Leu Ala Asp Arg Asp Leu Lys Arg Leu Ala Ala Ser 
290 295 300 

Asp Leu Glu Gly Cys Ala Val Ala Ser Gly Pro Phe Arg Pro lie Gin 
305 310 315 320 

Thr Ser Gin Leu Thr Asp Glu Glu Leu Leu Ser Leu Pro Lys Cys Cys 
325 330 " 335 

Gin Pro Asp Ala Ala Asp Lys Ala Ser Val Leu Glu Pro Gly Arg Pro 
340 345 350 

Ala Ser Ala Gly Asn Ala Leu Lys Gly Arg Val Pro Pro Gly Asp Thr 
355 360 365 

Pro Pro Gly Asn Gly Ser Gly Pro Arg His lie Asn Asp Ser Pro Phe 
370 375 380 

Gly Thr Leu Pro Ser Ser Ala Glu Pro Pro Leu Thr Ala Leu Arg Pro 
385 390 395 400 

Gly Gly Ser Glu Pro Pro Gly Leu Pro Thr Thr Gly Pro Arg Arg Arg 
405 410 ~* 415 

Pro Gly Cys Ser Arg Lys Asn Arg Thr Arg Ser His Cys Arg Leu Gly 
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420 425 430 

Gin Ala Gly Ser Gly Ala Ser Gly Thr Gly Asp Ala Glu Gly Ser Gly 
435 440 445 

Ala Leu Pro Ala Leu Ala Cys Ser Leu Ala Pro Leu Gly Leu Ala Leu 
450 455 460 

Val Leu Trp Thr Val Leu Gly Pro Cys 
465 470 
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(57) Abstract: The invention relates generally to genes that encode proteins that inhibit axonaJ growth. The invention relates specif- 
ically to genes encodign NgR protein homologs in humans and mice. The invention also includes compositions and methods for 
modulating the expression and activity of Nogo and the NgR proteins. Specifically, the invention includes peptides, proteins and 
antibodies that block Nogo-mediated inhibition of axonal extension. The compositions and methods of the invention are useful in 
the treatment of cranial o cerebral trauma, sprial cord injury, stroke or a demyelinating disease. 
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Category ° Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 
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"M. musculus chromosome 11 clone 
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retrieved from EMBL 
Database accession no. AC021768 
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with aa 21-461 of seq. 4 and aa 1-392 of 
seq. 14 
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"E* earlier document but published on or after the International 
filing dale 

"L" document which may throw doubts on priority claim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

•O" document referring to an oral disclosure, use, exhibition or 
other means 

•P* document published prior to the international filing date but 
later than the priority date claimed 



T" later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

•X* document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

•Y* document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
In the art. 

"&" document member of the same patent family 



Date of the actual completion of the international search 
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Date of mailing of the international search report 

08/11/2002 


Name and malBng address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL-2280HVRQswijk 
TeL (+31-70) 340-2040, Tx. 31 651 epo nl. 
Fax: (+31-70) 340-3016 
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Galli, I 
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Category « 



Citation of document, with indication .where appropriate, of the relevant passages 



Relevant to claim No. 



P,A 



DATABASE EMBL NUCLEIC ACIDS 'Online! 
15 November 1999 (1999-11-15) 

"Homo sapiens clone RP11-10E2" 
retrieved from EMBL 
Database accession no. AC013606 
XP002217419 

compare aa 168-420 of seq. 2 with the 
translation product of nt 138578-139331. 

DATABASE EMBL PROTEINS 'Online! 

1 May 2000 (2000-05-01) 

CORBY N. : "New leucine-rich protein" 

retrieved from EMBL 

Database accession no. Q9UGS3 

XP002217415 

compare aa 352-399 with seq. 11,19 

DATABASE EMBL PROTEINS 'Online! 

1 October 2000 (2000-10-01) 

0SADA N. ET AL.: "Unnamed protein product" 

retrieved from EMBL 

Database accession no. Q9N0E3 

XP002217416 

compare with sequences 2,4,5,11,17,18,19 

WO 00 31235 A (CHEN MAI0 S ; SCHWAB MARTIN 
E (CH)) 2 June 2000 (2000-06-02) 
the whole document 

WO 01 51520 A (UNIV YALE ; STRITTMATTER 
STEPHEN M (US)) 19 July 2001 (2001-07-19) 
NgRl: compare seq. 1,3 with seq. 5,17 of 
present application, respectively 



1-4,6-30 



1,11,15 



1-30 
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FURTHER INFORMATION CONTINUED FROM PCT/1SA/ 210 



Continuation of Box 1.2 



Claims 1,2,6,7,9-13,15,16,18-30 relate to an extremely large number of 
possible compounds. It is not possible to determine all the 
possibilities, as any given X can be any amino acid or even a gap. In 
fact, the claims contain so many options, variables, possible 
permutations and provisos that a lack of clarity (and/or conciseness) 
within the meaning of Article 6 PCT arises to such an extent as to render 
a meaningful search of the claims impossible. Consequently, the search 
has been carried out for those parts of the application which do appear 
to be clear (and/or concise), namely seq. ID 2, 4 and 14. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

1. fxl Claims Nos.: 

— because they relate to subject matter not required to be searched by this Authority, namely: 

Although claims 27, 29 and, as far as methods in vivo are. concerned, claims 26 
and 28 are directed to a method of treatment of the human/animal body, the 
search has been carried out and based on the alleged effects of the 
compound/compos i t i on . 

2. [* Claims Nos.: 

— because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 



3. Claims Nos.: 

— because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This international Searching Authority found multiple inventions in this international application, as follows: 



□ As all required additional search fees were timely paid by the applicant, this International Search Report < 
searchable claims. 



2. [3| As a N searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
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NOGO RECEPTOR HOMQLQGS 

FIELD OF THE INVENTION 

5 The invention relates to neurology and molecular biology. More particularly, 

the invention relates to CNS neurons and axonal growth 

BACKGROUND 

Among the mechanisms through which the cells of an organism communicate 

10 with each other and obtain information and stimuli from their environment is through 
cell membrane receptor molecules expressed on the cell surface. Many such receptors 
have been identified, characterized, and sometimes classified into major receptor 
superfamilies based on structural motifs and signal transduction features. The receptors 
are a first essential link for translating an extracellular signal into a cellular 

1 5 physiological response. 

Receptors on neurons are particularly important in the development of the 
nervous system during embryogenesis. The neurons form connections with target cells 
during development through axonal extension of the neurons toward the target cells in 
a receptor-mediated process. Axons and dendrites have a specialized region of their 

20 distal tips known as the growth cone. Growth cones enable the neuron to sense the 

local environment through a receptor-mediated process and direct the movement of the 
axon or dendrite of the neuron toward the neuron's target cell. This process is known 
as elongation. Growth cones can be sensitive to several guidance cues, for example, 
surface adhesiveness, growth factors, neurotransmitters and electric fields. The 

25 guidance of growth at the cone depends on various classes of adhesion molecules, 
intercellular signals, as well as factors that stimulate and inhibit growth cones. 

Interestingly, damaged neurons do not elongate in the central nervous system 
(CNS) following injury due to trauma or disease, whereas axons in the peripheral 
nervous system (PNS) regenerate readily. The fact that damaged CNS neurons fail to 

30 elongate is not due to an intrinsic property of CNS axons, but rather due to the CNS 
environment that is not permissive for axonal elongation. Classical grafting 
experiments by Aguayo and colleagues (e.g., Richardson et aL, (1980) Nature 284, 
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264-265) demonstrated that CNS axons can in fact elongate over substantial distances 
within peripheral nerve grafts, and that CNS myelin inhibits CNS axon elongation. 
Therefore, given the appropriate environment, CNS axons can regenerate, implying 
that CNS axonal injury can potentially be addressed by appropriate manipulation of the 
5 CNS environment. 

The absence of axon regeneration following injury can be attributed to the 
presence of axon growth inhibitors. These inhibitors are predominantly associated 
with myelin and constitute an important barrier to regeneration. Axon growth 
inhibitors are present in CNS-derived myelin and the plasma membrane of 

10 oligodendrocytes that synthesize myelin in the CNS (Schwab et al, (1993) Annu. Rev. 
Neurosci. 16, 565-595).' Myelin-associated inhibitors appear to be a primary 
contributor to the failure of CNS axon regeneration in vivo after an interruption of 
axonal continuity, whereas other non-myelin associated axon growth inhibitors in the 
CNS may play a lesser role. These inhibitors block axonal regeneration following 

15 neuronal injury due to trauma, stroke or viral infection. 

Numerous myelin-derived axon growth inhibitors have been characterized (see, 
for review, David et al, (1999) W0995394547; Bandman et al, (1999) U.S. Patent 
No. 5,858,708; Schwab, (1996) Neurochem. Res. 21, 755-761). Several components 
of CNS white matter, NI35, NI250 (Nogo) and Myelin-associated glycoprotein 

20 (MAG), which have inhibitory activity for axonal extension, have been described as 
well (Schwab et al., (1990) WO9005191; Schwab et al, (1997) U.S. Patent No. 
5,684,133). In particular, Nogo is a 250 kDa myelin-associated axon growth inhibitor 
that was originally characterized based on the effects of the purified protein in vitro 
and monoclonal antibodies that neutralize the protein's activity (Schwab (1990) Exp. 

25 Neurol. 109, 2-5). The Nogo cDNA was first identified through random analysis of 
brain cDNA and had no suggested function (Nagase et al, (1998) DNA Res. 5, 
355-364). The identification of this Nogo cDNA as the cDNA encoding the 250 kDa 
myelin-associated axon growth inhibitor was discovered only recently (GrandPre et al, 
(2000) Nature 403, 439-444; Chen et al., (2000) Nature 403, 434-439; Prinjha at al., 

30 (2000) Nature 403, 383-384). 

Importantly, Nogo has been shown to be the primary component of CNS 
myelin responsible for inhibiting axonal elongation and regeneration. Nogo's selective 
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expression by oligodendrocytes and not by Schwann cells (the cells that myelinate P.S. 
axons) is consistent with the inhibitory effects of CNS myelin, in contrast to P.S. 
myelin (GrandPre et aL, (2000) Nature 403, 434-439). In culture, Nogo inhibits 
axonal elongation and causes growth cone collapse (Spillmann et aL, (1998) J. Biol. 
5 Chem. 272, 19283-19293). Antibodies {e.g., IN-1) against Nogo have been shown to 
block most of the inhibitory action of CNS myelin on neurite growth in vitro 
(Spillmann etaL, (1998) J. Biol Chem. 272:19283-19293). These experiments 
indicate that Nogo is the main component of CNS myelin responsible for inhibition of 
axonal elongation in culture. Furthermore, in vivo, the IN-1 antibody has been shown 

10 to enhance axonal regeneration after spinal cord injury, resulting in recovery of 

behaviors such as contact placing and stride length (Schnell and Schwab (1990) Nature 
343, 269-272; Bregman et aL, (1995) Nature 378, 498-501). Thus, there is 
substantial evidence that Nogo is a disease-relevant molecular target. Agents that 
interfere with the binding of Nogo to its receptor would be expected to improve axonal 

15 regeneration in clinical states in which axons have been damaged, and improve patient 
outcome. 

Modulation of Nogo has been described as a means for treatment of 
regeneration for neurons damaged by trauma, infarction and degenerative disorders of 
the CNS (Schwab et aL, (1994) W094 17831; Tatagiba et aL, (1997) Neurosurgery 
20 40, 541-546) as well as malignant tumors in the CNS such as glioblastoma (Schwab et 
aL, (1993) U.S. Patent No. 5,250,414); Schwab et aL, (2000) U.S. Patent No. 
6,025,333). 

Antibodies which recognize Nogo have been suggested to be useful in the 
diagnosis and treatment of nerve damage resulting from trauma, infarction and 

25 degenerative disorders of the CNS (Schnell & Schwab, (1990) Nature 343, 269-272; 
Schwab etaL, (1997) U.S. Patent No. 5,684,133). For CNS axons, there is a 
correlation between the presence of myelin and the inhibition of axon regeneration 
over long distances (Savio and Schwab (1990) Proc. Natl. Acad Set 87, 4130-4133; 
Keirstead etaL, (1992) Proc. Natl. Acad. Set 89, 11664-11668). After Nogo is 

30 blocked by antibodies, neurons can again extend across lesions caused by nerve 
damage (Schnell and Schwab (1990) Nature 343, 269-272). 
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SUMMARY OF THE INVENTION 

Genes encoding homologs (NgR2 and NgR3) of a Nogo receptor (NgRl) in 
mice and humans have been discovered. Various domains in the polypeptides encoded 
by the NgR2 and NgR3 genes have been identified and compared to domains in mouse 
5 and human NgRl polypeptides. This comparison has led to identification of a 

consensus sequence (NgR consensus sequence) that characterizes a family of proteins 
(NgR family). Based on these and other discoveries, the invention features molecules 
and methods for modulating axonal growth in CNS neurons. 

The invention provides a polypeptide that contains a polypeptide containing a 
10 tryptophan rich LRRCT domain consisting of the amino acid sequence: 

NX 1 WX 2 CX 3 CRARX 4 LWX 5 WX 6 X 7 X 8 X 9 RX 10 SSSX U V 
X 12 C X 13 X 14 P X 15 X 16 X 17 X lg X 19 X 20 D L X 21 X 22 L X^ X 24 X 25 D 

15 

X26 X 27 X 28 C [SEQ ID NO: 19] 



wherein X is any protein amino acid or a gap, and the polypeptide does not 
include amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 
20 (human NgRl) or SEQ ID NO: 17 (mouse NgRl). 

Preferably, XI 7 and X23 are (independently) arginine or lysine. In some 
embodiments, the amino acid sequence of the LRRCT domain is residues 261-310 of 
SEQ ID NO:2, or residues 261-310 of SEQ ID NO: 2 with up to 10 conservative 
25 amino acid substitutions. In some embodiments, the polypeptide contains the 
following NTLRRCT amino acid sequence: 

C P X j X 2 C X 3 C Y X 4 X^ P X5 X 7 T X 8 S C X^ X 10 X n X 12 X 13 X 14 X 15 X 16 P 
X 18 X 19 P Xjq X 21 X^ X^ R X 24 F L X^ X 26 N X 27 I X 28 X^ X 30 X 31 X 32 X 33 
X 34 F X 35 X 36 X 37 X 3S X 39 X^ X 41 X 42 L W X 43 X^ S N X 45 X^ X 47 X^ I X^ 
30 Xjo Xj! X5 2 F X 53 X^ X 5S X^ X5 7 L E X 58 LDL X5 9 D N X^ X 61 L X^ X^ X^ 

^65 P T P Gr L X^g X 69 L X 70 X 7 j L X^ L X^ X 74 C X 7S L X 76 X^ L X 78 
X 79 X 80 X 8l F Xgj G L X 0 X 84 L Q Y L Y L Q X 8S N X M Xg, X 88 Xg, L Xso D 
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^F^DL^NLX^HLFLHGN^ Xc> 8 X, 9 X 100 X 101 X 102 
X103 X 104 FRGL X 105 X 106 LDRLLLH X 107 N X 108 X 109 X 110 X 1H V H X 112 
X n3 A F X U4 X u5 L X 116 R L X 117 X llg L X 119 L F X 120 N X 121 L X 122 X 123 L 
■^124 X 125 y». i26 X 127 L X 128 X 129 L X 130 X 131 L X 132 X 133 L R L N X 134 N X 135 W 
X 136 C X 137 C R X 138 R X 139 L W X 140 W X 141 X 142 X 143 X 144 R X 145 S S S X 146 
V X 147 C X 148 X 149 P X 150 X 151 X 152 X 153 X 154 X 155 D L X 156 X 157 L X 158 X 159 X 160 
D X 161 X 162 X 163 C [SEQ ID NO: 18] 



wherein X is any amino acid residue or a gap and wherein the polypeptide is not the 

10 polypeptide of SEQ ID NO: 5 (human NgRl) or SEQ ID NO: 17 (mouse NgRl). For 
example, Xg, X 37 and X 38 may represent a gap. Specific examples of polypeptides of 
the invention are SEQ ID NO: 2 (human NgR2), SEQ ID NO: 4 (mouse NgR3), and 
SEQ ID NO: 14 (human NgR3). In some embodiments, the polypeptide contains: (a) 
a NTLRRCT domain, and (b) less than a complete CTS domain, provided that a partial 

15 CTS domain, if present, consists of no more than the first 39 amino acids of the CTS 
domain. While the polypeptide may contain a functional GPI domain, a functional GPI 
domain may be absent, e.g., when a soluble polypeptide is desired. A polypeptide of 
the invention optionally includes an amino acid sequence of a heterologous 
polypeptide, e.g., an Fc portion of an antibody. 

20 The invention also provides a nucleic acid encoding an above-described 

polypeptide; a vector containing the nucleic acid, which nucleic acid may be operably 
linked to an expression control sequence; and a transformed host cell containing the 
vector. A method of producing a polypeptide of the invention is also provided. The 
method includes introducing a nucleic acid encoding the above-described polypeptide 

25 into a host cell, culturing the cell under conditions suitable for expression of the 
polypeptide, and recovering the polypeptide. 

The invention also provides an antisense molecule whose nucleotide sequence 
is complementary to a nucleotide sequence encoding a polypeptide selected from the 
group consisting of: a polypeptide consisting of residues 311-395 of SEQ ID NO: 2, a 

30 polypeptide consisting of residues 256-396 of SEQ ID NO: 14 and a polypeptide 

consisting of residues 321-438 of SEQ ID NO: 4, wherein the nucleic acid is from 8 to 
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100 nucleotides in length, e.g., about 20, 30, 40, 50, 60, 70, 80 or 90 nucleotides. The 
invention also provides a nucleic acid encoding such an antisense molecule. 

The invention also provides an antibody that binds to an above-described 
polypeptide. Polypeptides or antibodies of the invention can be formulated into 
5 pharmaceutical compositions containing the polypeptide or antibody and a 
pharmaceutically acceptable carrier. 

The invention also provides a method for decreasing inhibition of axonal 
growth of a CNS neuron. The method includes the step of contacting the neuron with 
an effective amount of a polypeptide or antibody of the invention. 

10 The invention also provides a method for treating a central nervous system disease, 

disorder or injury. The method includes administering to a mammal, e.g., a human, an 
effective amount of a polypeptide or antibody of the invention. Exemplary diseases, 
disorders and injuries that may be treated using molecules and methods of the 
invention include, but are not limited to, cerebral injury, spinal cord injury, stroke, 

15 demyelinating diseases, e.g., multiple sclerosis, monophasic demyelination, 
encephalomyelitis, multifocal leukoencephalopathy, panencephalitis, 
Marchiafava-Bignami disease, Spongy degeneration, Alexander's disease, Canavan ! s 
disease, metachromatic leukodystrophy and Krabbe's disease. 

The invention also provides a method for identifying a molecule that binds a 

20 * polypeptide of the invention. The method includes the steps of: (a) providing a 
polypeptide of the invention; (b) contacting the polypeptide with the candidate 
molecule; and (c) detecting binding of the candidate molecule to the polypeptide. 

Unless otherwise defined, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which the 

25 invention belongs. In case of conflict, the present application, including definitions, 
will control. All publications, patent and other references mentioned herein are 
incorporated by reference. 

The materials, methods and examples presented below are illustrative only, and 
not intended to be limiting. Other features and advantages of the invention will be 

30 apparent from the detail description and from the claims. 

BRIEF DESCRIPTION OF THE FIGURES 
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Fig. 1 A- IB shows an alignment of NgR2 (SEQ ID NO:2) and NgR3 (SEQ ID 
NO:4) with the known NgR, NgRl (SEQ ID NO:5) and the Consensus Sequence 
(SEQIDNO:6). 

5 

Fig, 2. mNgR3 does not bind hNogoA(1055-1120). COS-7 cells were 
transfected with vectors encoding myc-NgRl or myc-NgR3, fixed, and stained with 
anti-myc antibodies or AP-hNogoA(1055-1120). 

10 Fig. 3. An alignment of the amino acid sequences of human NgRl, murine 

NgRl, murine NgR3, human NgR3 and human NgR2. Numbering begins with amino 
acid #1 of murine NgR3. The consensus sequence is listed below. The LRRNT 
domain is indicated by a shaded box; domains LLR 1, LLR 3, LLR 5, and LLR 7 are 
indicated by open boxes; LLR 2, LLR 4, LLR 6 and LLR 8 are indicated by shaded 

1 5 boxes; and the LLR CT domain is indicated by a shaded box. Amino acids in bold in 
LLR 8 indicate a conserved glycosylation sites. A dot indicates conserved cystine 
residue in LRR4. Box at C terminus indicates putative GPI signals. 

20 DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides purified and isolated polynucleotides (e.g., 
DNA sequences and RNA transcripts, both sense and complementary antisense 
strands, both single- and double-stranded, including splice variants thereof) encoding 
NgR homologs, referred to herein as NgR. Unless indicated otherwise, as used herein, 

25 the abbreviation in lower case (NgR) refers to a gene, cDNA, RNA or nucleic acid 
sequence, whereas the upper case version (NgR) refers to a protein, polypeptide, 
peptide, oligopeptide, or amino acid sequence. Specific proteins are designated by 
number, e.g., "NgR2" is a human NgR homolog, "NgR3" is a murine-derived NgR 
homolog, and "NgRl" is the known NgR identified by Dr. Stephen Strittmatter. 

30 Known NgRs are herein referred to as "NgRs." DNA polynucleotides of the invention 
include genomic DNA, cDNA and DNA that has been chemically synthesized in whole 
or in part. 
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Standard reference works setting forth the general principles of recombinant 
DNA technology known to those of skill in the art include Ausubel et al, CURRENT 
Protocols in Molecular Biology, John Wiley & Sons, New York (1998); 
Sambrook et al , Molecular CLONING: A LABORATORY Manual, 2d Ed., Cold 
5 Spring Harbor Laboratory Press, Plainview, New York (1989); Kaufman et al., Eds., 
Handbook of Molecular and Cellular Methods in Biology and Medicine, 
CRC Press, Boca Raton (1995); McPherson, Ed., Directed Mutagenesis: a 
Practical Approach, IRL Press, Oxford (1991). 

As used herein, the term "axon" refers to a long cellular protrusion from a 
10 neuron, whereby action potentials are conducted, either to or from the cell body. 

As used herein, the term "axonal growth" refers to an extension of the long 
process or axon, originating at the cell body and proceeded by the growth cone. 

As used herein, the term "central nervous system disorder" refers to any 
pathological state associated with abnormal function of the central nervous system 
15 (CNS). The term includes, but is not limited to, altered CNS function resulting from 

physical trauma to cerebral tissue, viral infection, autoimmune machanisms and genetic 
mutation. 

As used herein, the term "demyelinating disease" refers to a pathological 
disorder characterized by the degradation of the myelin sheath of the oligodendrocyte 
20 cell membrane. 

As used herein, the term "growth cone" refers to a specialized region at the tip 
of a growing neurite that is responsible for sensing the local environment and moving 
the axon toward its appropriate synaptic target cell. 

As used herein, the term "growth cone movement" refers to the extension or 
25 collapse of the growth cone toward a neuron's target cell. 

As used herein, the term "neurite" refers to a process growing out of a neuron. 
As it is sometimes difficult to distinguish a dendrite from in axon in culture, the term 
"neurite" is used for both. 

As used herein, the term "oligodendrocyte" refers to a neuroglial cell of the 
30 CNS whose function is to myelinate CNS axons. 

"Synthesized" as used herein and understood in the art, refers to 
polynucleotides produced by purely chemical, as opposed to enzymatic, methods. 
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"Wholly" synthesized DNA sequences are therefore produced entirely by chemical 
means, and "partially" synthesized DNAs embrace those wherein only portions of the 
resulting DNA were produced by chemical means. By the term "region" is meant a 
physically contiguous portion of the primary structure of a biomolecule. In the case of 
5 proteins, a region is defined by a contiguous portion of the amino acid sequence of that 
protein. The term "domain" is herein defined as referring to a structural part of a 
biomolecule that contributes to a known or suspected function of the biomolecule. 
Domains may be co-extensive with regions or portions thereof; domains may also 
incorporate a portion of a biomolecule that is distinct from a particular region, in 

10 addition to all or part of that region. Examples of NgR protein domains include, but 
are not limited to, the signal peptide, extracellular (i.e., N-terminal) domain, and 
leucine-rich repeat domains. 

As used herein, the term "activity" refers to a variety of measurable indicia 
suggesting or revealing binding, either direct or indirect; affecting a response, i.e., 

15 having a measurable affect in response to some exposure or stimulus, including, for 
example, the affinity of a compound for directly binding a polypeptide or 
polynucleotide of the invention, or, for example, measurement of amounts of upstream 
or downstream proteins or other similar functions after some stimulus or event. Such 
activities may be measured by assays such as competitive inhibition of NgRl binding to 

20 Nogo assays wherein, for example, unlabeled, soluble NgR2 is added to an assay 

system in increasing concentrations to inhibit the binding of Nogo to NgRl expressed 
on the surface of CHO cells. As another example, one may assess the ability of 
neurons to extend across lesions caused by nerve damage (as in Schnell and Schwab 
(1990) Nature 343, 269-272) following inhibition of Nogo by various forms of NgR2 

25 and/or NgR3 as a biological indicator of NgR function. 

As used herein, the term "antibody" is meant to refer to complete, intact 
antibodies, and Fab, Fab 1 , F(ab)2, and other fragments thereof. Complete, intact 
antibodies include monoclonal antibodies such as murine monoclonal antibodies, 
chimeric antibodies, anti-idiotypic antibodies, anti-anti-idiotypic antibodies, and 

30 humanized antibodies. 

As used herein, the term "binding" means the physical or chemical interaction 
between two proteins or compounds or associated proteins or compounds or 
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combinations thereof. Binding includes ionic, non-ionic, hydrogen bonds, Van der 
Waals, hydrophobic interactions, etc. The physical interaction, the binding, can be 
either direct or indirect, indirect being through or due to the effects of another protein 
or compound. Direct binding refers to interactions that do not take place through or 
5 due to the effect of another protein or compound but instead are without other 
substantial chemical intermediates. 

As used herein, the term "compound" means any identifiable chemical or 
molecule, including, but not limited to, small molecules, peptides, proteins, sugars, 
nucleotides or nucleic acids, and such compound can be natural or synthetic. 

10 As used herein, the term "complementary" refers to Watson-Crick basepairing 

between nucleotide units of a nucleic acid molecule. 

As used herein, the term "contacting" means bringing together, either directly 
or indirectly, a compound into physical proximity to a polypeptide or polynucleotide of 
the invention. The polypeptide or polynucleotide can be in any number of buffers, 

15 salts, solutions etc. Contacting includes, for example, placing the compound into a 

beaker, microtiter plate, cell culture flask, or a microarray, such as a gene chip, or the 
like, which contains the nucleic acid molecule, or polypeptide encoding the NgR or 
fragment thereof. 

As used herein, the phrase "homologous nucleotide sequence," or "homologous 
20 amino acid sequence," or variations thereof refers to sequences characterized by an 
identity at the nucleotide level, or a homology at the amino acid level, of at least the 
specified percentage. Homologous nucleotide sequences include those sequences 
coding for isoforms of proteins. Such isoforms can be expressed in different tissues of 
the same organism as a result o£ for example, alternative splicing of RNA. 
25 Alternatively, isoforms can be encoded by different genes. Homologous nucleotide 

sequences include nucleotide sequences encoding for a protein of a species other than 
humans, including, but not limited to, mammals. Homologous nucleotide sequences 
also include, but are not limited to, naturally occurring allelic variations and mutations 
of the nucleotide sequences set forth herein. A homologous nucleotide sequence does 
30 not, however, include the nucleotide sequence encoding NgRl. Homologous amino 
acid sequences include those amino acid sequences which contain conservative amino 
acid substitutions and which polypeptides have the same binding and/or activity. A 
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homologous amino acid sequence does not, however, include the amino acid sequence 
encoding other known NgRs. Percent homology can be determined by, for example, 
the Gap program (Wisconsin Sequence Analysis Package, Version 8 for Unix, 
Genetics Computer Group, University Research Park, Madison WI), using the default 
5 settings, which uses the algorithm of Smith and Waterman (Adv. AppL Math., 1981, 2, 
482-489, which is incorporated herein by reference in its entirety). 

As used herein, the term "isolated" nucleic acid molecule refers to a nucleic 
acid molecule (DNA or RNA) that is substantially free of nucleic acids encoding other 
proteins with which it is associated in nature, i.e., a nucleic acid that has been removed 

10 from its native environment. Examples of isolated nucleic acid molecules include, but 
are not limited to, recombinant DNA molecules contained in a vector, recombinant 
DNA molecules maintained in a heterologous host cell, partially or substantially 
purified nucleic acid molecules, and synthetic DNA or RNA molecules. Preferably, an 
"isolated" nucleic acid is free of sequences which naturally flank the nucleic acid (i.e., 

15 sequences located at the 5* and 3' ends of the nucleic acid) in the genomic DNA of the 
organism from which the nucleic acid is derived. For example, in various 
embodiments, the isolated NgR nucleic acid molecule can contain less than about 50 
kb, 25 kb, 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which 
naturally flank the nucleic acid molecule in genomic DNA of the cell from which the 

20 nucleic acid is derived. Moreover, an "isolated" nucleic acid molecule, such as a 

cDNA molecule, can be substantially free of other cellular material or culture medium 
when produced by recombinant techniques, or of chemical precursors or other 
chemicals when chemically synthesized. 

As used herein, the term "heterologous" refers to a nucleotide or amino acid 

25 sequence that is a different, or non-corresponding sequence, or a sequence derived 

from a different species. For example, a mouse NgR nucleotide or amino acid sequence 
is heterologous to a human NgR nucleotide or amino acid sequence, and a human NgR 
nucleic or amino acid sequence is heterologous to a human immunoglobulin nucleotide 
or amino acid sequence. 

30 As used herein, a "soluble NgR polypeptide" is a NgR polypeptide that does 

not anchor itself in a membrane. Such soluble polypeptides include, for example, 
NgR2 and NgR3 polypeptides that lack a sufficient portion of their GPI anchor signal 
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to anchor the polypeptide or are modified such that the GPI anchor signal is not 
adequate to result in replacement of the peptide with a GPI anchor. In preferred 
embodiments, up to 5, 10, 20 or 25 amino acids are removed from the C-terminus of 
NgR2 or NgR3 to make the respective proteins soluble. As used herein soluble NgR 
5 polypeptides include full-length or truncated (e.g., with internal deletions) NgR. 

Soluble NgR polypeptides may include the entire NgR protein up to the 
putative GPI signal sequence (e.g., amino acid 1 to about amino acid 395 of NgR2, 
and from amino acid 1 to about amino acid 438 of NgR3). In other embodiments, the 
signal peptide of the proteins may be removed or truncated (e.g., all or part of the 
10 signal sequence of NgR2, which spans amino acid 1 to about amino acid 30 of SEQ ID 
NO:2, may be removed; all or part of the signal sequence of NgR3, which spans amino 
acid 1 to about amino acid 40 of SEQ ID NO:4, may be removed). In some 
embodiments, the mature NgR2 (SEQ ID NO:8) and the mature NgR3 (SEQ ID 
NO: 9) are used. 

15 Soluble NgR polypeptides include at least one of the putative ligand-binding 

portions of NgR, including the first cysteine-rich region (SEQ ID NO: 10, the leucine 
repeat region (SEQ ID NO: 12) and the second cysteine-rich region (SEQ ID NO: 1 1). 
In some embodiments, soluble NgR polypeptides consist of amino acid 1 through 
about amino acid 395 of SEQ ID NO: 2, or amino acid 1 through about amino acid 

20 438 of SEQIDNO:4. 

In other embodiments, the soluble NgR polypeptides are fusion proteins that 
contain amino acids 30 through about amino acid 395 of mature NgR2 or amino acid 
40 through about amino acid 438 of NgR3, the C-terminal 10 amino acids of a human 
IgG 1 hinge region containing the two cysteine residues thought to participate in 

25 interchain disulfide bonding, and the CH2 and CH3 regions of a human IgGI heavy 
chain constant domain. This type of recombinant protein is designed to modulate 
inhibition of axonal elongation through inhibition of the Nogo ligand binding to NgRl, 
or by inhibiting the ligand of the NgR from interacting with cell surface NgR. The 
NgR portion of the fusion binds to the Nogo ligand and the IgGI portion binds to the 

30 FcyRI (macrophage) and Fcylll (NK cells and neutrophils) receptors. 

The production of the soluble polypeptides useful in this invention may be 
achieved by a variety of methods known in the art. For example, the polypeptides may 



BNSDOCJD: <WO. 



_022905SA3_IA> 



WO 02/029059 



PCT/USO 1/3 1488 



- 13- 

be derived from intact transmembrane NgR molecules by proteolysis using specific 
endopeptidases in combination with exopeptidases, Edman degradation, or both. The 
intact NgR molecule, in turn, may be purified from its natural source using 
conventional methods. Alternatively, the intact NgR may be produced by known 
5 recombinant DNA techniques using cDNAs, expression vectors and well-known 
techniques for recombinant gene expression. 

Preferably, the soluble polypeptides useful in the present invention are 
produced directly, thus eliminating the need for an entire NgR as a starting material. 
This may be achieved by conventional chemical synthesis techniques or by well-known 

1 0 recombinant DNA techniques wherein only those DNA sequences which encode the 
desired peptides are expressed in transformed hosts. For example, a gene which 
encodes the desired soluble NgR polypeptide may be synthesized by chemical means 
using an oligonucleotide synthesizer. Such oligonucleotides are designed based on the 
amino acid sequence of the desired soluble NgR polypeptide. Specific DNA sequences 

15 coding for the desired peptide also can be derived from the full- length DNA sequence 
by isolation of specific restriction endonuclease fragments or by PCR synthesis of the 
specified region from cDNA 

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NOs: 1, 3 or a complement of either of 

20 these nucleotide sequences, can be isolated using standard molecular biology 

techniques and the sequence information provided herein. Using all or a portion of the 
nucleic acid sequences of SEQ ID NOs: 1 or 3 as a hybridization probe, NgR nucleic 
acid sequences can be isolated using standard hybridization and cloning techniques 
{e.g., as described in Sambrook et al y eds., MOLECULAR CLONING: A LABORATORY 

25 MANUAL 2 nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 

1989; and Ausubel, et al 9 eds., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John 
Wiley & Sons, New York, NY, 1993). 

A nucleic acid of the invention can be amplified using cDNA, mRNA or 
alternatively, genomic DNA, as a template and appropriate oligonucleotide primers 

30 according to standard PCR amplification techniques. The nucleic acid so amplified can 
be cloned into an appropriate vector and characterized by DNA sequence analysis. 
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Furthermore, oligonucleotides corresponding to NgR nucleotide sequences can be 
prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer. 

As used herein, the terms "modulates 1 ' or "modifies" means an increase or 
decrease in the amount, quality, or effect of a particular activity or protein. 
5 As used herein, the term "oligonucleotide" refers to a series of linked 

nucleotide residues which has a sufficient number of bases to be used in a polymerase 
chain reaction (PCR). This short sequence is based on (or designed from) a genomic 
or cDNA sequence and is used to amplify, confirm or reveal the presence of an 
identical, similar or complementary DNA or RNA in a particular cell or tissue. 

10 Oligonucleotides comprise portions of a DNA sequence having at least about 10 
nucleotides and as many as about 50 nucleotides, preferably about 15 to 30 
nucleotides. They are chemically synthesized and may be used as probes. 

As used herein, the term "probe" refers to nucleic acid sequences of variable 
length, preferably between at least about 10 and as many as about 6,000 nucleotides, 

1 5 depending on use. They are used in the detection of identical, similar or 

complementary nucleic acid sequences. Longer length probes are usually obtained from 
a natural or recombinant source, are highly specific and much slower to hybridize than 
oligomers. They may be single- or double-stranded and carefully designed to have 
specificity in PCR, hybridization membrane-based, or ELISA-like technologies. 

20 The term "preventing" refers to decreasing the probability that an organism 

contracts or develops an abnormal condition. 

The term "treating" refers to having a therapeutic effect and at least partially 
alleviating or abrogating an abnormal condition in the organism. 

The term "therapeutic effect" refers to the inhibition or activation factors 

25 causing or contributing to the abnormal condition. A therapeutic effect relieves to 

some extent one or more of the symptoms of the abnormal condition. In reference to 
the treatment of abnormal conditions, a therapeutic effect can refer to one or more of 
the following: (a) an increase in the proliferation, growth, and/or differentiation of 
cells; (b) inhibition (i.e., slowing or stopping) of cell death; (c) inhibition of 

30 degeneration; (d) relieving to some extent one or more of the symptoms associated 

with the abnormal condition; and (e) enhancing the function of the affected population 
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of cells. Compounds demonstrating efficacy against abnonnal conditions can be 
identified as described herein. 

The term "abnormal condition" refers to a function in the cells or tissues of an 
organism that deviates from their normal functions in that organism. An abnormal 
5 condition can relate to cell proliferation, cell differentiation, cell signaling, or cell 

survival. An abnormal condition may also include obesity, diabetic complications such 
as retinal degeneration, and irregularities in glucose uptake and metabolism, and fatty 
acid uptake and metabolism. 

Abnormal cell proliferative conditions, for example, include cancers such as 
10 fibrotic and mesangial disorders, abnormal angiogenesis and vasculogenesis, wound 
healing, psoriasis, diabetes mellitus and inflammation. 

Abnormal differentiation conditions include, for example, neurodegenerative 
disorders, slow wound healing rates and slow tissue grafting healing rates. 

Abnormal cell signaling conditions include, for example, psychiatric disorders 
1 5 involving excess neurotransmitter activity. 

Abnormal cell survival conditions may also relate to conditions in which 
programmed cell death (apoptosis) pathways are activated or abrogated. A number of 
protein kinases are associated with the apoptosis pathways. Aberrations in the 
function of any one of the protein kinases could lead to cell immortality or premature 
20 cell death. 

The term "administering" relates to a method of incorporating a compound into 
cells or tissues of an organism. The abnormal condition can be prevented or treated 
when the cells or tissues of the organism exist within the organism or outside of the 
organism. Cells existing outside the organism can be maintained or grown in cell 

25 culture dishes. For cells harbored within the organism, many techniques exist in the art 
to administer compounds, including (but not limited to) oral, parenteral, dermal, 
injection, and aerosol applications. For cells outside of the organism, multiple 
techniques exist in the art to administer the compounds, including (but not limited to) 
cell microinjection techniques, transformation techniques and carrier techniques. 

30 The abnormal condition can also be prevented or treated by administering a 

compound to a group of cells having an aberration in a signal transduction pathway to 
an organism. The effect of administering a compound on organism function can then 
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be monitored. The organism is preferably a mouse, rat, rabbit, guinea pig or goat, 
more preferably a monkey or ape, and most preferably a human. 

By "amplification" it is meant increased numbers of DNA or RNA in a cell 
compared with normal cells. "Amplification" as it refers to RNA can be the detectable 
5 presence of RNA in cells, since in some normal cells there is no basal expression of 
RNA. In other normal cells, a basal level of expression exists, therefore in these cases 
amplification is the detection of at least 1-2-fold, and preferably more, compared to 
the basal level. 

The amino acid sequences are presented in the amino to carboxy direction, 
10 from left to right. The amino and carboxy groups are not presented in the sequence. 
The nucleotide sequences are presented by single strand only, in the 5 1 to 3 1 direction, 
from left to right. Nucleotides and amino acids are represented in the manner 
recommended by the IUPAC-IUB Biochemical Nomenclature Commission or (for 
amino acids) by three letters code. 

15 

Nucleic Acids 

Genomic DNA of the invention comprises the protein-coding region for a 
polypeptide of the invention and is also intended to include allelic variants thereof. It is 
widely understood that, for many genes, genomic DNA is transcribed into RNA 

20 transcripts that undergo one or more splicing events wherein intron (i.e., non-coding 
regions) of the transcripts are removed, or "spliced out." RNA transcripts that can be 
spliced by alternative mechanisms, and therefore be subject to removal of different 
RNA sequences but still encode a NgR polypeptide, are referred to in the art as splice 
variants which are embraced by the invention. Splice variants comprehended by the 

25 invention therefore are encoded by the same original genomic DNA sequences but 
arise from distinct mRNA transcripts. Allelic variants are modified forms of a 
wild-type gene sequence, the modification resulting from recombination during 
chromosomal segregation or exposure to conditions which give rise to genetic 
mutation. Allelic variants, like wild-type genes, are naturally occurring sequences (as 

30 opposed to non-naturally occurring variants arising from in vitro manipulation). 

The invention also comprehends cDNA that is obtained through reverse 
transcription of an RNA polynucleotide encoding NgR (conventionally followed by 
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second-strand synthesis of a complementary strand to provide a double-stranded 
DNA). 

Preferred DNA sequences encoding a human NgR polypeptide is set out in 
SEQ ID NOs: 1 and 13 . A preferred DNA of the invention comprises a double 
5 stranded molecule comprising the coding molecule (i.e., the "coding strand") along 

with the complementary molecule (the "non-coding strand" or "complement") having a 
sequence unambiguously deducible from the coding strand according to Watson-Crick 
base-pairing rules for DNA. Also preferred are other polynucleotides encoding NgR 
polypeptides, as shown in SEQ ID NO:3, which comprises murine NgR homolog, 
10 NgR3. 

Also preferred are nucleotide sequences that encode at least a portion of a NgR 
polypeptide that has at least one biological function of a NgR. More preferred are 
nucleotide sequences that encode a portion of NgR that encodes at least the mature 
NgR without the hydrophobic C-terminal GPI signal. Also preferred are nucleotide 
15 sequences that encode the portion of NgR that encodes at least the ligand-binding 
region of NgR. 

The invention further embraces other species, preferably mammalian, homologs 
of the human NgR DNA. Species homologs, sometimes referred to as "orthologs," in 
general, share at least 35%, at least 40%, at least 45%, at least 50%, at least 60%, at 

20 least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 
95%, at least 98%, or at least 99% homology with human DNA of the invention. 
Generally, percent sequence "homology" with respect to polynucleotides of the 
invention may be calculated as the percentage of nucleotide bases in the candidate 
sequence that are identical to nucleotides in the NgR sequences set forth in SEQ ID 

25 NOs: 1, 3 or 13, after aligning the sequences and introducing gaps, if necessary, to 
achieve the maximum percent sequence identity. 

The polynucleotide sequence information provided by the invention makes 
possible large-scale expression of the encoded polypeptide by techniques well known 
and routinely practiced in the art. Polynucleotides of the invention also permit 

30 identification and isolation of polynucleotides encoding related NgR polypeptides, such 
as human allelic variants and species homologs, by well-known techniques including 
Southern and/or Northern hybridization, and polymerase chain reaction (PCR). 

BNSDOCtD: <WO 0229059A3_IA> 



WO 02/029059 



PCT/US01/31488 



- 18- 

Examples of related polynucleotides include human and non-human genomic 
sequences, including allelic variants, as well as polynucleotides encoding polypeptides 
homologous to NgR and structurally related polypeptides sharing one or more 
biological, immunological, and/or physical properties of NgR. Non-human species 

5 genes encoding proteins homologous to NgR can also be identified by Southern and/or 
PCR analysis and are useful in animal models for NgR disorders. Knowledge of the 
sequence of a human NgR DNA also makes possible through use of Southern 
hybridization or polymerase chain reaction (PCR) the identification of genomic DNA 
sequences encoding NgR expression control regulatory sequences such as promoters, 
10 operators, enhancers, repressors, and the like. Polynucleotides of the invention are 
also useful in hybridization assays to detect the capacity of cells to express NgR. 
Polynucleotides of the invention may also provide a basis for diagnostic methods useful 
for identifying a genetic alteration(s) in a NgR locus that underlies a disease state or 
states, which information is useful both for diagnosis and for selection of therapeutic 

15 strategies. 

The disclosure herein of a full-length polynucleotide encoding a NgR 
polypeptide makes readily available to the worker of ordinary skill in the art every 
possible fragment of the full-length polynucleotide. The invention, therefore, provides 
fragments of NgR-encoding polynucleotides comprising at least 6, and preferably at 

20 least 14, 16, 18, 20, 25, 50, or 75 consecutive nucleotides of a polynucleotide 

encoding NgR. Preferably, fragments of polynucleotides of the invention comprise 
sequences unique to the NgR-encoding polynucleotide sequence, and therefore 
hybridize under highly stringent or moderately stringent conditions only (i.e., 
"specifically") to polynucleotides encoding NgR (or fragments thereof). 

25 Polynucleotide fragments of genomic sequences of the invention comprise not only 
sequences unique to the coding region, but also include fragments of the full-length 
sequence derived from introns, regulatory regions, and/or other non-translated 
sequences. Sequences unique to polynucleotides of the invention are recognizable 
through sequence comparison to other known polynucleotides, and can be identified 

30 through use of alignment programs routinely utilized in the art, e.g., those made 

available in public sequence databases. Such sequences also are recognizable from 
Southern hybridization analyses to determine the number of fragments of genomic 
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DNA to which a polynucleotide will hybridize. Polynucleotides of the invention can be 
labeled in a manner that permits their detection, including radioactive, fluorescent and 
enzymatic labeling. 

Fragments of polynucleotides are particularly useful as probes for detection of 
5 full-length or fragment of NgR polynucleotides. One or more polynucleotides can be 
included in kits that are used to detect the presence of a polynucleotide encoding NgR, 
or used to detect variations in a polynucleotide sequence encoding NgR. 

The invention also embraces DNAs encoding NgR polypeptides that hybridize 
under moderately stringent or high stringency conditions to the noncoding strand, or 

1 0 complement, of the polynucleotide in any of SEQ ID NOs: 1 or 3 . 

Stringent conditions are known to those skilled in the art and can be found in 
Current Protocols in Molecular Biology, John Wiley & Sons, N. Y. (1989), 
6.3. 176.3.6. Preferably, the conditions are such that sequences at least about 65%, 
70%, 75%, 85%, 90%, 95%, 98% or 99% homologous to each other typically remain 

1 5 hybridized to each other. A non-limiting example of stringent hybridization conditions 
is hybridization in a high salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 
mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA and 500 mg/ml denatured salmon 
sperm DNA at 65 °C. This hybridization is followed by one or more washes in 0.2X 
SSC, 0.01% BSA at 50°C. An isolated nucleic acid molecule of the invention that 

20 hybridizes under stringent conditions to the sequence of SEQ ID NOs: 1 or 3 
corresponds to a naturally occurring nucleic acid molecule. As used herein, a 
"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having 
a nucleotide sequence that occurs in nature (e.g., encodes a natural protein). 
As used herein, "stringent hybridization conditions" means: 42°C in a hybridization 

25 solution comprising 50% formamide, 1% SDS, 1 MNaCl, 10% (wt/vol) dextran 

sulfate, and washing twice for 30 minutes at 60°C in a wash solution comprising 0. 1 X 
SSC and 1% SDS. 

Vectors 

30 Another aspect of the present invention is directed to vectors, or recombinant 

expression vectors, comprising any of the nucleic acid molecules described above. 
Vectors are used herein either to amplify DNA or RNA encoding NgR and/or to 
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express DNA which encodes NgR. As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another nucleic acid to which it has been 
linked. One type of vector is a "plasmid", which refers to a circular double stranded 
DNA loop into which additional DNA segments can be ligated. Another type of 

5 vector is a viral vector, wherein additional DNA segments can be ligated into the viral 
genome. Certain vectors are capable of autonomous replication in a host cell into 
which they are introduced {e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors). Other vectors (e.g., non-episomal 
mammalian vectors) are integrated into the genome of a host cell upon introduction 

10 into the host cell, and thereby are replicated along with the host genome. Moreover, 
certain vectors are capable of directing the expression of genes to which they are 
operatively linked. Such vectors are referred to herein as "expression vectors". In 
general, expression vectors of utility in recombinant DNA techniques are often in the 
form of plasmids. In the present specification, "plasmid" and "vector" can be used 

15 interchangeably as the plasmid is the most commonly used form of vector. However, 
the invention is intended to include such other forms of expression vectors, such as 
viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno- 
associated viruses), that serve equivalent functions. 

Expression of proteins in prokaryotes is most often carried out in K coli with 

20 vectors containing constitutive or inducible promoters directing the expression of 

either fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a 
protein encoded therein, usually to the amino terminus of the recombinant protein. 
Such fusion vectors typically serve three purposes: (1) to increase expression of 
recombinant protein; (2) to increase the solubility of the recombinant protein; and (3) 

25 to aid in the purification of the recombinant protein by acting as a ligand in affinity 
purification. Often, in fusion expression vectors, a proteolytic cleavage site is 
introduced at the junction of the fusion moiety and the recombinant protein to enable 
separation of the recombinant protein from the fusion moiety subsequent to 
purification of the fusion protein. Such enzymes, and their cognate recognition 

30 sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression 
vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson (1988) Gene 67, 
3 1-40), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, 
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Piscataway, N. J.) that fuse glutathione-S-transferase (GST), maltose E binding protein, 
or protein A, respectively, to the target recombinant protein. 

Examples of suitable inducible non-fiision E. coli expression vectors include 
pTrc (Amrann et al, (1988) Gene 69, 301-315) and pET 1 Id (Studier et al y GENE 
5 Expression Technology: Methods in Enzymology 185, Academic Press, San 
Diego, CA. (1990) 60-89). 

One strategy to maximize recombinant protein expression in E. coli is to 
express the protein in host bacteria with an impaired capacity to proteolytically cleave 
the recombinant protein. See, Gottesman, Gene Expression Technology: 

10 Methods in Enzymology 185, Academic Press, San Diego, CA. (1990) 1 19-128. 

Another strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted 
into an expression vector so that the individual codons for each amino acid are those 
preferentially utilized mE. coli (Wada etal, (1992) Nucleic Acids Res. 20, 2111- 
2118). Such alteration of nucleic acid sequences of the invention can be carried out by 

1 5 standard DNA synthesis techniques. 

In another embodiment, the NgR expression vector is a yeast expression 
vector. Examples of vectors for expression in yeast S. cerevisiae include pYepSecl 
(Baldari, et al, (1987) EMBO J. 6, 229-234), pMFa (Kurjan and Herskowitz (1982) 
Ce//30, 933-943), pJRY88 (Schultz etal y (1987) Gene 54, 113-123), pYES2 

20 (Invitrogen Corporation, San Diego, CA), and picZ (InVitrogen Corp, San Diego, 
CA). 

Alternatively, NgR can be expressed in insect cells using baculovirus expression 
vectors. Baculovirus vectors available for expression of proteins in cultured insect 
cells {e.g., SF9 cells) include the pAc series (Smith et al, (1983) Mol Cell. Biol. 3, 
25 2156-2165) and the pVL series (Lucklow and Summers (1989) Virology 170, 31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 329, 840) and pMT2PC 
(Kaufinan et al. (1987) EMBO J. 6, 187-195). When used in mammalian cells, the 
30 expression vector's control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
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prokaryotic and eukaryotic cells. See, e.g., Chapters 16 and 17 of Sambrook et al, 
(Eds.) Molecular Cloning: A Laboratory Manual. 2 nd Ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. 
In another embodiment, the recombinant mammaRan expression vector is 
5 capable of directing expression of the nucleic acid preferentially in a particular cell type 
(e.g., tissue- specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al. 
(1987) Genes Dev. 1, 268-277), lymphoid-specific promoters (Calame and Eaton 

10 (1988) Adv. Immunol. 43, 235-275), in particular promoters of T cell receptors 

(Winoto and Baltimore (1989) EMBO J. 8, 729-733) and immunoglobulins (Banerji et 
al (1983) Cell 33, 729-740; Queen and Baltimore (1983) Cell 33, 741-748), neuron- 
specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. 
Natl. Acad Sci. USA 86, 5473-5477), pancreas-specific promoters (Edlund et al 

15 (1985) Science 230, 912-916), and mammary gland-specific promoters (e.g., milk 
whey promoter; U.S. Pat. No. 4,873,316 and European Application Publication No. 
264,166). Developmentally-regulated promoters are also encompassed, e.g. y the 
murine hox promoters (Kessel and Gruss (1990) Science 249, 374-379) and the 
a-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev. 3, 537-546). 

20 The invention further provides a recombinant expression vector comprising a 

DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence 
in a manner that allows for expression (by transcription of the DNA molecule) of an 
RNA molecule that is antisense NgR mRNA. Regulatory sequences operatively linked 

25 to a nucleic acid cloned in the antisense orientation can be chosen that direct the 

continuous expression of the antisense RNA molecule in a variety of cell types, for 
instance viral promoters and/or enhancers, or regulatory sequences can be chosen that 
direct constitutive, tissue-specific or cell-type-specific expression of antisense RNA, 
The antisense expression vector can be in the form of a recombinant plasmid, 

30 phagemid or attenuated virus in which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the activity of which can be determined 
by the cell type into which the vector is introduced. For a discussion of the regulation 



BNSDOCID: <WO. 



_0229059A3_IA> 



WO 02/029059 



PCTYUS01/31488 



-23 - 

of gene expression using antisense genes see Weintraub et ah, Antisense RNA as a 
molecular tool for genetic analysis, REVIEWS— TRENDS IN GENETICS, Vol 1(1) 1986. 

Preferred vectors include, but are not limited to, plasmids, phages, cosmids, 
episomes, viral particles or viruses and integratable DNA fragments (i.e., fragments 
5 integratable into the host genome by homologous recombination). Preferred viral 
particles include, but are not limited to, adenoviruses, baculoviruses, parvoviruses, 
herpesviruses, poxviruses, adeno-associated viruses, Semliki Forest viruses, vaccinia 
viruses and retroviruses. Preferred expression vectors include, but are not limited to, 
pcDNA3 (Invitrogen) and pSVL (Pharmacia Biotech). Other expression vectors 

10 include, but are not limited to, pSPORT™ vectors, pGEM™ vectors (Promega), 

pPROEXvectors™ (LIT, Bethesda, MD), Bluescript™ vectors (Stratagene), pQE™ 
vectors (Qiagen), pSE420™ (Invitrogen) and pYES2™(Invitrogen). 

Preferred expression vectors are replicable DNA constructs in which a DNA 
sequence encoding NgR is operably linked or connected to suitable control sequences 

15 capable of effecting the expression of the NgR in a suitable host. DNA regions are 
operably linked or connected when they are functionally related to each other. For 
example, a promoter is operably linked or connected to a coding sequence if it controls 
the transcription of the sequence. Amplification vectors do not require expression 
control domains, but rather need only the ability to replicate in a host, usually 

20 conferred by an origin of replication, and a selection gene to facilitate recognition of 
transformants. The need for control sequences in the expression vector will vary 
depending upon the host selected and the transformation method chosen. Generally, 
control sequences include, but are not limited to a transcriptional promoter, enhancers, 
an optional operator sequence to control transcription, polyadenylation signals, a 

25 sequence encoding suitable mRNA ribosomal binding and sequences which control the 
termination of transcription and translation. Such regulatory sequences are described, 
for example, in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN 
ENZYMOLOGY 185, Academic Press, San Diego, CA (1990). Regulatory sequences 
include those that direct constitutive expression of a nucleotide sequence in many types 

30 of host cell and those that direct expression of the nucleotide sequence only in certain 
host cells (e.g., tissue-specific regulatory sequences). It will be appreciated by those 
skilled in the art that the design of the expression vector can depend on such factors as 
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the choice of the host cell to be transformed, the level of expression of protein desired, 
etc. The expression vectors of the invention can be introduced into host cells to 
thereby produce proteins or peptides, including fusion proteins or peptides, encoded by 
nucleic acids as described herein (e.g., NgR proteins, mutant forms of NgR, fusion 
5 proteins, etc.). 

Preferred vectors preferably contain a promoter that is recognized by the host 
organism. The promoter sequences of the present invention may be prokaryotic, 
eukaryotic or viral. Examples of suitable prokaryotic sequences include the PR and PL 
promoters of bacteriophage lambda (THE BACTERIOPHAGE LAMBDA, Hershey, A.D. 

10 (Ed.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1973), which is 
incorporated herein by reference in its entirety; LAMBDA n, Hendrix, R.W. (Ed.), Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1980), which is 
incorporated herein by reference in its entirety); the trp, recA, heat shock, and lacZ 
promoters of E. coli and the SV40 early promoter (Benoist et al, (1981) Nature 290, 

15 304-3 10, which is incorporated herein by reference in its entirety). Additional 

promoters include, but are not limited to, mouse mammary tumor virus, long terminal 
repeat of human immunodeficiency virus, maloney virus, cytomegalovirus immediate 
early promoter, Epstein Barr virus, Rous sarcoma virus, human actin, human myosin, 
human hemoglobin, human muscle creatine and human metallothionein. 

20 Additional regulatory sequences can also be included in preferred vectors. 

Preferred examples of suitable regulatory sequences are represented by the 
Shine-Dalgarno sequence of the replicase gene of the phage MS-2 and of the gene ell 
of bacteriophage lambda. The Shine-Dalgarno sequence may be directly followed by 
DNA encoding NgR and result in the expression of the mature NgR protein. 

25 Moreover, suitable expression vectors can include an appropriate marker that 

allows the screening of the transformed host cells. The transformation of the selected 
host is carried out using any one of the various techniques well known to the expert in 
the art and described in Sambrook et al 9 supra. 

An origin of replication can also be provided either by construction of the 

30 vector to include an exogenous origin or may be provided by the host cell 

chromosomal replication mechanism. If the vector is integrated into the host cell 
chromosome, the latter may be sufficient. Alternatively, rather than using vectors 
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which contain viral origins of replication, one skilled in the art can transform 
mammalian cells by the method of co-transformation with a selectable marker and NgR 
DNA An example of a suitable marker is dihydrofolate reductase (DHFR) or 
thymidine kinase (see, U.S. Patent No. 4,399,216). 

5 Nucleotide sequences encoding NgR may be recombined with vector DNA in 

accordance with conventional techniques, including blunt-ended or staggered-ended 
termini for ligation, restriction enzyme digestion to provide appropriate termini, filling 
in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable 
joining and ligation with appropriate ligases. Techniques for such manipulation are 

10 disclosed by Sambrook et al, supra and are well known in the art. Methods for 

construction of mammalian expression vectors are disclosed in, for example, Okayama 
etal y (1983)Mo/. Cell Biol 3:280, Cosmanefa/. (1986)MoA Immunol 23:935, 
Cosman etal, (1984) Nature 312:768, EP-A-0367566, and WO 91/18982, each of 
which is incorporated herein by reference in its entirety. 

15 

Host Cells and Transformed Host Cells 

According to another aspect of the invention, host cells are provided, including 
prokaryotic and eukaryotic cells, comprising a polynucleotide of the invention (or 
vector of the invention) in a manner that permits expression of the encoded NgR 

20 polypeptide. Preferably, the cell produces little or no endogenous NgR polypeptide. 
Polynucleotides of the invention may be introduced into the host cell as part of a 
circular plasmid, or as linear DNA comprising an isolated protein coding region or a 
viral vector. Methods for introducing DNA into the host cell that are well known and 
routinely practiced in the art include transformation, transfection, electroporation, 

25 nuclear injection, or fusion with carriers such as liposomes, micelles, ghost cells and 

protoplasts. Expression systems of the invention include bacterial, yeast, fungal, plant, 
insect, invertebrate, vertebrate and mammalian cells systems. 

Host cells of the invention are a valuable source of immunogen for 
development of antibodies specifically immunoreactive with NgR. Host cells of the 

30 invention are also usefiil in methods for the large-scale production of NgR 

polypeptides wherein the cells are grown in a suitable culture medium and the desired 
polypeptide products are isolated from the cells, or from the medium in which the cells 
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are grown, by purification methods known in the art, e.g., conventional 
chromatographic methods including immunoaffinity chromatography, receptor affinity 
chromatography, hydrophobic interaction chromatography, lectin affinity 
chromatography, size exclusion filtration, cation or anion exchange chromatography, 
5 high pressure liquid chromatography (HPLC), reverse phase HPLC, and the like. Still 
other methods of purification include those methods wherein the desired protein is 
expressed and purified as a fusion protein having' a specific tag, label or chelating 
moiety that is recognized by a specific binding partner or agent. The purified protein 
can be cleaved to yield the desired protein, or can be left as an intact fusion protein. 

10 Cleavage of the fusion component may produce a form of the desired protein having 
additional amino acid residues as a result of the cleavage process. 

Knowledge of NgR DNA sequences allows for modification of cells to permit, 
or increase, expression of endogenous NgR. Cells can be modified (e.g., by 
homologous recombination) to provide increased expression by replacing, in whole or 

15 in part, the naturally occurring NgR promoter with all or part of a heterologous 

promoter so that the cells express NgR at higher levels. The heterologous promoter is 
inserted in such a manner that it is operatively linked to endogenous NgR encoding 
sequences. (See, for example, PCT International Publication No. WO 94/12650, PCT 
International Publication No. WO 92/20808, and PCT International Publication No. 

20 WO 91/09955.) It is also contemplated that, in addition to heterologous promoter 
DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene 
which encodes carbamoyl phosphate synthase, aspartate transcarbamylase, and 
dihydroorotase) and/or intron DNA may be inserted along with the heterologous 
promoter DNA. If linked to the NgR coding sequence, amplification of the marker 

25 DNA by standard selection methods results in co-amplification of the NgR coding 
sequences in the cells. 

The DNA sequence information provided by the present invention also makes 
possible the development (e.g., by homologous recombination or "knock-out" 
strategies; see Capecchi, Science 244:1288-1292 (1989)) of animals that fail to express 

30 functional NgR or that express a variant of NgR. Such animals (especially small 

laboratory animals such as rats, rabbits and mice) are useful as models for studying the 
in vivo activities of NgR and modulators of NgR. 
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Suitable host cells for expression of the polypeptides of the invention include, 
but are not limited to, prokaryotes, yeast, and eukaryotes. If a prokaryotic expression 
vector is employed, then the appropriate host cell would be any prokaryotic cell 
capable ef expressing the cloned sequences. Suitable prokaryotic cells include, but are 
5 not limited to, bacteria of the genera Escherichia, Bacillus, Salmonella, Pseudomonas, 
Streptomyces and Staphylococcus. 

If a eukaryotic expression vector is employed, then the appropriate host cell 
would be any eukaryotic cell capable of expressing the cloned sequence. Preferably, 
eukaryotic cells are cells of higher eukaryotes. Suitable eukaryotic cells include, but 

10 are not limited to, non-human mammalian tissue culture cells and human tissue culture 
cells. Preferred host cells include, but are not limited to, insect cells, HeLa cells, 
Chinese hamster ovary cells (CHO cells), African green monkey kidney cells (COS 
cells), human 293 cells, and murine 3T3 fibroblasts. Propagation of such cells in cell 
culture has become a routine procedure (see, Tissue Culture, Academic Press, Kruse 

15 and Patterson, Eds. (1973), which is incorporated herein by reference in its entirety). 

In addition, a yeast cell may be employed as a host cell. Preferred yeast cells 
include, but are not limited to, the genera Saccharomyces, Pichia and Kluveromyces. 
Preferred yeast hosts are & cerevisiae and P. pastoris. Preferred yeast vectors can 
contain an origin of replication sequence from a 2T yeast plasmid, an autonomously 

20 replication sequence (ARS), a promoter region, sequences for polyadenylation, 

sequences for transcription termination and a selectable marker gene. Shuttle vectors 
for replication in both yeast and E. coli are also included herein. 

Alternatively, insect cells may be used as host cells. In a preferred 
embodiment, the polypeptides of the invention are expressed using a baculovirus 

25 expression system (see, Luckow et aL, Bio/Technology, 1988, 6, 47; BACULOVIRUS 
Expression Vectors: A Laboratory Manual, O r Rielly et aL (Eds.), W.H. 
Freeman and Company, New York, 1992; and U.S. Patent No. 4,879,236, each of 
which is incorporated herein by reference in its entirety). In addition, the MAXBAC™ 
complete baculovirus expression system (Invitrogen) can, for example, be used for 

30 production in insect cells. 

Suitable host cells are discussed further in Goeddel, Gene EXPRESSION 
TECHNOLOGY: Methods IN ENZYMOLOGY 185, Academic Press, San Diego, CA 
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(1990). Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 
polymerase. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
5 conventional transformation or transfection techniques. As used herein, the terms 

"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 
transfection, lipofection, or electroporation. Suitable methods for transforming or 
10 transfecting host cells can be found in Sambrook, ei al. (MOJJECULAR CLONING: A 

LABORATORY Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon 
the expression vector and transfection technique used, only a small fraction of cells 
1 5 may integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Various 
selectable markers include those that confer resistance to drugs, such as G418, 
hygromycin, dihydrofolate reductase (DHFR) and methotrexate. Nucleic acid 
20 encoding a selectable marker can be introduced into a host cell on the same vector as 
that encoding NgR or can be introduced on a separate vector. Cells stably transfected 
with the introduced nucleic acid can be identified by drug selection (e.g., cells that 
have incorporated the selectable marker gene will survive, while the other cells die). 
In a preferred embodiment, the polypeptides of the invention, including forms 
25 of NgR2 and NgR3, soluble forms of NgR, chimeric NgR polypeptides, NgR/Ig 

fusions and fragments and variations of each of the above are expressed in Chinese 
Hamster Ovary (CHO) cells. 

In order to introduce the DNA fragment coding for the NgR protein or 
polypeptide into the CHO cell to express the recombinant NgR protein or polypeptide, 
30 it is necessary to construct the expression vector. 

The vectors for CHO expression include, but are not limited to, pAl-1 1, pXTl, 
pRc/CMV, pRc/RSV and pcDNAINeo. The promoter is not specifically limited 
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provided it effectively promotes expression in CHO cells. Examples of suitable 
promoters are: SRcc, SV40, LTR, CMV, and HSV-TEC. Of these, CMV and Srce 
promoters are preferred. 

In addition to the above-mentioned promoters, the expression vectors may 
5 contain enhancers, splicing signals, polyadenylation signals, selectable markers and an 
SV40 replication origin. Suitable selectable markers include, but are not limited to the 
dihydrofolate reductase (DHFR) gene which provides resistance to methotrexate 
(MTX), the ampicillin resistance gene, and the neomycin resistance gene. 

Examples of the expression vectors each containing the DNA coding for NgR, 

10 portions, fragments and soluble constructs thereof; include the vector (such as one 

described above), into which the promoter is operably linked (preferably upstream) to 
the nucleotide sequence encoding the desired NgR construct; a polyadenylation signal 
downstream from the nucleotide sequence encoding the NgR construct; and, 
preferably, the vector includes an operable DHFR gene. Preferably, the ampicillin 

15 resistant gene is also operably contained in the vector. 

CHO cell lacking the DHFR gene (Urlaub, G. et al, (1980) Proc. Natl Acad 
Scl USA 77, 4216-4220) and CHO-K1 (Proa Natl Acad Set USA 60, 1275 (1968)) 
are suitable for use. 

The NgR expression vectors prepared as above are introduced into CHO cells 

20 by any known method, including, but not limited to the calcium phosphate method 

(Graham and van der Eb (1973) Virol 52, 456-467) and electroporation (Nuemann et 
al, (1982) EMBOJ. 1, 841-845). 

Transformants carrying the expression vectors are selected based on the 
above-mentioned selectable markers. Repeated clonal selection of the transformants 

25 using the selectable markers allows selection of stable cell lines having high expression 
of the NgR constructs. Increased MTX concentrations in the selection medium allows 
gene amplification and greater expression of the desired protein. The CHO cell 
containing the recombinant NgR can be produced by cultivating the CHO cells 
containing the NR expression vectors constitutively expressing the NgR constructs. 

30 Media used in cultivating CHO cells includes DMEM medium supplemented 

with about 0.5 to 20% fetal calf serum, DMEM medium and RPMI1640 medium. The 
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pH of the medium is preferably about 6 to 8. Cultivation is preferably at about 30 to 
40°C for about 15 to 72 hours with aeration. 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (i.e., express) NgR protein. Accordingly, the 
5 invention further provides methods for producing NgR protein using the host cells of 
the invention. In one embodiment, the method comprises culturing the host cell of 
invention (into which a recombinant expression vector encoding NgR has been 
introduced) in a suitable medium such that NgR protein is produced. In another 
embodiment, the method further comprises isolating NgR from the medium or the host 
10 cell. 

In situations where the NgR polypeptide will be found primarily intracellularly, 
intracellular material (including inclusion bodies for Gram-negative bacteria) can be 
extracted from the host cell using any standard technique known to one of ordinary 
skill in the art. Such methods would encompass, by way of example and not by way of 

15 limitation, lysing the host cells to release the contents of the periplasm/cytoplasm by 
French press, homogenization and/or sonication followed by centrifugation. 

If the NgR polypeptide has formed inclusion bodies in the cytosol, such 
inclusion bodies may frequently bind to the inner and/or outer cellular membranes. 
Upon centrifugation, the inclusion bodies will be found primarily in the pellet material. 

20 The pellet material can then be treated at pH extremes or with one or more chaotropic 
agents such as a detergent, guanidine, guanidine derivatives, urea, or urea derivatives 
in the presence of a reducing agent such as dithiothreitol at alkaline pH or 
tris-carboxyethyl phosphine at acid pH to release, break apart and solubilize the 
inclusion bodies. Once solubilized, NgR polypeptide can be analyzed using gel 

25 electrophoresis, immunoprecipitation or the like. Various methods of isolating the 
NgR polypeptide would be apparent to one of ordinary skill in the art, for example, 
isolation may be accomplished using standard methods such as those set forth below 
and in Marston et al (1990) Meth. Enzymol 182, 264-275 (incorporated by reference 
herein in its entirety). 

30 If isolated NgR polypeptide is not biologically active following the isolation 

procedure employed, various methods for "refolding" or converting the polypeptide to 
its tertiary structure and generating disulfide linkages, can be used to restore biological 
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activity. Methods known to one of ordinary skill in the art include adjusting the pH of 
the solubilized polypeptide to a pH usually above 7 and in the presence of a particular 
concentration of a chaotrope. The selection of chaotrope is very similar to the choices 
used for inclusion body solubilization but usually at a lower concentration and is not 
5 necessarily the same chaotrope as used for the solubilization. It may be required to 

employ a reducing agent or the reducing agent plus its oxidized form in a specific ratio, 
to generate a particular redox potential allowing for disulfide shu fflin g to occur in the 
formation of the protein's cysteine bridge(s). Some of the commonly used redox 
couples include cysteine/cystamine, glutathione (GSH)/dithiobis GSH, cupric chloride, 
10 dithiothreitol (DTT)/dithiane DTT, 2-mercaptoethanol (bME)/dithio-b(ME). To 

increase the efficiency of the refolding, it may be necessary to employ a cosolvent, such 
as glycerol, polyethylene glycol of various molecular weights and arginine. 

Transgenic Animals 

15 The host cells of the invention can also be used to produce non-human 

transgenic animals. For example, in one embodiment, a host cell of the invention is a 
fertilized oocyte or an embryonic stem cell into which NgR-coding sequences have 
been introduced. Such host cells can then be used to create non-human transgenic 
animals in which exogenous NgR sequences have been introduced into their genome or 

20 homologous recombinant animals in which endogenous NgR sequences have been 

altered. Such animals are useful for studying the function and/or activity of NgR and 
for identifying and/or evaluating modulators of NgR activity. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a 
rodent such as a rat or mouse, in which one or more of the cells of the animal includes 

25 a transgene. Other examples of transgenic animals include non-human primates, sheep, 
dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA that is 
integrated into the genome of a cell from which a transgenic animal develops and that 
remains in the genome of the mature animal, thereby directing the expression of an 
encoded gene product in one or more cell types or tissues of the transgenic animal. As 

30 used herein, a "homologous recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous NgR gene has been altered 
by homologous recombination between the endogenous gene and an exogenous DNA 
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molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, 
prior to development of the animal. 

A transgenic animal of the invention can be created by introducing NgR- 
encoding nucleic acid into the male pronuclei of a fertilized oocyte, e.g., by 
5 microinjection, retroviral infection, and allowing the oocyte to develop in a 

pseudopregnant female foster animal. The human NgR DNA sequence of SEQ ID 
NOs: 1 or 3 can be introduced as a transgene into the genome of a non-human animal. 
Alternatively, a nonhuman homolog of the human NgR gene, such as a mouse NgR 
gene, can be isolated based on hybridization to the human NgR cDNA (described 

10 further above) and used as a transgene. Intronic sequences and polyadenylation signals 
can also be included in the transgene to increase the efficiency of expression of the 
transgene. A tissue-specific regulatory sequence(s) can be operably linked to the NgR 
transgene to direct expression of NgR protein to particular cells. Methods for 
generating transgenic animals via embryo manipulation and microinjection, particularly 

15 animals such as mice, have become conventional in the art and are described, for 

example, in U.S. Pat. Nos. 4,736,866; 4,870,009; and 4,873,191; and Hogan 1986, in 
Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY. Similar methods are used for production of other transgenic 
animals. A transgenic founder animal can be identified based upon the presence of the 

20 NgR transgene in its genome and/or expression of NgR mRNA in tissues or cells of the 
animals. A transgenic founder animal can then be used to breed additional animals 
carrying the transgene. Moreover, transgenic animals carrying a transgene encoding 
NgR can further be bred to other transgenic animals carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which 

25 contains at least a portion of a NgR gene into which a deletion, addition or substitution 
has been introduced to thereby alter, e.g., functionally disrupt, the NgR gene. The 
NgR gene can be a human gene (e.g., SEQ ID NOs: 1 or 13), but more preferably, is a 
non-human homolog of a human NgR gene. For example, a mouse homolog of human 
NgR gene of SEQ ID NOs: 1 or 13 can be used to construct a homologous 

30 recombination vector suitable for altering an endogenous NgR gene in the mouse 
genome. In one embodiment, the vector is designed such that, upon homologous 
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recombination, the endogenous NgR gene is functionally disrupted {i.e., no longer 
encodes a functional protein; also referred to as a "knock out" vector). 

Alternatively, the vector can be designed such that, upon homologous 
recombination, the endogenous NgR gene is mutated or otherwise altered but still 

5 encodes functional protein (e.g., the upstream regulatory region can be altered to 
thereby alter the expression of the endogenous NgR protein). In the homologous 
recombination vector, the altered portion of the NgR gene is flanked at its 5' and 3 ! 
ends by additional nucleic acid of the NgR gene to allow for homologous 
recombination to occur between the exogenous NgR gene carried by the vector and an 

1 0 endogenous NgR gene in an embryonic stem cell. The additional flanking NgR nucleic 
acid is of sufficient length for successful homologous recombination with the 
endogenous gene. Typically, several kilobases of flanking DNA (both at the 5* and 3' 
ends) are included in the vector. See e.g., Thomas et al. (1987) Cell 51 :503 for a 
description of homologous recombination vectors. The vector is introduced into an 

15 embryonic stem cell line (e.g., by electroporation) and cells in which the introduced 

NgR gene has homologously recombined with the endogenous NgR gene are selected 
(see e.g., U et al (1992) Cell 69:915). 

The selected cells are then injected into a blastocyst of an animal (e.g., a 
mouse) to form aggregation chimeras. See e.g., Bradley 1987, In: 

20 Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, 

Robertson, ed. IRL, Oxford, pp. 113-152. A chimeric embryo can then be implanted 
into a suitable pseudopregnant female foster animal and the embryo brought to term. 
Progeny harboring the homologously recombined DNA in their germ cells can be used 
to breed animals in which all cells of the animal contain the homologously recombined 

25 DNA by germline transmission of the transgene. Methods for constructing 

homologous recombination vectors and homologous recombinant animals are 
described further in Bradley (1991) Curr. Opin. Biotechnol 2:823-829; PCT 
International Publication Nos.: WO 90/11354; WO 91/01140; WO 92/0968; and WO 
93/04169. 

30 In another embodiment, transgenic non-humans animals can be produced that 

contain selected systems that allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PL For 
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a description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) Proc. 
Natl. Acad. Sci. USA 89:6232-6236. Another example of a recombinase system is the 
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al. (1991) 
Science 251:1351-1355. If a cre/loxP recombinase system is used to regulate 
5 expression of the transgene, animals containing transgenes encoding both the Cre 
recombinase and a selected protein are required. Such animals can be provided 
through the construction of "double" transgenic animals, e.g., by mating two 
transgenic animals, one containing a transgene encoding a selected protein and the 
other containing a transgene encoding a recombinase. 

10 Clones of the non-human transgenic animals described herein can also be 

produced according to the methods described in Wilmut et al. (1997) Nature 385:810- 
813. In brief, a cell, e.g. , a somatic cell, from the transgenic animal can be isolated and 
induced to exit the growth cycle and enter G 0 phase. The quiescent cell can then be 
fused, e.g., through the use of electrical pulses, to an enucleated oocyte from an animal 

15 of the same species from which the quiescent cell is isolated. The reconstructed 
oocyte is then cultured such that it develops to morula or blastocyte and then 
transferred to pseudopregnant female foster animal. The offspring borne of this female 
foster animal will be a clone of the animal from which the cell, e.g., the somatic cell, is 
isolated. 

20 

Antisense 

Also provided by the invention are antisense polynucleotides that recognize and 
hybridize to NgR polynucleotides. Full-length and fragment antisense polynucleotides 
are provided. Fragment antisense molecules of the invention include (i) those that 

25 specifically recognize and hybridize to NgR RNA (as determined by sequence 
comparison of DNA encoding NgR to DNA encoding other known molecules). 
Identification of sequences unique to NgR encoding polynucleotides can be deduced 
through use of any publicly available sequence database, and/or through use of 
commercially available sequence comparison programs. After identification of the 

30 desired sequences, isolation through restriction digestion or amplification using any of 
the various polymerase chain reaction techniques well known in the art can be 
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performed. Antisense polynucleotides are particularly relevant to regulating 
expression of NgR by those cells expressing NgR mRNA. 

Antisense oligonucleotides, or fragments of a nucleotide sequence set forth in 
SEQ ID NO:l, 3, 13 or sequences complementary or homologous thereto, derived 
5 from the nucleotide sequences of the present invention encoding NgR are useful as 
diagnostic tools for probing gene expression in various tissues. For example, tissue 
can be probed in situ with oligonucleotide probes carrying detectable groups by 
conventional autoradiography techniques to investigate native expression of this 
enzyme or pathological conditions relating thereto. In specific aspects, antisense 

10 nucleic acid molecules are provided that comprise a sequence complementary to at 

least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire NgR coding strand, or 
to only a portion thereof. Nucleic acid molecules encoding fragments, homologs, 
derivatives and analogs of a NgR protein of SEQ ID NO : 2, 4 or 14 or antisense 
nucleic acids complementary to a NgR nucleic acid sequence of SEQ ID NOs:l, 3 or 

15 13 are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence encoding NgR. The term 
"coding region" refers to the region of the nucleotide sequence comprising codons 
which are translated into amino acid residues (e.g., the protein coding region of human 

20 NgR corresponds to the coding region SEQ ID NO: 1, 3 or 13). In another 

embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region" 
of the coding strand of a nucleotide sequence encoding NgR The term "noncoding 
region" refers to 5' and 3 1 sequences which flank the coding region that are not 
translated into amino acids (i.e., also referred to as 5 ! and 3' untranslated regions). 

25 Antisense oligonucleotides are preferably directed to regulatory regions of a 

nucleotide sequence of SEQ ID NO: 1, 3, 13 or mRNA corresponding thereto, 
including, but not limited to, the initiation codon, TATA box, enhancer sequences, and 
the like. Given the coding strand sequences encoding NgR disclosed herein (e.g. , SEQ 
ID NO: 1, 3 or 13), antisense nucleic acids of the invention can be designed according 

30 to the rules of Watson and Crick or Hoogsteen base pairing. The antisense nucleic 
acid molecule can be complementary to the entire coding region of NgR mRNA, but 
more preferably is an oligonucleotide that is antisense to only a portion of the coding 
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or noncoding region of NgR mRNA. For example, the antisense oligonucleotide can 
be complementary to the region surrounding the translation start site of NgR mRNA. 
An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 
45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be 
5 constructed using chemical synthesis or enzymatic ligation reactions using procedures 
known in the art. For example, an antisense nucleic acid (e.g., an antisense 
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or 
variously modified nucleotides designed to increase the biological stability of the 
molecules or to increase the physical stability of the duplex formed between the 

10 antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 
substituted nucleotides can be used. 

Examples of modified nucleotides that can be used to generate the antisense 
nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5- 

15 cari)oxymethylaininomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1- 
methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2- 
methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 

20 mannosylqueosine, 5 f -methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 
methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl- 
2-thiouracil, 3 -(3 -amino-3 -N-2-carb oxypropyl) uracil, (acp3)w, and 2,6- 

25 diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically 
using an expression vector into which a nucleic acid has been subcloned in an antisense 
orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described further in the following 
subsection). 

30 The antisense nucleic acid molecules of the invention (preferably 

oligonucleotides of 10 to 20 nucleotides in length) are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA 



BNSDOCID: <WO_ 0229059A3JA> 



WO 02/029059 



PC17US01/31488 



-37- 

and/or genomic DNA encoding a NgR protein to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. Suppression of NgR 
expression at either the transcriptional or translational level is useful to generate 
cellular or animal models for diseases/conditions characterized by aberrant NgR 
5 expression. The hybridization can be by conventional nucleotide complementarity to 
form a stable duplex, or, for example, in the case of an antisense nucleic acid molecule 
that binds to DNA duplexes, through specific interactions in the major groove of the 
double helix. 

Phosphorothioate and methylphosphonate antisense oligonucleotides are 

10 specifically contemplated for therapeutic use by the invention. The antisense 
oligonucleotides may be further modified by adding poly-L-lysine, transferrin 
polylysine or cholesterol moieties at their 5' end. 

An example of a route of administration of antisense nucleic acid molecules of 
the invention includes direct injection at a tissue site. Alternatively, antisense nucleic 

15 acid molecules can be modified to target selected cells and then administered 

systemically. For example, for systemic administration, antisense molecules can be 
modified such that they specifically bind to receptors or antigens expressed on a 
selected cell surface, e.g., by linking the antisense nucleic acid molecules to peptides or 
antibodies that bind to cell surface receptors or antigens. The antisense nucleic acid 

20 molecules can also be delivered to cells using the vectors described herein. To achieve 
sufficient intracellular concentrations of antisense molecules, vector constructs in 
which the antisense nucleic acid molecule is placed under the control of a strong pol II 
or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention 

25 is an a-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual p-units, the strands run parallel to each other (Gaultier et al, (1987) Nucleic 
Acids Res. 15, 6625-6641). The antisense nucleic acid molecule can also comprise a 
2'-o-methylribonucleotide (Inoue et al, (1987) Nucleic Acids Res. 15, 613 1-6148) or a 

30 chimeric RNA-DNA analogue (Inoue et al, (1987) FEES Lett. 215, 327-330). 

The NgR sequences taught in the present invention facilitate the design of 
novel transcription factors for modulating NgR expression in native cells and animals, 
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and cells transformed or transfected with NgR polynucleotides. For example, the 
Cys 2 -His2 zinc finger proteins, which bind DNA via their zinc finger domains, have 
been shown to be amenable to structural changes that lead to the recognition of 
different target sequences. These artificial zinc finger proteins recognize specific target 
5 sites with high affinity and low dissociation constants, and are able to act as gene 
switches to modulate gene expression. Knowledge of the particular NgR target 
sequence of the present invention facilitates the engineering of zinc finger proteins 
specific for the target sequence using known methods such as a combination of 
structure-based modeling and screening of phage display libraries (Segal etal, (1999) 

10 Proc. Natl Acad Sci. USA 96, 2758-2763; Liu et al, (1997) Proc. Natl. Acad Sci. 

USA 94, 5525-5530; Greisman etal (1997) Science 275, 657-661; Choo etal, (1997) 
J. Mol. Biol. 273, 525-532). Each zinc finger domain usually recognizes three or more 
base pairs. Since a recognition sequence of 18 base pairs is generally sufficient in 
length to render it unique in any known genome, a zinc finger protein consisting of 6 

1 5 tandem repeats of zinc fingers would be expected to ensure specificity for a particular 
sequence (Segal et aL, (1999), above). The artificial zinc finger repeats, designed 
based on the promoter of NgR sequences, are fused to activation or repression 
domains to promote or suppress NgR expression (Liu et al., (1997), above). The 
promoter of NgR may be obtained by standard methods known to one of ordinary skill 

20 in the art with the disclosure contained herein and knowledge of the NgR sequence. 
Alternatively, the zinc finger domains can be fused to the TATA box-binding factor 
(TBP) with varying lengths of linker region between the zinc finger peptide and the 
TBP to create either transcriptional activators or repressors (Kim et al, (1997) Proc. 
Natl. Acad. Sci. USA 94, 3616-3620. Such proteins and polynucleotides that encode 

25 them, have utility for modulating NgR expression in vivo in both native cells, animals 
and humans; and/or cells transfected with NgR-encoding sequences. The novel 
transcription factor can be delivered to the target cells by transfecting constructs that 
express the transcription factor (gene therapy), or by introducing the protein. 
Engineered zinc finger proteins can also be designed to bind RNA sequences for use in 

30 therapeutics as alternatives to antisense or catalytic RNA methods (McColl et al, 
(1997) Proc. Natl. Acad Sci. USA 96, 9521-9526); Wu et al, (1995) Proc. Natl. 
Acad Set USA 92, 344-348). The present invention contemplates methods of 
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designing such transcription factors based on the gene sequence of the invention, as 
well as customized zinc finger proteins, that are useful to modulate NgR expression in 
cells (native or transformed) whose genetic complement includes these sequences. 

5 Ribozymes and PNA moieties 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity that are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they 
have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes, 

10 described in Haselhoff and Gerlach (1988) Nature 334, 585-591) can be used to 
catalytically cleave NgR mRNA transcripts to thereby inhibit translation of NgR 
mRNA. A ribozyme having specificity for a NgR-encoding nucleic acid can be 
designed based upon the nucleotide sequence of a NgRDNA disclosed herein (Le. 9 
SEQ ID NOs:l, 3 or 13). For example, a derivative of a TetrahymenaL-19 IVS RNA 

15 can be constructed in which the nucleotide sequence of the active site is 

complementary to the nucleotide sequence to be cleaved in a NgR-encoding mRNA. 
See, e.g., Cech etal U.S. Patent No. 4,987,071; and Cechetal. U.S. Patent No. 
5,1 16,742. Alternatively, NgR mRNA can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel et ah, 

20 (1993) Science 261, 141 1-1418. 

Alternatively, NgR gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the NgR (e.g. , the NgR 
promoter and/or enhancers) to form triple helical structures that prevent transcription 
of the NgR gene in target cells. See generally, Helene (1991) Anticancer DrugDes. 6: 

25 569-584; Helene. et ah, (1992) Ann. N.Y. Acad. Sci. 660:27-36; and Maher (1992) 
BioEssays 14, 807-815. 

In various embodiments, the nucleic acids of NgR can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, 
hybridization, or solubility of the molecule. For example, the deoxyribose phosphate 

30 backbone of the nucleic acids can be modified to generate peptide nucleic acids (see 
Hyrup et ah, (1996) Bioorg. Med Chem. Lett. 4, 5-23). As used herein, the terms 
"peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, in 
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which the deoxyribose phosphate backbone is replaced by a pseudopeptide backbone 
and only the four natural nucleobases are retained. The neutral backbone of PNAs has 
been shown to allow for specific hybridization to DNA and RNA under conditions of 
low ionic strength. The synthesis of PNA oligomers can be performed using standard 

5 solid phase peptide synthesis protocols as described in Hyrup et al t (1996) above; 
Perry-O'Keefe etal, (1996) Proc. Natl. Acad. Sci. USA 93,14670-14675. 

PNAs of NgR can be used in therapeutic and diagnostic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific 
modulation of gene expression by, e.g., inducing transcription or translation arrest or 

10 inhibiting replication. PNAs of NgR can also be used, e.g., in the analysis of single 
base pair mutations in a gene by, e.g., PNA directed PCR clamping; as artificial 
restriction enzymes when used in combination with other enzymes, e.g., SI nucleases 
(Hyrup (1996), above); or as probes or primers for DNA sequence and hybridization 
(Hyrup et al, (1996), above; Perry-O'Keefe (1996), above). 

15 In another embodiment, PNAs of NgR can be modified, e.g., to enhance their 

stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by 
the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques 
of drug delivery known in the art. For example, PNA-DNA chimeras of NgR can be 
generated that may combine the advantageous properties of PNA and DNA. Such 

20 chimeras allow DNA recognition enzymes, e.g., RNase H and DNA polymerases, to 
interact with the DNA portion while the PNA portion would provide high binding 
affinity and specificity. PNA-DNA chimeras can be linked using linkers of appropriate 
lengths selected in terms of base stacking, number of bonds between the nucleobases, 
and orientation (Hyrup (1996), above). The synthesis of PNA-DNA chimeras can be 

25 performed as described in Hyrup (1996), above and Finn et al (1996) Nucleic Acids 
Res. 24, 3357-3363. For example, a DNA chain can be synthesized on a solid support 
using standard phosphoramidite coupling chemistry, and modified nucleoside analogs, 
e.g., 5 f -(4-methoxytrityl) amino-5-deoxy-thymidine phosphoramidite, can be used 
between the PNA and the 5 1 end of DNA (Mag et al. (1989) Nucleic Acids Res. 17, 

30 973-988). PNA monomers are then coupled in a stepwise manner to produce a 

chimeric molecule with a 5' PNA segment and a 3 1 DNA segment (Finn et al (1996), 
above). Alternatively, chimeric molecules can be synthesized with a 5' DNA segment 
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and a 3" PNA segment. See, Petersen et al. (1975) Bioorg. Med Chem. Lett. 5:1119- 
1124. 

In other embodiments, the oligonucleotide may include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
5 transport across the cell membrane (see Letsinger et al., (1989) Proc. Natl Acad Sci. 
USA 86, 6553-6556; Lemaitre etal, (1987) Proc. Natl. Acad. Sci. USA 84, 648-652; 
PCT Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT 
Publication No. WO 89/10134). In addition, oligonucleotides can be modified with 
hybridization triggered cleavage agents (see, e.g., Krol et al., (1988) Biotechniques 6, 

10 958-976) or intercalating agents (see, e.g., Zon (1988) Phann. Res. 5, 539-549). To 
this end, the oligonucleotide may be conjugated to another molecule, e.g., a peptide, a 
hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
cleavage agent, etc. 

Automated sequencing methods can be used to obtain or verify the nucleotide 

15 sequence of NgR. The NgR nucleotide sequences of the present invention are believed 
to be 100% accurate. However, as is known in the art, nucleotide sequence obtained 
by automated methods may contain some errors. Nucleotide sequences determined by 
automation are typically at least about 90%, more typically at least about 95% to at 
least about 99.9% identical to the actual nucleotide sequence of a given nucleic acid 

20 molecule. The actual sequence may be more precisely determined using manual 

sequencing methods, which are well known in the art. An error in a sequence which 
results in an insertion or deletion of one or more nucleotides may result in a frame shift 
in translation such that the predicted amino acid sequence will differ from that which 
would be predicted from the actual nucleotide sequence of the nucleic acid molecule, 

25 starting at the point of the mutation. 

Polypeptides 

The invention also provides purified and isolated mammalian NgR polypeptides 
encoded by a polynucleotide of the invention. Presently preferred is a human NgR 
30 polypeptide comprising the amino acid sequence set forth in SEQ ID NO:2 or SEQ ID 
NO: 14. Another preferred embodiment is a mouse NgR polypeptide comprising the 
amino acid sequence of NgR3, as set forth in SEQ ID NO:4. 
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One aspect of the invention pertains to isolated NgR proteins, and biologically 
active portions thereof, or derivatives, fragments, analogs or homologs thereof. Also 
provided are polypeptide fragments suitable for use as immunogens to raise anti-NgR 
antibodies. Preferably, fragments of NgR proteins comprise at least one biological 
5 activity of NgR. In one embodiment, native NgR proteins can be isolated from cells or 
tissue sources by an appropriate purification scheme using standard protein purification 
techniques. In another embodiment, NgR proteins are produced by recombinant DNA 
techniques. Alternative to recombinant expression, a NgR protein or polypeptide can 
be synthesized chemically using standard peptide synthesis techniques. 

10 The invention also embraces polypeptides that have at least 99%, at least 95%, 

at least 90%, at least 85%, at least 80%, at least 75%, at least 70%, at least 65%, at 
least 60%, at least 55%, at least 50% or at least 45% identity and/or homology to the 
preferred polypeptide of the invention. In addition, the invention embraces 
polypeptides having the consensus sequence shown in SEQ ID NO: 6, shown in Table 

15 5) excluding the previously characterized NgR ("NgRl"), and polypeptides comprising 
at least about 90% of the consensus sequence. 

The term "percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over that region of comparison, determining the number of 
positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case 

20 of nucleic acids) occurs in both sequences to yield the number of matched positions, 

dividing the number of matched positions by the total number of positions in the region 
of comparison (i.e., the window size), and multiplying the result by 100 to yield the 
percentage of sequence identity. The term "substantial identity" as used herein denotes 
a characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a 

25 sequence that has at least 80 percent sequence identity, preferably at least 85 percent 
identity and often 90 to 95 percent sequence identity, more usually at least 99 percent 
sequence identity as compared to a reference sequence over a comparison region. 

In one aspect, percent homology is calculated as the percentage of amino acid 
residues in the smaller of two sequences which align with identical amino acid residue 

30 in the sequence being compared, when four gaps in a length of 100 amino acids may be 
introduced to maximize alignment (Dayhofl^ in ATLAS OF PROTEIN SEQUENCE AND 
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STRUCTURE, Vol. 5, p. 124, National Biochemical Research Foundation, Washington, 
D.C. (1972), incorporated herein by reference). 

A determination of homology or identity is typically made by a computer 
homology program known in the art. An exemplary program is the Gap program 
5 (Wisconsin Sequence Analysis Package, Version 8 for UNIX, Genetics Computer 

Group, University Research Park, Madison, WI) using the default settings, which uses 
the algorithm of Smith and Waterman (Adv. Appl Math., 1981, 2, 482-489, which in 
incorporated herein by reference in its entirety). Employing the GAP software 
provided in the GCG program package, (see Needleman and Wunsch (1970) J. Mol 

10 Biol 48, 443-453) the following settings for nucleic acid sequence comparison may be 
used: GAP creation penalty of 5.0 and GAP extension penalty of 0.3, the coding 
region of the analogous nucleic acid sequences referred to above exhibits a degree of 
identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99%, with 
the CDS (encoding) part of the DNA sequence shown in SEQ ID NOs:l, 3 or 13. 

15 BestFit was originally written for Version 1 .0 by Paul Haeberli from a careful reading 
of the papers by Needleman and Wunsch (1970), above, and Smith and Waterman 
(1981), above. The following Bestfit settings for nucleic acid sequence comparison 
may be used: GAP creation penalty of 8.0 and GAP extension penalty of 2, the coding 
region of the analogous nucleic acid sequences referred to above exhibits a degree of 

20 identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99%, with the 
CDS (encoding) part of the amino acid sequence shown in SEQ ID NOs:2, 4 or 14. 

Alternatively, homology may be determined by hybridization analysis wherein a 
nucleic acid sequence is hybridized to the complement of a sequence encoding the 
aforementioned proteins under stringent, moderately stringent, or low stringent 

25 conditions. See e.g. Ausubel, et al , (Eds.) CURRENT PRqTOCOLS IN MOLECULAR 
Biology, John Wiley & Sons, New York, NY, 1993, and below. 

Polypeptides of the invention may be isolated from natural cell sources or may 
be chemically synthesized, but are preferably produced by recombinant procedures 
involving host cells of the invention. 

30 An "isolated" or "purified" protein or biologically active portion thereof is 

substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the NgR protein is derived, or substantially free from 
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chemical precursors or other chemicals when chemically synthesized. The language 
"substantially free of cellular material" includes preparations of NgR protein in which 
the protein is separated from cellular components of the cells from which it is isolated 
or recombinantly produced. In one embodiment, the language "substantially free of 

5 cellular material" includes preparations of NgR protein having less than about 30% (by 
dry weight) of non-NgR protein (also referred to herein as a "contaminating protein"), 
more preferably less than about 20% of non-NgR protein, still more preferably less 
than about 10% of non-NgR protein, and most preferably less than about 5% non-NgR 
protein. When the NgR protein or biologically active portion thereof is recombinantly 

10 produced, it is also preferably substantially free of culture medium, i.e., culture 

medium represents less than about 20%, more preferably less than about 10%, and 
most preferably less than about 5% of the volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" 
includes preparations of NgR protein in which the protein is separated from chemical 

15 precursors or other chemicals that are involved in the synthesis of the protein. In one 
embodiment, the language "substantially free of chemical precursors or other 
chemicals" includes preparations of NgR protein having less than about 30% (by dry 
weight) of chemical precursors or non-NgR chemicals, more preferably less than about 
20% chemical precursors or non-NgR chemicals, still more preferably less than about 

20 10% chemical precursors or non-NgR chemicals, and most preferably less than about 
5% chemical precursors or non-NgR chemicals. 

Biologically active portions of a NgR protein include peptides comprising 
amino acid sequences sufficiently homologous to or derived from the amino acid 
sequence of the NgR protein, e.g., the amino acid sequence shown in SEQ ID NO:2, 4 

25 or 14 that include fewer amino acids than the full length NgR proteins, and exhibit at 
least one activity of a NgR protein. Typically, biologically active portions comprise a 
domain or motif with at least one activity of the NgR protein. A biologically active 
portion of a NgR protein can be a polypeptide which is, for example, 10, 25, 50, 100 
or more amino acids in length. 

30 A biologically active portion of a NgR protein of the present invention may 

contain at least one of the features that is conserved between the NgR proteins (e.g., a 
conserved cysteine as the N-terminus of the mature protein, four conserved cysteines 
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in the N-terminus before a leucine-rich region, four conserved cysteines C-terminal 
with respect to a leucine repeat region, eight leucine-rich repeats, and a hydrophobic 
C-terminus). An alternative biologically active portion of a NgR protein may contain 
at least two of the above-identified domains. Another biologically active portion of a 

5 NgR protein may contain at least three of the above-identified domains. Yet another 
biologically active portion of a NgR protein of the present invention may contain at 
least four of the above-identified domains. 

Moreover, other biologically active portions, in which other regions of the 
protein are deleted, can be prepared by recombinant techniques and evaluated for one 

10 or more of the functional activities of a native NgR protein. 

In an embodiment, the NgR protein has an amino acid sequence shown in SEQ 
DO NO: 2, 4 or 14. In other embodiments, the NgR protein is substantially homologous 
to SEQ ID NO:2, 4 or 14 and retains the functional activity of the protein of SEQ ID 
NO: 2, 4 or 14, yet differs in amino acid sequence due to natural allelic variation or 

15 mutagenesis, as described in detail below. 

Accordingly, in another embodiment, the NgR protein is a protein that 
comprises an amino acid sequence at least about 45% homologous to the amino acid 
sequence of SEQ ID NO:2 or SEQ ID NO:4 or SEQ ID NO: 14 and retains the 
functional activity of the NgR proteins of SEQ ID NO:2, 4 or 14. 

20 Use of mammalian host cells is expected to provide for such post-translational 

modifications (e.g., glycosylation, truncation, lipidation and phosphorylation) as may 
be needed to confer optimal biological activity on recombinant expression products of 
the invention. Glycosylated and non-glycosylated forms of NgR polypeptides are 
embraced by the invention. 

25 The invention also embraces variant (or analog) NgR polypeptides. In one 

example, insertion variants are provided wherein one or more amino acid residues 
supplement a NgR amino acid sequence. Insertions may be located at either or both 
termini of the protein, or may be positioned within internal regions of the NgR amino 
acid sequence. Insertional variants with additional residues at either or both termini 

30 can include, for example, fusion proteins and proteins including amino acid tags or 
labels. 
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Insertion variants include NgR polypeptides wherein one or more amino acid 
residues are added to a NgR acid sequence or to a biologically active fragment thereof 
Variant products of the invention also include mature NgR products, i.e., NgR 
products wherein leader or signal sequences are removed, with additional amino 
5 terminal residues. The additional amino terminal residues may be derived from another 
protein, or may include one or more residues that are not identifiable as being derived 
from specific proteins. NgR products with an additional methionine residue at 
position -1 (Met -I -NgR) are contemplated, as are variants with additional methionine 
and lysine residues at positions -2 and -1 (Mef 2 -Lys" 1 -NgR). Variants of NgR with 
1 0 additional Met, Met-Lys, Lys residues (or one or more basic residues in general) are 
particularly useful for enhanced. recombinant protein production in bacterial host cells. 

Polypeptide Variants 

The invention also embraces NgR variants having additional amino acid 

15 residues which result from use of specific expression systems. 

As used herein, a NgR "chimeric protein" or "fusion protein" comprises a NgR 
polypeptide operatively linked to a non-NgR polypeptide. A "NgR polypeptide" refers 
to a polypeptide having an amino acid sequence corresponding to NgR, whereas a 
"non-NgR polypeptide" refers to a polypeptide having an amino acid sequence 

20 corresponding to a protein that is not homologous to the NgR protein, e.g. , a protein 
that is different from the NgR protein and that is derived from the same or a different 
organism. Within a NgR fusion protein the NgR polypeptide can correspond to all or a 
portion of a NgR protein. In one embodiment, a NgR fusion protein comprises at least 
one biologically active portion of a NgR protein. In another embodiment, a NgR 

25 fusion protein comprises at least two biologically active portions of a NgR protein. In 
yet another embodiment, a NgR fusion protein comprises at least three biologically 
active portions of a NgR protein. Within the fusion protein, the term "operatively 
linked" is intended to indicate that the NgR polypeptide and the non-NgR polypeptide 
are fused in-frame to each other. The non-NgR polypeptide can be fused to the N- 

30 terminus or C-terminus of the NgR polypeptide. 

For example, in one embodiment a NgR fusion protein comprises a NgR 
domain operably linked to the extracellular domain of a second protein. Such fusion 
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proteins can be further utilized in screening assays for compounds which modulate 
NgR activity (such assays are described in detail below). 

For example, use of commercially available vectors that express a desired 
polypeptide as part of a glutathione-S-transferase (GST) fusion product provides the 
5 desired polypeptide having an additional glycine residue at position -1 after cleavage of 
the GST component from the desired polypeptide. 

In another embodiment, the fusion protein is a NgR protein containing a 
heterologous signal sequence at its N-terminus. For example, the native NgR signal 
sequence (i.e., amino acids 1-30 of SEQ ID NO:2 and amino acids 1-40 of SEQ ID 

10 NO:4) can be removed and replaced with a signal sequence from another protein. In 
certain host cells (e.g., mammalian host cells), expression and/or secretion NgR can be 
increased through use of a heterologous signal sequence. 

In yet another embodiment, the fusion protein is a NgR-immunoglobulin fusion 
protein in which the NgR sequences comprising one or more domains are fused to 

15 sequences derived from a member of the immunoglobulin protein family. The NgR- 
immunoglobulin fusion proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject to inhibit an interaction 
between NgR ligand and a NgR protein on the surface of a cell, to thereby suppress 
NgR-mediated signal transduction in vivo. NgR-immunoglobulin fusion proteins can be 

20 used to affect the bioavailability of a NgR cognate ligand. Inhibition of the NgR 
ligand/NgR interaction may be useful therapeutically for both the treatment of 
proliferative and diflferentiative disorders, as well as modulating (e.g., promoting or 
inhibiting) cell survival. Moreover, the NgR-immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce anti-NgR antibodies in a subject, to 

25 purify NgR ligands, and in screening assays to identify molecules that inhibit the 
interaction of NgR with NgR ligand. 

A NgR chimeric or fusion protein of the invention can be produced by standard 
recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 

30 techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, 

restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive 
ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining and 
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enzymatic ligation. In another embodiment, the fusion gene can be synthesized by 
conventional techniques including automated DNA synthesizers. Alternatively, PCR 
amplification of gene fragments can be carried out using anchor primers that give rise 
to complementary overhangs between two consecutive gene fragments that can 
5 subsequently be annealed and reamplified to generate a chimeric gene sequence (see, 
for example, Ausubel et ah (Eds.) Current Protocols in Molecular Biology, 
John Wiley & Sons, 1992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide). A NgR- 
encoding nucleic acid can be cloned into such an expression vector such that the fusion 
10 moiety is linked in-frame to the NgR protein. 

Variants resulting from expression in other vector systems are also 
contemplated. 

Insertional variants also include fusion proteins wherein the amino terminus 
and/or the carboxy terminus of NgR is/are fused to another polypeptide. 

15 In another aspect, the invention provides deletion variants wherein one or more 

amino acid residues in a NgR polypeptide are removed. Deletions can be effected at 
one or both termini of the NgR polypeptide, or with removal of one or more 
non-terminal amino acid residues of NgR. Deletion variants, therefore, include all 
fragments of a NgR polypeptide. 

20 The invention also embraces polypeptide fragments of the sequence set forth in 

SEQ ID NO:2, 4 or 14 wherein the fragments maintain biological (e.g., ligand binding 
and/or intracellular signaling) immunological properties of a NgR polypeptide. 
Fragments comprising at least 4, 5, 10, 15, 20, 25, 30, 35, or 40 consecutive amino 
acids of SEQ ID NO:2, 4 or 14 are contemplated by the invention. Preferred 

25 polypeptide fragments display antigenic properties unique to, or specific for, human 
NgR and its allelic and species homologs. Fragments of the invention having the 
desired biological and immunological properties can be prepared by any of the methods 
well known and routinely practiced in the art. 

In still another aspect, the invention provides substitution variants of NgR 

30 polypeptides. Substitution variants include those polypeptides wherein one or more 
amino acid residues of a NgR polypeptide are removed and replaced with alternative 
residues. In one aspect, the substitutions are conservative in nature; however, the 
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invention embraces substitutions that are also non-conservative. Conservative 
substitutions for this purpose may be defined as set out in Tables 2, 3, or 4 below. 
Table 1. 





Column I 


Column EE 


(based on a NTLRRCT 


(Rl, R2, R3) 


(R2+R3 only) 


domain) 






x, 


G, R, M 




x 2 


A,D,C 




x 3 


V, T 




x 4 


N.P.S 




x 5 


E,A,S 






nothing, K 


nothing 


x 7 


V,M,P 




x 8 


T, V 


V 


x, 


Q,P 


Q 




Q,A 


Q 


x„ 


Q,H,N 






G, N 


N 




L,F 


F 


X 14 


Q,A,S 




x 15 


A,S 




x 16 


V,I 




x 17 


V,T,E,L 






S, G 




X 19 


L,I 






A,E,V,P 




x 21 


A,S,D 




^22 


S,T 




X* 


Q,E 
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X aa # 


Column I 


Column O 


(based on a NTLRRCT 


(Rl, R2, R3) 


(R2+R3 only) 


domain) 






X 24 


ivl 




. X 25 


Q,H 


Q 




N,G 


N 


X 27 


R.L 






T,G,R,S 




x 29 


F,L,T,H 




^•30 


L,V 


L 


X 31 


Q,R,P 




x 32 


Q,P,A 


P 


x 33 


G,A 


G 


X 34 


H,T,S 




x 35 


S,G,R 




x 36 


P,S,A 




X 3 7 


C, nothing 


nothing 


X 38 


R nothing 


nothing 


X 39 


A.N 




X 40 


M,L 




X 41 


V,L,T 




X 42 


T,I 


T 


X 43 


L,I 






Y,F,H 




X 45 


N,V 


N 


x^ 


I,L 






T,S,A 






F,Y,T,R 




X 49 


A,H,Y,D 
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X aa # 


Column I 


Column II 


(based on a NTLRRCT 


(Rl, R2, R3) 


(R2+R3 only) 


domain) 








P,A 


P 


x 51 


N,S,G,A 






T,A 


T 




E,RT 




X54 


G,H 




x 5S 


F,L 






V,Q,H 




^57 


H,AL 






E,Q 


E 


^59 


G,S 


G 




R,A 


R 


X« 


Q.H 


Q 




R, H 


H 


^63 


T,S 




^64 


L,V 


L 


X« 


AE.D 




x« 


E.D.A 




X 6 7 


Q,H 


Q 




V,E,G 






K,R 






H,Q 




x 71 


AS.T 




X72 


Y,H 




X73 


Y,D 


Y 


X 74 


K,R 






G,Q 





BNSDOC1D: <WO 0229059A3_IA> 



WO 02/029059 



PCT/US01/31488 



-52- 





Column I 


Column II 


(based on a NTLRRCT 


(Rl, R2, R3) 


(R2+R3 only) 


domain) 






x 76 


S,Q 


S 




A,S,E 




x 78 


P,G 


P 


X79 


AGP 




X g0 


G,N 




X S i 


I,V,L 




X 82 


G,R 




Xg 3 


H,V,A 




x 84 


S,A 


s 


x g5 


D,E I 




X S 6 


H,S,A 




X 87 


I,L 




X 88 


E,L,Q 




X 89 


Y,H,A 




X90 


Q,P 


Q 




D, N 




X92 


I,L,T 




X93 


v,AR 




X94 


v,AG 




X95 


S,T 


s 


X96 


K,R 




X97 


L,I 


L j 


Xjg 


W,R,S 




X99 


S,L 




X 100 


L,V 


L 


X101 


G,T,P 
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Column I 


Column n 


(based on a NTLRRCT 


(Rl, R2, R3) 


(R2+R3 only) 


domain) 








Q,P,E 




-^■103 


G,H,R 




X 104 


I,T,V,A 






V,G,H 




X 106 


N,S 




^107 


E,G,Q 




x 108 


Q,R 






L,V 






Q,A 




^111 


W,G,H 




X m 


H,RP 




•^113 


K,A,H 




^114 


H,R 




^115 


D,G 




^116 


H,R,S,G 




Xll7 


T,M 


• 


^118 


T,I 






F,Y 




X 120 


N,A 




^121 


S,N 




X122 


T,A,S 






E,S,A 




Xj24 


Q,P 




X125 


G,T 






D3 


D 


X 127 


C,A 
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Column I 


Column n 


(based on a NTLRRCT 


(Rl, R2, R3) 


(R2+R3 only) 


domain) 






•^128 


P,D 




X 129 


V,G,P,R 




^130 


A,S 




^131 


E,Q 


Q 


^132 


F,Y 


F 


-^-133 


G,A,D 




•^134 


A,P 






D,A,V 




^136 


G,D 




X 137 


A,E 




-^■138 


S,P 




^139 


E,A 




^140 


L,F 






RQ 




^142 


R,K 


R 


^143 


RK 


R 


^144 


F,A 




^143 


G,V 




^146 


A,D,E 




^147 


T,P 




^148 


A.V.S 




X 149 


T,S,L 




X150 


E,G,P,Q 




^151 


L,E,R 




^152 


RL 


R 


X153 


G,D 
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Column I 


Column II 


(based on a NTLRRCT 


(Rl, R2, R3) 


(R2+R3 only) 


domain) 






-^154 


Q,H,A 




^155 


Q,R 




^156 


K,R 




^157 


L,A,R 




^158 


R,A 


R 


^159 


V.AJE 




-^160 


BAN 




^161 


F,L 


F 


^162 


R,Q 




^163 


N,A,G 





Variant polypeptides include those wherein conservative substitutions have 
been introduced by modification of polynucleotides encoding polypeptides of the 
invention. Amino acids can be classified according to physical properties and 
15 contribution to secondary and tertiary protein structure. A conservative substitution is 
recognized in the art as a substitution of one amino acid for another amino acid that 
has similar properties. Exemplary conservative substitutions are set out in Table 2 
(from WO 97/09433, page 10, published March 13, 1997 (PCT/GB96/02197, filed 
9/6/96), immediately below. 



20 



Table 2 
Conservative Substitutions I 
SIDE CHAIN 

CHARACTERISTIC AMINO ACID 

Aliphatic 

Non-polar GAP 

ILV 
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Polar - uncharged C S T M 

NQ 

Polar - charged D E 

KR: 

Aromatic HFWY 

Other NQDE 
5 Alternatively, conservative amino acids can be grouped as described in 

Lehninger, [BIOCHEMISTRY, Second Edition; Worth Publishers, Inc. NY, NY 

(1975), pp. 71-77] as set out in Table 3, immediately below. 

Table 3 

Conservative Substitutions II 

10 

SIDE CHAIN 



CHARACTERISTIC 


AMINO ACID 


Non-polar (hydrophobic) 




A. Aliphatic: 


ALIVP 


B. Aromatic: 


F W 


C. Sulfor-containing: 


M 


D. Boderline: 


G 


Uncharged-polar 




A. Hydroxyl: 


STY 


B. Amides: 


NQ 


C. Sylfhydryl: 


C 


D. Boderline: 


G 


Positively Charged (Basic): 


KRH 


Negatively Charged (Acidic): 


DE 



As still another alternative, exemplary conservative substitutions are set out in 
25 Table 4, below. 



Table 4 

Conservative Substitutions IH 

Original Exemplary Substitution 

Residue 



Ala (A) 


Val, Leu, Be 


Arg(R) 


Lys, Gin, Asn 


Asn (N) 


Gin, His, Lys, Arg 


Asp(D) 


Glu 


Cys(C) 


Ser 


Gln(Q) 


Asn 


Giu(E) 


Asp 
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XjJ.5 yTx J 


/VM1 3 vjrillj JL^yS, /VI g 


Tlo /TV 

ne (i; 


l^eu, vai, ivlcl, /via, rne, 


t mi n \ 


Tlpk X/q1 TV/To* Ala "PVio 

lie, vai, iviei, /via, i xie 




/VI VX1I1, /VM1 


lVLCt ^IVJL/ 


T All OhA Tlo 

iXU, rile, 116 


Php fiP^ 


1_/CU 5 V ol, -LIC, /Via. 


Pro (P) 


Gly 


Ser (S) 


Thr 


Thr(T) 


Ser 


Trp(W) 


Tyr 


Tyr(Y) 


Trp, Phe, Thr, Ser 


Val(V) 


lie, Leu, Met, Phe, Ala 



In addition, amino acid residues that are conserved among family members of 
15 the NgR proteins of the present invention, as indicated by the alignment presented 
herein, are also predicted to be particularly unamenable to alteration. For example, 
NgR proteins of the present invention can contain at least one domain that is a typically 
conserved region in NgRs. Examples of these conserved domains include, e.g., 
leucine-rich repeat domain. Amino acid residues that are not conserved or are only 
20 semi-conserved among members of the NgR proteins may be readily amenable to 
alteration. 

Full-length NgRs have an LRR region characterized by the amino acid 
consensus sequence shown in SEQ ID NO: 19. At least some full-length NgRs also 
include a CT signaling (CTS) domain and a GPI domain. 
25 The NgR domain designations used herein are defined as follows: 



Domain 


hNgRl 
SEQ ID: 5 


mNgRl 
SEQ ID 
NO: 17 


hNgR2 
SEQ ID: 2 


HNgR3 
SEQ ID: 14 


mNgR3 
SEQ ID: 4 


Signal Seq. 


1-26 


1-26 


1-30 




1-40 


LRRNT 


27-56 


27-56 


31-59 




41-69 


LRR1 


57-81 


57-81 


60-82 


5-27 


70-92 ' 


LRR2 


82-105 


82-105 


83-106 


28-51 


93-106 


LRR3 


106-130 


106-130 


107-131 


52-76 


106-141 


LRR4 


131-154 


131-154 


132-155 


77-100 


142-165 
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LRR5 


155-178 


155-178 


156-179 


101-124 


166-189 


LRR6 


179-202 


179-202 


180-203 


125-148 


190-213 


LRR7 


203-226 


203-226 


204-227 


149-172 


214-237 


LRR8 


227-250 


227-250 


228-251 


173-196 


238-261 


LRRCT 


260-309 


260-309 


261-310 


206-255 


271-320 


CTS 
(CT 

Signaling) 


310-445 


310-445 


311-395 


256-396 


321^38 


GPI 


446-473 


456-473 


396-420 


370-392 


439-462 



10 

In some embodiments of the invention, the above domains are modified. 
Modification can be in a manner that preserves domain functionality. Modification can 
include addition, deletion or substitution of certain amino acids. Exemplary 
modifications include conservative amino acid substitutions. Preferably such 

15 substitutions number 20 or fewer per 100 residues. More preferably, such 

substitutions number 10 or fewer per 100 residues. Further exemplary modifications 
include addition of flanking sequences of up to five amino acids at the N terminus 
and/or C terminus of one or more of the domains. 

In some embodiments, the isolated nucleic acid molecule encodes a polypeptide 

20 at least about 70%, 80%, 90%, 95%, 98%, and most preferably at least about 99% 
homologous to SEQ ID NO:2, 4 or 14. 

Mutations can be introduced into SEQ ID NOS:l, 3 or 13 by standard 
techniques, e.g., site-directed mutagenesis and PCR-mediated mutagenesis. 
Conservative amino acid substitutions can be made at one or more amino acid residues 

25 predicted to be non-essential. Alternatively, mutations can be introduced randomly 
along a NgR coding sequence. This can be accomplished, e.g., by saturation 
mutagenesis. The resulting mutants can be screened for NgR biological activity. 
Biological activities of NgR may include but are not limited to: (1) protein:protein 
interactions, e.g., with other NgRs or other cell-surface proteins involved in Nogo- 

30 related signaling; (2) complex formation with a NgR ligand; (3) binding to an anti- 
NgR antibody. 
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It should be understood that the definition of polypeptides of the invention is 
intended to include polypeptides bearing modifications other than insertion, deletion, 
or substitution of amino acid residues. By way of example, the modifications may be 
covalent in nature, and include for example, chemical bonding with polymers, lipids, 
5 other organic and inorganic moieties. Such derivatives may be prepared to increase 
circulating half-life of a polypeptide, or may be designed to improve the targeting 
capacity of the polypeptide for desired cells, tissues or organs. Similarly, the invention 
further embraces NgR polypeptides that have been covalently modified to include one 
or more water-soluble polymer attachments such as polyethylene glycol, 

10 polyoxyethylene glycol or polypropylene glycol. Variants that display ligand binding 
properties of native NgR and are expressed at higher levels, as well as variants that 
provide for constitutively active receptors, are particularly useful in assays of the 
invention; the variants are also useful in providing cellular, tissue and animal models of 
diseases/conditions characterized by aberrant NgR activity. 

1 5 Chemically modified NgR polypeptide compositions in which the NgR 

polypeptide is linked to a polymer are included within the scope of the present 
invention. The polymer may be water soluble to prevent precipitation of the protein in 
an aqueous environment, such as a physiological environment. Suitable water-soluble 
polymers may be selected from the group consisting of, for example, polyethylene 

20 glycol (PEG), monomethoxypolyethylene glycol, dextran, cellulose, or other 
carbohydrate based polymers, poly-(N-vinyl pyrrolidone) polyethylene glycol, 
polypropylene glycol homopolymers, a polypropylene oxide/ethylene oxide copolymer 
polyoxyethylated polyols {e.g. glycerol) and polyvinyl alcohol. The selected polymer is 
usually modified to have a single reactive group, such as an active ester for acylation or 

25 an aldehyde for alkylation, so that the degree of polymerization may be controlled. 
Polymers may be of any molecular weight, and may be branched or unbranched, and 
mixtures of such polymers may also be used. When the chemically modified NgR 
polymer is destined for therapeutic use, pharmaceutically acceptable polymers will be 
selected for use. 

30 When the polymer is to be modified by an acylation reaction, the polymer 

should have a single reactive ester group. Alternatively, if the polymer is to be 
modified by reductive alkylation, the polymer should have a single reactive aldehyde 
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group. A preferred reactive aldehyde is polyethylene glycol propionaldehyde, which is 
water stable, or mono Cl-CIO alkoxy or aryloxy derivatives thereof (see U.S. Patent 
No. 5,252,714, incorporated by reference herein in its entirety). 

Pegylation of NgR polypeptides may be carried out by any of the pegylation 
5 reactions known in the art, as described, for example, in the following references: 

Focus on Growth Factors 3, 4-10 (1992); EP 0 154 316; and EP 0 401 384 (each of 
which is incorporated by reference herein in its entirety). Preferably, the pegylation is 
carried out via an acylation reaction or an alkylation reaction with a reactive 
polyethylene glycol molecule (or an analogous reactive water-soluble polymer). A 

10 preferred water-soluble polymer for pegylation of polypeptides such as NgR is 
polyethylene glycol (PEG). As used herein, "polyethylene glycol" is meant to 
encompass any of the forms of PEG that have been used to derivatize other proteins, 
such as mono (Cl-CIO) alkoxy- or aryloxy-polyethylene glycol. 

Chemical derivatization of NgR polypeptides may be performed under any 

15 suitable conditions used to react a biologically active substance with an activated 

polymer molecule. Methods for preparing pegylated NgR polypeptides will generally 
comprise the steps of (a) reacting the polypeptide with polyethylene glycol, such as a 
reactive ester or aldehyde derivative of PEG, under conditions whereby NgR 
polypeptide becomes attached to one or more PEG groups, and (b) obtaining the 

20 reaction products. It will be apparent to one of ordinary skill in the art to select the 

optimal reaction conditions or the acylation reactions based on known parameters and 
the desired result. 

Pegylated and other polymer:NgR polypeptides may generally be used to treat 
conditions that may be alleviated or modulated by administration of the NgR 

25 polypeptides described herein. However, the chemically-derivatized polymenNgR 
polypeptide molecules disclosed herein may have additional activities, enhanced or 
reduced biological activity, or other characteristics, such as increased or decreased 
half-life, as compared to the nonderivatized molecules. The NgR polypeptides, 
fragments thereof variants and derivatives, may be employed alone, together, or in 

30 combination with other pharmaceutical compositions. The cytokines, growth factors, 
antibiotics, antiinflammatories and/or chemotherapeutic agents as is appropriate for the 
indication being treated. 
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The present invention provides compositions comprising purified polypeptides 
of the invention. Preferred compositions comprise, in addition to the polypeptide of 
the invention, a pharmaceutical^ acceptable (i.e., sterile and non-toxic) liquid, 
semisolid, or solid diluent that serves as a pharmaceutical vehicle, excipient or medium. 
5 Any diluent known in the art may be used. Exemplary diluents include, but are not 
limited to, water, saline solutions, polyoxyethylene sorbitan monolaurate, magnesium 
stearate, methyl- and propylhydroxybenzoate, talc, alginates, starches, lactose, sucrose, 
dextrose, sorbitol, mannitol, glycerol, calcium phosphate, mineral oil and cocoa butter. 
Variants that display ligand binding properties of native NgR and are expressed 

10 at higher levels, as well as variants that provide for constitutively active receptors, are 
particularly useful in assays of the invention; the variants are also useful in assays of the 
invention and in providing cellular, tissue and animal models of diseases/conditions 
characterized by aberrant NgR activity. 

With the knowledge of the nucleotide sequence information disclosed in the 

15 present invention, one skilled in the art can identify and obtain nucleotide sequences 

which encode NgR from different sources (i.e., different tissues or different organisms) 
through a variety of means well known to the skilled artisan and as disclosed by, for 
example, Sambrooke/a/., Molecular Cloning: A Laboratory Manual, Second 
Edition, Cold Spring Haibor Press, Cold Spring Harbor, NY (1989), which is 

20 incorporated herein by reference in its entirety. 

For example, DNA that encodes NgR may be obtained by screening of mRNA, 
cDNA, or genomic DNA with oligonucleotide probes generated from the NgR gene 
sequence information provided herein. Probes may be labeled with a detectable group, 
such as a fluorescent group, a radioactive atom or a chemiluminescent group in 

25 accordance with procedures known to the skilled artisan and used in conventional 
hybridization assays, as described by, for example, Sambrook et ah (1989) above. 

A nucleic acid molecule comprising any of the NgR nucleotide sequences 
described above can alternatively be synthesized by use of the polymerase chain 
reaction (PCR) procedure, with the PCR oligonucleotide primers produced from the 

30 nucleotide sequences provided herein. See U.S. Patent Nos. 4,683,195 to Mullis et al 
and 4,683,202 to Mullis. The PCR reaction provides a method for selectively 
increasing the concentration of a particular nucleic acid sequence even when that 
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sequence has not been previously purified and is present only in a single copy in a 
particular sample. The method can be used to amplify either single- or 
double-stranded DNA. The essence of the method involves the use of two 
oligonucleotide probes to serve as primers for the template-dependent, polymerase- 
5 mediated replication of a desired nucleic acid molecule. 

A wide variety of alternative cloning and in vitro amplification methodologies 
are well known to those skilled in the art. Examples of these techniques are found in, 
for example, Berger et ah, Guide to Molecular Cloning Techniques, METHODS IN 
. ENZYMOLOGY 1 52 Academic Press, San Diego, CA, which is incorporated herein by 
10 reference in its entirety. 

The nucleic acid molecules of the present invention, and fragments derived 
therefrom, are useful for screening for restriction fragment length polymorphism 
(RFLP) associated with certain disorders, as well as for genetic mapping. 

15 Antibodies 

Also comprehended by the present invention are antibodies (e.g., monoclonal 
and polyclonal antibodies, single chain antibodies, chimeric antibodies, 
bifunctional/bispecific antibodies, humanized antibodies, human antibodies, and 
complementary determining region (CDR)-grafted antibodies, including compounds 

20 which include CDR sequences which specifically recognize a polypeptide of the 

invention) specific for NgR or fragments thereof Preferred antibodies of the invention 
are human antibodies which are produced and identified according to methods 
described in W093/11236, published June 20, 1993, which is incorporated herein by 
reference in its entirety. Antibody fragments, including Fab, Fab', F(ab'>2, and F v , are 

25 also provided by the invention. The term "specific for," when used to describe 

antibodies of the invention, indicates that the variable regions of the antibodies of the 
invention recognize and bind NgR polypeptides exclusively (i.e., are able to distinguish 
NgR polypeptides from other known NgR polypeptides by virtue of measurable 
difFerences in binding affinity, despite the possible existence of localized sequence 

30 identity, homology, or similarity between NgR and such polypeptides). 

The antigenic peptide of NgR comprises at least 8 amino acid residues of the 
amino acid sequence shown in SEQ ID NO:2, 4 or 14 and encompasses an epitope of 

BNSDOCID: <WO 0229O59A3JA> 



WO 02/029059 



PCT7US01/31488 



-63 - 

NgR such that an antibody raised against the peptide forms a specific immune complex 
with NgR. Preferably, the antigenic peptide comprises at least 10 amino acid residues, 
more preferably at least 15 amino acid residues, even more preferably at least 20 amino 
acid residues, and most preferably at least 30 amino acid residues. Preferred epitopes 
5 encompassed by the antigenic peptide are regions of NgR that are located on the 
surface of the protein, e.g., hydrophilic regions. 

It will be understood that specific antibodies may also interact with other 
proteins (for example, S. aureus protein A or other antibodies in ELISA techniques) 
through interactions with sequences outside the variable region of the antibodies, and, 

10 in particular, in the constant region of the molecule. Screening assays to determine 
binding specificity of an antibody of the invention are well known and routinely 
practiced in the art. For a comprehensive discussion of such assays, see Harlow et ah 
in Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press; Cold 
Spring Harbor, NY (1988), Chapter 6. Antibodies that recognize and bind fragments 

15 of the NgR polypeptides of the invention are also contemplated, provided that the 
antibodies are specific for NgR polypeptides. Antibodies of the invention can be 
produced using any method well known and routinely practiced in the art. 

For the production of polyclonal antibodies, various suitable host animals (e.g., 
rabbit, goat, mouse or other mammal) may be immunized by injection with the native 

20 protein, or a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, recombinantly expressed NgR 
protein or a chemically synthesized NgR polypeptide. The preparation can further 
include an adjuvant. Various adjuvants used to increase the immunological response 
include, but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., 

25 aluminum hydroxide), surface active substances (e.g., lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, dinitrophenol, etc.), human adjuvants such as 
Bacille Calmette-Guerin arid Corynebacterium parvwn or similar immunostimulatory 
agents. If desired, the antibody molecules directed against NgR can be isolated from 
the mammal (e.g., from the blood) and further purified by well known techniques, such 

30 as protein A chromatography to obtain the IgG fraction. 

The term "monoclonal antibody" or "monoclonal antibody composition," as 
used herein, refers to a population of antibody molecules that contain only one species 
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of an antigen binding site capable of immunoreacting with a particular epitope of NgR. 
A monoclonal antibody composition thus typically displays a single binding affinity for 
a particular NgR protein with which it immunoreacts. For preparation of monoclonal 
antibodies directed towards a particular NgR protein, or derivatives, fragments, 
5 analogs or homologs thereof, any technique that provides for the production of 

antibody molecules by continuous cell line culture may be utilized. Such techniques 
include, but are not limited to, the hybridoma technique (see Kohler and Milstein 
(1975) Nature 256, 495-497); the trioma technique; the human B-cell hybridoma 
technique (see Kozbor et al, (1983) Immunol. Today 4, 72) and the EBV hybridoma 

10 technique to produce human monoclonal antibodies (see Cole et al, (1985) in 

Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 
Human monoclonal antibodies may be utilized in the practice of the present invention 
and may be produced by using human hybridomas (see Cote et ah, (1983) Proa Natl. 
Acad Sci. USA 80, 2026-2030) or by transforming human B-cells with Epstein Barr 

15 Virus in vitro (see Cole etal, (1985), above). 

According to the invention, techniques can be adapted for the production of 
single-chain antibodies specific to a NgR protein (see e.g., U.S. Patent No. 4,946,778). 
In addition, methods can be adapted for the construction of Fab expression libraries 
(see e.g., Huse et al, (1989) Science 246, 1275-1281) to allow rapid and effective 

20 identification of monoclonal F^ fragments with the desired specificity for a NgR 

protein or derivatives, fragments, analogs or homologs thereof. Non-human antibodies 
can be "humanized" by techniques well known in the art. See e.g., U.S. Patent No. 
5,225,539. In one method, the non-human CDRs are inserted into a human antibody 
or consensus antibody framework sequence. Further changes can then be introduced 

25 into the antibody framework to modulate affinity or immunogenicity. Antibody 

fragments that contain the idiotypes to a NgR protein may be produced by techniques 
known in the art including, but not limited to: (i) an F(ab , ) 2 fragment produced by 
pepsin digestion of an antibody molecule; (ii) an Fab fragment generated by reducing 
the disulfide bridges of an F(ab f ) 2 fragment; (iii) an Fab fragment generated by the 

30 treatment of the antibody molecule with papain and a reducing agent and (iv) F v 
fragments. 
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Additionally, recombinant anti-NgR antibodies, such as chimeric and 
humanized monoclonal antibodies, comprising both human and non-human portions, 
which can be made using standard recombinant DNA techniques, are within the scope 
of the invention. Such chimeric and humanized monoclonal antibodies can be 
5 produced by recombinant DNA techniques known in the art, for example using 

methods described in PCT International Application No. PCT/US 86/02269; European 
Patent Application No. 184,187; European Patent Application No. 171,496; European 
Patent Application No. 173,494; PCT International Publication No. WO 86/01533; 
U.S. Pat. No. 4,816,567; European Patent Application No. 125,023; Better et al, 

10 (1988) Science 240, 1041-1043; Liu et al, (1987) Proc. Natl Acad Sci. USA 84, 
3439-3443; Liu etal, (1987) J. Immunol 139, 3521-3526; Sun etal, (1987) Proc. 
Natl Acad. Sci. USA 84, 214-218; Nishimura etal s (1987) Cancer Res. 47, 999-1005; 
Wood etal 9 (\9S5)Nature 314, 446-449; Shaw et al,. (1988),/. Natl Cancer Inst. 
80, 1553-1559); Morrison (1985) Science 229, 1202-1207; Oi et al, (1986) 

15 BioTechniques 4, 214; U.S. Patent. No. 5,225,539; Jones et al, (1986) Nature 321, 
552-525; Verhoeyan et al, (1988) Science 239, 1534; and Beidler et al, (1988) J. 
Immunol 141, 4053-4060. 

In a preferred embodiment of the invention a portion of a NgR is joined to an 
Fc portion of an antibody to form a NgR/Fc fusion protein. Preferably, the Ig fusion 

20 protein is soluble. The NgR/Fc fusion protein may be formed by recombinant 

techniques as described above. In one embodiment, a portion of a NgR including the 
entire amino acid sequence of NgR except the C-terminal hydrophobic region is fused 
to an Fc portion of an antibody. In preferred embodiments, the NgR is a human NgR 
and the Fc is also human. More preferably, the human Fc portion is derived from an 

25 IgG antibody. In other embodiments, the N-terminal signal sequence is omitted. Such 
antibodies are useful in binding Nogo to prevent Nogo signaling through the NgR. 

In one embodiment, methods for the screening of antibodies that possess the 
desired specificity include, but are not limited to, enzyme-linked immunosorbent assay 
(ELISA) and other immunologically-mediated techniques known within the art. In a 

30 specific embodiment, selection of antibodies that are specific to a particular domain of 
a NgR protein is facilitated by generation of hybridomas that bind to the fragment of a 
NgR protein possessing such a domain. Antibodies that are specific for one or more 
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domains within a NgR protein, e.g., domains spanning the above-identified conserved 
regions of NgRs, or derivatives, fragments analogs or homologs thereof are also 
provided herein. 

Anti-NgR antibodies may be used in methods known within the art relating to 
5 the localization and/or quantitation of a NgR protein {e.g., for use in measuring levels 
of the NgR protein within appropriate physiological samples, for use in diagnostic 
methods, for use in imaging the protein, and the like). In a given embodiment, 
antibodies for NgR proteins, or derivatives, fragments analogs or homologs thereof, 
that contain the antibody derived binding domain, are utilized as pharmacologically- 

10 active compounds [hereinafter "Therapeutics"]. 

An anti-NgR antibody (e.g., monoclonal antibody) can be used to isolate NgR 
by standard techniques, such as affinity chromatography or immunoprecipitation. An 
anti-NgR antibody can facilitate the purification of natural NgR from cells and of 
recombinantly produced NgR expressed in host cells. Moreover, an anti-NgR antibody 

15 can be used to detect NgR protein (e.g., in a cellular lysate or cell supernatant) in order 
to evaluate the abundance and pattern of expression of the NgR protein. Anti-NgR 
antibodies can be used diagnostically to monitor protein levels in tissue as part of a 
clinical testing procedure, e.g., to, for example, determine the efficacy of a given 
treatment regimen. Detection can be facilitated by coupling (i.e., physically linking) 

20 the antibody to a detectable substance. Examples of detectable substances include 
various enzymes, prosthetic groups, fluorescent materials, luminescent materials, 
bioluminescent materials and radioactive materials. Examples of suitable enzymes 
include horseradish peroxidase, alkaline phosphatase, P-galactosidase, or 
acetylcholinesterase; examples of suitable prosthetic group complexes include 

25 streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a 
luminescent material includes luminol; examples of bioluminescent materials include 
luciferase, luciferin and aequorin, and examples of suitable radioactive material include 

30 ^V^Sor 3 !!. 

Another aspect of the present invention is directed to methods of inducing an 
immune response in a mammal against a polypeptide of the invention by administering 
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to the mammal an amount of the polypeptide sufficient to induce an immune response. 
The amount will be dependent on the animal species, size of the animal, and the like 
but can be determined by those skilled in the art. 

Another aspect of the invention is directed to anti-idiotypic antibodies and 

5 anti-anti-idiotypic antibodies. An anti-idiotypic antibody is an antibody that recognizes 
determinants of another antibody (a target antibody). Generally, the anti-idiotypic 
antibody recognizes determinants of the antigen-binding site of the target antibody. 
Typically, the target antibody is a monoclonal antibody. An anti-idiotypic antibody is 
generally prepared by immunizing an animal (particularly, mice) of the same species 

10 and genetic type as the source of the target monoclonal antibody, with the target 
monoclonal antibody. The immunized animal mounts an immune response to the 
idiotypic determinants of the target monoclonal antibody and produces antibodies 
against the idiotypic determinants of the target monoclonal antibody. 
Antibody-producing cells, such as splenic cells, of the immunized animal may be used 

15 to generate anti-idiotypic monoclonal antibodies. Furthermore, an anti-idiotypic 
antibody may also be used to immunize animals to produce anti-anti-idiotypic 
antibodies. These immunized animals may be used to generate anti-anti-idiotypic 
monoclonal antibodies using standard techniques. The anti-anti-idiotypic antibodies 
may bind to the same epitope as the original, target monoclonal antibody used to 

20 prepare the anti-idiotypic antibody. The anti-anti-idiotypic antibodies represent other 
monoclonal antibodies with the same antigen specificity as the original target 
monoclonal antibody. 

If the binding of the anti-idiotypic antibody with the target antibody is inhibited 
by the relevant antigen of the target antibody, and if the anti-idiotypic antibody induces 

25 an antibody response with the same specificity as the target antibody, it mimics the 
antigen of the target antibody. Such an anti-idiotypic antibody is an "internal image 
anti-idiotype" and is capable of inducing an antibody response as if it were the original 
antigen. (Bona and Kohler (1984) ANTI-IDIOTYPIC ANTIBODIES AND INTERNAL IMAGE, 
in Monoclonal and anti-idiotypic antibodies: Probes for receptor structure 

30 AND function, Venter J.C. et al (Eds), Alan R. Liss, New York, NY, pp 141-149, 
1984). Vaccines incorporating internal image anti-idiotype antibodies have been 
shown to induce protective responses against viruses, bacteria, and parasites (Kennedy 
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etal f (1986) 232, 220-223; 1047; McNamara etal, (1985) Science 226, 1325-1326). 
Internal image anti-idiotypic antibodies have also been shown to induce immunity to 
tumor related antigens (Raychauhuri et al 9 (1986) J. Immunol 137, 1743-1749; 
Raychauhuri etal, (1987) J. Immunol 139, 3902-3910; Bhattacharya-Chatterjee et 
5 al, (1987) J. Immunol 139, 1354-1360; Bhattacharya-Chatterjee et al, (1988) J. 
Immunol 141, 1398-1403; Herlyn. etal (1989) Intern. Rev. Immunol 4, 347-357; 
Chen et al (1990) Cell Imm. Immunother. Cancer 351-359; Herlyn et al, (1991) in 
vivo 5, 615-624; Furuyae/a/. (1992) Anticancer Res. 12, 27-32; JVGttelman, A. etal 
(1992) Proc. Natl Acad Set, USA 89, 466-470; Durrant. etal, (1994) Cancer Res. 
10 54, 4837-4840; Mittelman. et al (1994) Cancer Res. 54, 415-421; Schmitt. a/. 

(1994) Hybridoma 13, 389-396; Chakrobarty. *tf a/. (1995) J. Immunother. 18, 
95-103; Chakrobarty. e/a/. (1995) Cancer Res. 55, 1525-1530; Foon, K. A. efa/. 

(1995) Clin. Caticer Res. 1, 1205-1294; Herlyn et al (1995) Hybridoma 14, 159-166; 
Sclebusch (1995) Hybridoma 14, 167-174; Herlyn. etal (1996) Cancer 

15 Immunol Immunother. 43, 65-76). 

Anti-idiotypic antibodies for NgR may be prepared, for example, by 
immunizing an animal, such as a mouse, with a immunogenic amount of a composition 
comprising NgR2 (SEQ ID NO:2), NgR3 (SEQ ID NOs:4 or 14), or immunogenic 
portion thereof, containing at least one antigenic epitope of NgR. The composition 

20 may also contain a suitable adjuvant, and any carrier necessary to provide 

immunogenicity. Monoclonal antibodies recognizing NgR may be prepared from the 
cells of the immunized animal as described above. A monoclonal antibody recognizing 
an epitope of NgR is then selected and used to prepare a composition comprising an 
immunogenic amount of the anti-NgR monoclonal antibody. Typically, a 25 to 200 \ig 

25 dose of purified anti-NgR monoclonal would be sufficient in a suitable adjuvant. 

Animals may be immunized 2-6 times at 14 to 30 day intervals between doses. 
Typically, animals are immunized by any suitable route of administration, such as 
intraperitoneal, subcutaneous, intravenous or a combination of these. Anti-idiotypic 
antibody production may be monitored during the immunization period using standard 

3 0 immunoassay methods. Animals with suitable titers of antibodies reactive with the 

target monoclonal antibodies may be reimmunized with the monoclonal antibody used 
as the immunogen three days before harvesting the antibody producing cells. 
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Preferably, spleen cells are used, although other antibody producing cells may be 
selected. Antibody-producing cells are harvested and fused with myeloma cells to 
produce Hybridomas, as described above, and suitable anti-idiotypic 
antibody-producing cells are selected. 
5 Anti-anti-idiotypic antibodies are produced by another round of immunization 

and Hybridoma production by using the anti-idiotypic monoclonal antibody as the 
immunogen. 

Antibodies of the invention are useful for, e.g., therapeutic purposes (by 
modulating activity of NgR), diagnostic purposes to detect or quantitate NgR, and 
10 purification of NgR. Therefore, kits comprising an antibody of the invention for any of 
the purposes described herein are also comprehended. 

Kits 

The present invention is also directed to kits, including pharmaceutical kits. 

15 The kits can comprise any of the nucleic acid molecules described above, any of the 
polypeptides described above, or any antibody which binds to a polypeptide of the 
invention as described above, as well appropriate controls, such as positive and/or 
negative controls. The kit preferably comprises additional components, such as, for 
example, instructions, solid support, reagents helpful for quantification, and the like. 

20 For example, the kit can comprise: a labeled compound or agent capable of detecting 
NgR protein or mRNA in a biological sample; means for determining the amount of 
NgR in the sample; and means for comparing the amount of NgR in the sample with a 
standard. The compound or agent can be packaged in a suitable container. 

25 Screening Assays 

The DNA and amino acid sequence information provided by the present 
invention also makes possible identification of binding partner compounds with which 
a NgR polypeptide or polynucleotide will interact. Methods to identify binding partner 
compounds include solution assays, in vitro assays wherein NgR polypeptides are 

30 immobilized and cell-based assays. Identification of binding partner compounds of 
NgR polypeptides provides candidates for therapeutic or prophylactic intervention in 
pathologies associated with Ngjt normal and aberrant biological activity. 
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The invention also provides a method (also referred to herein as a "screening 
assay ft ) for identifying modulators, i.e., candidate or test compounds or agents (e.g., 
peptides, peptidomimetics, small molecules (e.g., molecules of less than 1,000 Daltons) 
or other drugs) that bind to NgR proteins or have a stimulatory- or inhibitory effect on, 
5 for example, NgR expression or NgR activity. 

In one embodiment, the invention provides assays for screening candidate or 
test compounds which bind to or modulate the activity of a NgR protein or 
polypeptide or biologically active portion thereof. The test compounds of the present 
invention can be obtained using any of the numerous approaches in combinatorial 

10 library methods known in the art, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 
deconvolution; the "one-bead one-compound" library method; and synthetic library 
methods using affinity chromatography selection. The biological library approach is 
limited to peptide libraries, while the other four approaches are applicable to peptide, 

15 non-peptide oligomer or small molecule libraries of compounds (Lam (1997) 
Anticancer Drug Des. 12, 145). 

Examples of methods for the synthesis of molecular libraries can be found in 
the art, for example in: DeWitt et al, (1993) Proc. Natl Acad Sci. USA 90, 6909; Erb 
etal, (1994) Proc. Natl. Acad Sci. USA 91,11422; Zuckermann et al. (1994) J. Med. 

20 Chem 37, 2678; Cho et al, (1993) Science 261, 1303; Carrell et al, (1994) Angew 
Chem. Int. Ed. Engl. 33, 2059; Carell et al, (1994) Angew Chem. Int. Ed Engl. 33, 
2061; and Gallop et al, (1994) J. Med Chem 37, 1233. 

Libraries of compounds may be presented in solution (e.g., Houghten (1992) 
BioTechniques 13, 412-421), or on beads (Lam (1991) Nature 354, 82-84), on chips 

25 (Fodor (1993) Nature 364, 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), 
spores (Ladner, above), plasmids (Cull et al. (1992) Proc. Natl Acad. Sci. USA 89, 
1865-1869) or on phage (Scott and Smith (1990) Science 249, 386-390; Devlin (1990) 
Science 249, 404-406; Cwirla etal. (1990) Proc. Natl. Acad Sci. USA 87, 6378-6382; 
Felici (1991) J. Mol. Biol 222, 301-310; Ladner, above). 

30 
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1. Cell-based Assays 

The invention also provides cell-based assays to identify binding partner 
compounds of a NgR polypeptide. In one embodiment, the invention provides a 
method comprising the steps of contacting a NgR polypeptide expressed on the surface 
5 of a cell with a candidate binding partner compound and detecting binding of the 

candidate binding partner compound to the NgR polypeptide. In another embodiment, 
an assay is a cell-based assay comprising contacting a cell expressing a membrane- 
bound form of NgR protein, or a biologically active portion thereof, on the cell surface 
with a test compound and determining the ability of the test compound to modulate 
10 (e.g., stimulate or inhibit) the activity of the NgR protein or biologically active portion 
thereof. 

In one embodiment, an assay is a cell-based assay in which a cell which 
expresses a membrane-bound form of NgR protein, or a biologically active portion 
thereof, on the cell surface is contacted with a test compound and the ability of the test 

15 compound to bind to a NgR protein determined. The cell, for example, can be of 

mammalian origin or a yeast cell. Determining the ability of the test compound to bind 
to the NgR protein can be accomplished, for example, by coupling the test compound 
with a radioisotope or enzymatic label such that binding of the test compound to the 
NgR protein or biologically active portion thereof can be determined by detecting the 

20 labeled compound in a complex. For example, test compounds can be labeled with 
I, S, C, or H, either directly or indirectly, and the radioisotope detected by 
direct counting of radioemission or by scintillation counting. Alternatively, test 
compounds can be en2ymatically labeled with, for example, horseradish peroxidase, 
alkaline phosphatase or luciferase, and the enzymatic label detected by determination 

25 of conversion of an appropriate substrate to product. In one embodiment, the assay 
comprises contacting a cell which expresses a membrane-bound form of NgR protein 
or a biologically active portion thereof; on the cell surface with a known compound 
which binds NgR to form an assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test compound to interact with a NgR 

30 protein, wherein determining the ability of the test compound to interact with a NgR 
protein comprises determining the ability of the test compound to preferentially bind to 
NgR or a biologically active portion thereof as compared to the known compound. 
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Determining the ability of the test compound to modulate the activity of NgR 
or a biologically active portion thereof can be accomplished, for example, by 
determining the ability of the NgR protein to bind to or interact with a NgR target 
molecule. As used herein, a "target molecule" is a molecule with which a NgR protein 
5 binds or interacts in nature, for example, a molecule on the surface of a cell which 

expresses a NgR protein, a molecule on the surface of a second cell, a molecule in the 
extracellular milieu, a molecule associated with the internal surface of a cell membrane 
or a cytoplasmic molecule. A NgR target molecule can be a non-NgR molecule or a 
NgR protein or polypeptide of the present invention. In one embodiment, a NgR 

10 target molecule is a component of a signal transduction pathway that facilitates 
transduction of an extracellular signal (e.g., a signal generated by binding of a 
compound to a membrane-bound NgR molecule) through the cell membrane and into 
the cell. The target, for example, can be a second intercellular protein that has 
catalytic activity or a protein that facilitates the association of downstream signaling 

1 5 molecules with NgR In a preferred embodiment, the detection comprises detecting a 
calcium flux or other physiological event in the cell caused by the binding of the 
molecule. 

Specific binding molecules, including natural ligands and synthetic compounds, 
can be identified or developed using isolated or recombinant NgR products, NgR 
20 variants, or preferably, cells expressing such products. Binding partners are useful for 
purifying NgR products and detection or quantification of NgR products in fluid and 
tissue samples using known immunological procedures. Binding molecules are also 
manifestly useful in modulating (i.e., blocking, inhibiting or stimulating) biological 
activities of NgR, especially those activities involved in signal transduction. 

25 

2. Cell-free Assays 

(a) Direct binding: 

The invention includes several assay systems for identifying NgR binding 
partners. In solution assays, methods of the invention comprise the steps of (a) 
30 contacting a NgR polypeptide with one or more candidate binding partner compounds 
and (b) identifying the compounds that bind to the NgR polypeptide. Identification of 
the compounds that bind the NgR polypeptide can be achieved by isolating the NgR 
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polypeptide/binding partner complex and separating the binding partner compound 
from the NgR polypeptide. An additional step of characterizing the physical, biological 
and/or biochemical properties of the binding partner compound is also comprehended 
in another embodiment of the invention. In one aspect, the NgR polypeptide/binding 
5 partner complex is isolated using an antibody immunospecific for either the NgR 
polypeptide or the candidate binding partner compound. 

In still other embodiments, either the NgR polypeptide or the candidate binding 
partner compound comprises a label or tag that facilitates its isolation, and methods of 
the invention to identify binding partner compounds include a step of isolating the NgR 
10 polypeptide/binding partner complex through interaction with the label or tag. An 
exemplary tag of this type is a poly-histidine sequence, generally around six histidine 
residues, that permits isolation of a compound so labeled using nickel chelation. Other 
labels and tags, such as the FLAG® tag (Eastman Kodak, Rochester, NY), well known 
and routinely used in the art, are embraced by the invention. 

15 

(b) Immobilized NgR 

In one variation of an in vitro assay, the invention provides a method 
comprising the steps of (a) contacting an immobilized NgR polypeptide, or a 
biologically active fragment thereof with a candidate binding partner compound and (b) 

20 detecting binding of the candidate compound to the NgR polypeptide. In an 

alternative embodiment, the candidate binding partner compound is immobilized and 
i binding of NgR is detected. Immobilization is accomplished using any of the methods 
well known in the art, including covalent bonding to a support, a bead or a 
chromatographic resin, as well as non-covalent, high affinity interactions such as 

25 antibody binding, or use of streptavidin/biotin binding wherein the immobilized 
compound includes a biotin moiety. Binding of a test compound to NgR, or 
interaction of NgR with a target molecule in the presence and absence of a candidate 
compound, can be accomplished in any vessel suitable for containing the reactants. 
Examples of such vessels include microtiter plates, test tubes, and micro-centrifuge 

30 tubes. In one embodiment, a fusion protein can be provided that adds a domain that 
allows one or bpth of the proteins to be bound to a matrix. For example, and not by 
way of limitation, GST-NgR fusion proteins or GST-target fusion proteins can be 
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adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or 
glutathione derivatized microtiter plates, that are then combined with the test 
compound or the test compound and either the non-adsorbed target protein or NgR 
protein, and the mixture is incubated under conditions conducive to complex formation 
5 (e.g., at physiological conditions for salt and pH). Following incubation, the beads or 
microtiter plate wells are washed to remove any unbound components, the matrix 
immobilized in the case of beads, and the complexes determined either directly or 
indirectly, for example, as described above. Alternatively, the complexes can be 
dissociated from the matrix, and the level of NgR binding or activity determined using 
10 standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either NgR or its target molecule can 
be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated NgR or 
target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques well known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, 
IL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce 
Chemical). Alternatively, antibodies reactive with NgR or target molecules, but which 
do not interfere with binding of the NgR protein to its target molecule, can be 
derivatized to the wells of the plate, and unbound target or NgR trapped in the wells 
by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of 
complexes using antibodies reactive with the NgR or target molecule, as well as 
enzyme-linked assays that rely on detecting an enzymatic activity associated with the 
NgR or target molecule. 

Detection of binding can be accomplished (i) using a radioactive label on the 
compound that is not immobilized, (ii) using of a fluorescent label on the 
non-immobilized compound, (iii) using an antibody immunospecific for the 
non-immobilized compound, (iv) using a label on the non-immobilized compound that 
excites a fluorescent support to which the immobilized compound is attached, (v) 
determining the activity of the NgR, as well as other techniques well known and 
routinely practiced in the art. 
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Determining the activity of the target molecule, for example, may be 
accomplished by detecting induction of a cellular second messenger of the target {i.e. 
intracellular Ca 2+ , diacylglycerol, IP3, etc.), detecting catalytic/enzymatic activity of the 
target an appropriate substrate, detecting the induction of a reporter gene (comprising 
5 a NgR-responsive regulatory element operatively linked to a nucleic acid encoding a 
detectable marker, e.g., luciferase), or detecting a cellular response, for example, cell 
survival, cellular differentiation, or cell proliferation. 

(c) Competition experiments 

10 In yet another embodiment, the assay comprises contacting the NgR protein or 

biologically active portion thereof with a known compound which binds NgR to form 
an assay mixture, contacting the assay mixture with a test compound, and determining 
the ability of the test compound to interact with a NgR protein, wherein determining 
the ability of the test compound to interact with a NgR protein comprises determining 

15 the ability of the test compound to preferentially bind to NgR or biologically active 
portion thereof as compared to the known compound. 

In yet another embodiment, the cell-free assay comprises contacting the NgR 
protein or biologically active portion thereof with a known compound which binds 
NgR to form an assay mixture, contacting the assay mixture with a test compound, and 

20 determining the ability of the test compound to interact with a NgR protein, wherein 
determining the ability of the test compound to interact with a NgR protein comprises 
determining the ability of the NgR protein to modulate the activity of a NgR target 
molecule. 

The cell-free assays of the present invention are amenable to use of both the 
25 soluble form or the membrane-bound form of NgR. In the case of cell-free assays 
comprising the membrane-bound form of NgR, it may be desirable to utilize a 
solubilizing agent such that the membrane-bound form of NgR is maintained in 
solution. Examples of such solubilizing agents include non-ionic detergents such as n- 
octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, 
30 decanoyl-N-methylglucamide, Triton® X-100, Triton® X-l 14, Thesit®, 

Isotridecypoly(ethylene glycol ether) n , 3-(3-cholamidopropyl)dimethylainminiol-l- 
propane sulfonate (CHAPS), 3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy-l- 
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propane sulfonate (CHAPSO), or N-dodecyl-N,N-dimethyl-3-ammonio-l -propane 
sulfonate. 

Modulators 

5 Agents that modulate (i.e., increase, decrease, or block) NgR activity or 

expression may be identified by incubating a putative modulator with a cell containing 
a NgR polypeptide or polynucleotide and determining the effect of the putative 
modulator on NgR activity or expression. The selectivity of a compound that 
modulates the activity of NgR can be evaluated by comparing its effects on NgR to its 

10 effect on other NgR compounds. Selective modulators may include, for example, 

antibodies and other proteins, peptides or organic molecules which specifically bind to 
a NgR polypeptide or a NgR-encoding nucleic acid. Modulators of NgR activity will 
be therapeutically useful in treatment of diseases and physiological conditions in which 
normal or aberrant NgR activity is involved. NgR polynucleotides, polypeptides and 

15 modulators may be used in the treatment of such diseases and conditions associated 
with demyelination. NgR polynucleotides and polypeptides, as well as NgR 
modulators, may also be used in diagnostic assays for such diseases or conditions. 

Methods of the invention to identify modulators include variations on any of 
the methods described above to identify binding partner compounds, the variations 

20 including techniques wherein a binding partner compound has been identified and the 
binding assay is carried out in the presence and absence of a candidate modulator. A 
modulator is identified in those instances where binding between the NgR polypeptide 
and the binding partner compound changes in the presence of the candidate modulator 
compared to binding in the absence of the candidate modulator compound. A 

25 modulator that increases binding between the NgR polypeptide and the binding partner 
compound is described as an enhancer or activator, and a modulator that decreases 
binding between the NgR polypeptide and the binding partner compound is described 
as an inhibitor. 

In another embodiment, modulators of NgR expression may be identified in a 
30 method wherein a cell is contacted with a candidate compound and the expression of 
NgR mRNA or protein in the cell is determined. The level of expression of NgR 
mRNA or protein in the presence of the candidate compound is compared to the level 
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of expression of NgR mRNA or protein in the absence of the candidate compound. 
The candidate compound can then be identified as a modulator of NgR expression 
based on this comparison. For example, when expression of NgR mRNA or protein is 
greater (statistically significantly greater) in the presence of the candidate compound 
5 than in its absence, the candidate compound is identified as a stimulator of NgR mRNA 
or protein expression. Alternatively, when expression of NgR mRNA or protein is less 
(statistically significantly less) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as an inhibitor of NgR mRNA or protein 
expression. The level of NgR mRNA or protein expression in the cells can be 
1 0 determined by methods described herein for detecting NgR mRNA or protein. 

High Throughput Screening 

The invention also comprehends high-throughput screening (HTS) assays to 
identify compounds that interact with or inhibit biological activity (i.e., affect 

1 5 enzymatic activity, binding activity, etc.) of a NgR polypeptide. HTS assays permit 
screening of large numbers of compounds in an efficient manner. Cell-based HTS 
systems are contemplated to investigate NgR receptor-ligand interaction. HTS assays 
are designed to identify "hits" or "lead compounds" having the desired property, from 
which modifications can be designed to improve the desired property. Chemical 

20 modification of the "hit" or "lead compound" is often based on an identifiable 
structure/activity relationship between the "hit" and the NgR polypeptide. 

Another aspect of the present invention is directed to methods of identifying 
compounds that bind to either NgR or nucleic acid molecules encoding NgR, 
comprising contacting NgR, or a nucleic acid molecule encoding the same, with a 

25 compound, and determining whether the compound binds NgR or a nucleic acid 

molecule encoding the same. Binding can be determined by binding assays which are 
well known to the skilled artisan, including, but not limited to, gel-shift assays, 
Western blots, radiolabeled competition assay, phage-based expression cloning, 
co-fractionation by chromatography, co-precipitation, cross linking, interaction 

30 trap/two-hybrid analysis, southwestern analysis, ELISA and the like, which are 

described in, for example, Ausubel etal. (Eds.), Current Protocols in Molecular 
Biology, 1999, John Wiley & Sons, NY, which is incorporated herein by reference in 
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its entirety. The NgR proteins, for example, can be used as "bait proteins" in a two- 
hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,317; Zervos et aL 9 
(1993) Cell 72, 223-232; Madura etaL, (1993) J. Biol Chem. 268, 12046-12054; 
Bartel et al. 9 (1993) BioTechniques 14, 920-924; Iwabuchi et aL, (1993) Oncogene 8, 
5 1693-1696; and Brent WO 94/10300), to identify other proteins that bind to or interact 
with NgR ("NgR-binding proteins" or "NgR-bp") and modulate NgR activity. Such 
NgR-binding proteins are also likely to be involved in the propagation of signals by the 
NgR proteins as, for example, upstream or downstream elements of the NgR pathway. 
Other assays may be used to identify specific ligands of a NgR receptor, 
10 including assays that identify ligands of the target protein through measuring direct 
binding of test ligands to the target protein, as well as assays that identify ligands of 
target proteins through affinity ultrafiltration with ion spray mass spectroscopy/HPLC 
methods or other physical and analytical methods. Alternatively, such binding 
interactions are evaluated indirectly using the yeast two-hybrid system described in 
15 Fields et al, (1989) Nature 340, 245-246, and Fields et al. y (1994) Treivls Genet 10, 
286-292, both of which are incorporated herein by reference. The two-hybrid system 
is a genetic assay based on the modular nature of most transcription factors used for 
detecting interactions between two proteins or polypeptides. It can be used to identify 
proteins that bind to a known protein of interest, or to delineate domains or residues 
20 critical for an interaction. Variations on this methodology have been developed to 
clone genes that encode DNA binding proteins, to identify peptides that bind to a 
protein, and to screen for drugs. The two-hybrid system exploits the ability of a pair of 
interacting proteins to bring a transcription activation domain into close proximity with 
a DNA binding domain that binds to an upstream activation sequence (UAS) of a 
25 reporter gene, and is generally performed in yeast. The assay requires the construction 
of two hybrid genes encoding (1) a DNA-binding domain that is fused to a first protein 
and (2) an activation domain fused to a second protein. The DNA-binding domain 
targets the first hybrid protein to the UAS of the reporter gene; however, because most 
proteins lack an activation domain, this DNA-binding hybrid protein does not activate 
30 transcription of the reporter gene. The second hybrid protein, which contains the 

activation domain, cannot by itself activate expression of the reporter gene because it 
does not bind the UAS. However, when both hybrid proteins are present, the 
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noncovalent interaction of the first and second proteins tethers the activation domain 
to the UAS, activating transcription of the reporter gene. For example, when the first 
protein is a NgR gene product, or fragment thereof that is known to interact with 
another protein or nucleic acid, this assay can be used to detect agents that interfere 
with the binding interaction. Expression of the reporter gene is monitored as different 
test agents are added to the system. The presence of an inhibitory agent results in lack 
of a reporter signal. The compounds to be screened include (which may include 
compounds that are suspected to bind NgR, or a nucleic acid molecule encoding the 
same), but are not limited to, extracellular, intracellular, biological or chemical origin. 

The function of the NgR gene product is unclear and no ligands have yet been 
found which bind the gene product. The yeast two-hybrid assay is useful to identify 
proteins that bind to the gene product. In an assay to identify proteins that bind to a 
NgR receptor, or fragment thereof a fusion polynucleotide encoding both a NgR 
receptor (or fragment) and a UAS binding domain {i.e., a first protein) may be used. 
In addition, a large number of hybrid genes each encoding a different second protein 
fused to an activation domain are produced and screened in the assay. Typically, the 
second protein is encoded by one or more members of a total cDNA or genomic DNA 
fusion library, with each second protein-coding region being fused to the activation 
domain. This system is applicable to a wide variety of proteins, and it is not even 
necessary to know the identity or function of the second binding protein. The system 
- is highly sensitive and can detect interactions not revealed by other methods; even 
transient interactions may trigger transcription to produce a stable mRNA that can be 
repeatedly translated to yield the reporter protein. 

Other assays may be used to search for agents that bind to the target protein. 
One such screening method to identify direct binding of test ligands to a target protein 
is described in U.S. Patent No. 5,585,277, incorporated herein by reference. This 
method relies on the principle that proteins generally exist as a mixture of folded and 
unfolded states, and continually alternate between the two states. When a test ligand 
binds to the folded form of a target protein (i.e., when the test ligand is a ligand of the 
target protein), the target protein molecule bound by the ligand remains in its folded 
state. Thus, the folded target protein is present to a greater extent in the presence of a 
test ligand which binds the target protein, than in the absence of a ligand. Binding of 
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the ligand to the target protein can be determined by any method which distinguishes 
between the folded and unfolded states of the target protein. The function of the 
target protein need not be known in order for this assay to be performed. Virtually any 
agent can be assessed by this method as a test ligand, including, but not limited to, 
5 metals, polypeptides, proteins, lipids, polysaccharides, polynucleotides and small 
organic molecules. 

Another method for identifying ligands of a target protein is described in 
Wieboldt et al (1997) Anal Chew. 69:1683-1691, incorporated herein by reference. 
This technique screens combinatorial libraries of 20-30 agents at a time in solution 
10 phase for binding to the target protein. Agents that bind to the target protein are 
separated from other library components by simple membrane washing. The 
specifically selected molecules that are retained on the filter are subsequently liberated 
from the target protein and analyzed by HPLC and pneumatically assisted electrospray 
(ion spray) ionization mass spectroscopy. This procedure selects library components 
15 with the greatest affinity for the target protein, and is particularly useful for small 
molecule libraries. 

The methods of the invention also embrace ligands, especially neuropeptides, 
that are attached to a label, such as a radiolabel (e.g., 125 I, 35 S, 32 P, 33 P, 3 H), a 
fluorescence label, a chemiluminescent label, an enzymic label and an immunogenic 

20 label Modulators falling within the scope of the invention include, but are not limited 
to, non-peptide molecules such as non-peptide mimetics, non-peptide allosteric 
effectors, and peptides. The NgR polypeptide or polynucleotide employed in such a 
test may either be free in solution, attached to a solid support, borne on a cell surface 
or located intracellularly or associated with a portion of a cell. One skilled in the art 

25 can, for example, measure the formation of complexes between NgR and the 
compound being tested. Alternatively, one skilled in the art can examine the 
diminution in complex formation between NgR and its substrate caused by the 
compound being tested. 

Another aspect of the present invention is directed to methods of identifying 

30 compounds which modulate (i.e., increase or decrease) activity of NgR comprising 
contacting NgR with a compound, and determining whether the compound modifies 
activity of NgR. The activity in the presence of the test compared is measured to the 
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activity in the absence of the test compound. Where the activity of the sample 
containing the test compound is higher than the activity in the sample lacking the test 
compound, the compound will have increased activity. Similarly, where the activity of 
the sample containing the test compound is lower than the activity in the sample 
5 lacking the test compound, the compound will have inhibited activity. 

The present invention is particularly useful for screening compounds by using 
NgR in any of a variety of drug screening techniques. The compounds to be screened 
include (which may include compounds which are suspected to modulate NgR 
activity), but are not limited to, extracellular, intracellular, biologic or chemical origin. 

10 The NgR polypeptide employed in such a test may be in any form, preferably, free in 
solution, attached to a solid support, borne on a cell surface or located intracellularly. 
One skilled in the art can, for example, measure the formation of complexes between 
NgR and the compound being tested. Alternatively, one skilled in the art can examine 
the diminution in complex formation between Nogo-R and its substrate caused by the 

1 5 compound being tested. 

The activity of NgR polypeptides of the invention can be determined by, for 
example, examining the ability to bind or be activated by chemically synthesized 
peptide ligands. Alternatively, the activity of the NgR can be assayed by examining 
their ability to bind calcium ions, hormones, chemokines, neuropeptides, 

20 neurotransmitters, nucleotides, lipids, odorants and photons. Alternatively, the activity 
of the NgR can be determined by examining the activity of effector molecules 
including, but not limited to, adenylate cyclase, phospholipases and ion channels. 
Thus, modulators of NgR activity may alter a NgR receptor function, such as a binding 
property of a receptor or an activity. In various embodiments of the method, the assay 

25 may take the form of an ion flux assay, a yeast growth assay, a non-hydrolyzable GTP 
assay such as a [ S]-GTP S assay, a cAMP assay, an inositol triphosphate assay, a 
diacylglycerol assay, an Aequorin assay, a Luciferase assay, a FLEPR assay for 
intracellular Ca 2+ concentration, a mitogenesis assay, a MAP Kinase activity assay, an 
arachidonic acid release assay (e.g., using [ H]-arachidonic acid) and an assay for 

30 extracellular acidification rates, as well as other binding or function-based assays of 
NgR activity that are generally known in the art. NgR activity can be determined by 
methodologies that are used to assay for FaRP activity, which is well known to those 
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skilled in the art. Biological activities of NgR receptors according to the invention 
include, but are not limited to, the binding of a natural or an unnatural ligand, as well 
as any one of the functional activities of NgRs known in the art. Non-limiting 
examples of NgR activities include transmembrane signaling of various forms, which 

5 may involve phosphatidylinositol (PI) association and/or the exertion of an influence 
over PI; another exemplary activity of NgRs is the binding of accessory proteins or 
polypeptides that differ from known GPI proteins. 

The modulators of the invention exhibit a variety of chemical structures, which 
can be generally grouped into non-peptide mimetics of natural NgR receptor ligands, 

10 peptide and non-peptide allosteric effectors of NgR receptors, and peptides that may 
function as activators or inhibitors (competitive, uncompetitive and non-competitive) 
(e.g., antibody products) of NgR receptors. The invention does not restrict the 
sources for suitable modulators, which may be obtained from natural sources such as 
plant, animal or mineral extracts, or non-natural sources such as small molecule 

15 libraries, including the products of combinatorial chemical approaches to library 
construction, and peptide libraries. 

Other assays can be used to examine enzymatic activity including, but not 
limited to, photometric, radiometric, HPLC, electrochemical, and the like, which are 
described in, for example, Enzyme Assays: A Practical Approach, Eisenthal and 

20 Danson (Eds ), 1992, Oxford University Press, which is incorporated herein by 
reference in its entirety. 

The use of cDNAs in drug discovery programs is well-known; assays capable 
of testing thousands of unknown compounds per day in high-throughput screens 
(HTSs) are thoroughly documented. The literature is replete with examples of the use 

25 of radiolabeled ligands in HTS binding assays for drug discovery (see Williams (1991) 
Med Res. Rev., 11, 147-184; Sweetnam etal, (1993) J. Nat. Prod. 56, 441-455 for 
review). Recombinant receptors are preferred for binding assay HTS because they 
allow for better specificity (higher relative purity), provide the ability to generate large 
amounts of receptor material, and can be used in a broad variety of formats (see 

30 Hodgson (1992) Bio/Technology 10, 973-980; each of which is incorporated herein by 
reference in its entirety). 
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A variety of heterologous systems is available for functional expression of 
recombinant receptors that are well known to those skilled in the art. Such systems 
include bacteria (Strosberg et al (1992) Trends Pharmacol Set 13, 95-98), yeast 
(Pausch (1997) Trends Biotechnol. 15, 487-494), several kinds of insect cells (Vanden 

5 Broeck (1996) Int. Rev. Cytol 164, 189-268), amphibian cells (Jayawickreme et al 
(1997) Curr. Opin. Biotechnol 8, 629-634) and several mammalian cell lines (CHO, 
HEK293, COS, etc.; see Gerhardt etal (1997) Eur. J. Pharmacol 334, 1-23). These 
examples do not preclude the use of other possible cell expression systems, including 
cell lines obtained from nematodes (PCT application WO 98/37177). 

0 In preferred embodiments of the invention, methods of screening for 

compounds which modulate NgR activity comprise contacting test compounds with 
NgR and assaying for the presence of a complex between the compound and NgR. In 
such assays, the ligand is typically labeled. After suitable incubation, free ligand is 
separated from that present in bound form, and the amount of free or uncomplexed 

5 label is a measure of the ability of the particular compound to bind to NgR. 

In another embodiment of the invention, high throughput screening for 
compounds having suitable binding affinity to NgR is employed. Briefly, large 
numbers of different small peptide test compounds are synthesized on a solid substrate. 
The peptide test compounds are contacted with NgR and washed. Bound NgR is then 

3 detected by methods well known in the art. Purified polypeptides of the invention can 
also be coated directly onto plates for use in the aforementioned drug screening 
techniques. In addition, non-neutralizing antibodies can be used to capture the protein 
and immobilize it on the solid support. 

Generally, an expressed NgR can be used for HTS binding assays in 

> conjunction with its defined ligand. The identified peptide is labeled with a suitable 
radioisotope, including, but not limited to, 125 I, 3 H, 35 S or 32 P, by methods that are 
well known to those skilled in the art. Alternatively, the peptides may be labeled by 
well-known methods with a suitable fluorescent derivative (Baindur et al (1994) Drug 
Dev. Res. 33, 373-398; Rogers (1997) Drug Discov. Today 2, 156-160). Radioactive 

) ligand specifically bound to the receptor in membrane preparations made from the cell 
line expressing the recombinant protein can be detected in HTS assays in one of 
several standard ways, including filtration of the receptor-ligand complex to separate 
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bound ligand from unbound ligand (Williams (1991) Med Res. Rev. 11, 147-184; 
Sweetnam et al (1993) J. Nat. Prod. 56, 441-455). Alternative methods include a 
scintillation proximity assay (SPA) or a FlashPlate format in which such separation is 
unnecessary (Nakayama (1998) Curr. Opin. Drug Disc. Dev. 1, 85-91 Bosse etal 
5 (1998) J. Biomol Screening 3, 285-292). Binding of fluorescent ligands can be 
detected in various ways, including fluorescence energy transfer (FRET), direct 
spectrophotofluorometric analysis of bound ligand, or fluorescence polarization 
(Rogers (1997) DrugDiscov. Today 2, 156-160; Hill (1998) Curr. Opin. Drug Disc. 
Dev. 1, 92-97). 

10 Examples of such biological responses include, but are not limited to, the 

following: the ability to survive in the absence of a limiting nutrient in specifically 
engineered yeast cells (Pausch (1997) Trends in Biotechnol. 15, 487-494); changes in 
intracellular Ca 2+ concentration as measured by fluorescent dyes (Murphy et al (1998) 
Cur. Opin. Drug Disc. Dev. 1, 192-199). Fluorescence changes can also be used to 

15 monitor ligand-induced changes in membrane potential or intracellular pH; an 

automated system suitable for HTS has been described for these purposes (Schroeder 
et al (1996) J. Biomol Screening 1, 75-80). Melanophores prepared fromXenopus 
laevis show a ligand-dependent change in pigment organization in response to 
heterologous NgR activation; this response is adaptable to HTS formats 

20 (Jayawickreme et al (1997) Curr. Opin. Biotechnol 8, 629-634). Assays are also 
available for the measurement of common second messengers, including cAMP, 
phosphoinositides and arachidonic acid, but these are not generally preferred for HTS. 

Preferred methods of HTS employing these receptors include permanently 
transfected CHO cells, in which agonists and antagonists can be identified by the ability 

25 to transduce the signal for the binding of Nogo in membranes prepared from these cells 
through the putative GPI anchor. In another embodiment of the invention, 
permanently transfected CHO cells could be used for the preparation of membranes 
which contain significant amounts of the recombinant receptor proteins; these 
membrane preparations would then be used in receptor binding assays, employing the 

30 radiolabeled ligand specific for the particular receptor. Alternatively, a functional 
assay, such as fluorescent monitoring of ligand-induced changes in internal Ca 24 * 
concentration or membrane potential in permanently transfected CHO cells containing 
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each of these receptors individually or in combination would be preferred for HTS. 
• Equally preferred would be an alternative type of mammalian cell, such as HEK293 or 
COS cells, in similar formats. More preferred would be permanently transfected insect 
cell lines, such as Drosophila S2 cells. Even more preferred would be recombinant 
5 yeast cells expressing the Drosophila melanogaster receptors in HTS formats well 
known to those skilled in the art (e.g., Pausch (1997), above). 

The invention contemplates a multitude of assays to screen and identify 
inhibitors of ligand binding to NgR receptors. In one example, the NgR receptor is 
immobilized and interaction with a binding partner is assessed in the presence and 
10 absence of a candidate modulator such as an inhibitor compound. In another example, 
interaction between the NgR receptor and its binding partner is assessed in a solution 
assay, both in the presence and absence of a candidate inhibitor compound. In either 
assay, an inhibitor is identified as a compound that decreases binding between the NgR 
receptor and its binding partner. Another contemplated assay involves a variation of 
1 5 the di-hybrid assay wherein an inhibitor of protein/protein interactions is identified by 
detection of a positive signal in a transformed or transfected host cell, as described in 
PCT publication number WO 95/20652, published August 3, 1995. 

Candidate modulators contemplated by the invention include compounds 
selected from libraries of either potential activators or potential inhibitors. There are a 
20 number of different libraries used for the identification of small molecule modulators, 
including: (1) chemical libraries, (2) natural product libraries, and (3) combinatorial 
libraries comprised of random peptides, oligonucleotides or organic molecules. 
Chemical libraries consist of random chemical structures, some of which are analogs of 
known compounds or analogs of compounds that have been identified as "hits" or 
25 "leads" in other drug discovery screens, some of which are derived from natural 
products, and some of which arise from non-directed synthetic organic chemistry. 
Natural product libraries are collections of microorganisms, animals, plants, or marine 
organisms that are used to create mixtures for screening by: (1) fermentation and 
extraction of broths from soil, plant or marine microorganisms or (2) extraction of 
30 plants or marine organisms. Natural product libraries include polyketides, 

non-ribosomal peptides, and variants (non-naturally occurring) thereof For a review, 
see Cane et aL, Science (1998) 282, 63-68. Combinatorial libraries are composed of 
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large numbers of peptides, oligonucleotides, or organic compounds as a mixture. 
These libraries are relatively easy to prepare by traditional automated synthesis 
methods, PCR, cloning, or proprietary synthetic methods. Of particular interest are 
non-peptide combinatorial libraries. Still other libraries of interest include peptide, 
5 protein, peptidomimetic, multiparallel synthetic collection, recombinatorial, and 
polypeptide libraries. For a review of combinatorial chemistry and libraries created 
therefrom, see Myers (1997) Curr. Opin. BiotechnoL 8, 701-707. Identification of 
modulators through use of the various libraries described herein permits modification 
of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to modulate 
10 activity. 

Still other candidate inhibitors contemplated by the invention can be designed 
and include soluble forms of binding partners, as well as such binding partners as 
chimeric, or fusion, proteins. A "binding partner" as used herein broadly encompasses 
non-peptide modulators, as well as such peptide modulators as neuropeptides other 

15 than natural ligands, antibodies, antibody fragments, and modified compounds 

comprising antibody domains that are immunospecific for the expression product of the 
identified NgR gene. 

Other embodiments of the invention comprise using competitive screening 
assays in which neutralizing antibodies capable of binding a polypeptide of the 

20 invention specifically compete with a test compound for binding to the polypeptide. In 
this manner, the antibodies can be used to detect the presence of any peptide that 
shares one or more antigenic determinants with NgR. Radiolabeled competitive 
binding studies are described in Lin et aL 9 (1997) Antimicrob. Agents Chemother. 41, 
2127-213 1, the disclosure of which is incorporated herein by reference in its entirety. 

25 In other embodiments of the invention, the polypeptides of the invention are 

employed as a research tool for identification, characterization and purification of 
interacting, regulatory proteins. Appropriate labels are incorporated into the 
polypeptides of the invention by various methods known in the art and the 
polypeptides are used to capture interacting molecules. For example, molecules are 

30 incubated with the labeled polypeptides, washed to remove unbound polypeptides, and 
the polypeptide complex is quantified. Data obtained using different concentrations of 
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polypeptide are used to calculate values for the number, affinity, and association of 
polypeptide with the protein complex. 

Labeled polypeptides are also useful as reagents for the purification of 
molecules with which the polypeptide interacts including, but not limited to, inhibitors. 
5 In one embodiment of affinity purification, a polypeptide is covalently coupled to a 

chromatography column. Cells and their membranes are extracted, and various cellular 
subcomponents are passed over the column. Molecules bind to the column by virtue 
of their affinity to the polypeptide. The polypeptide-complex is recovered from the 
column, dissociated and the recovered molecule is subjected to protein sequencing. 
10 This amino acid sequence is then used to identify the captured molecule or to design 
degenerate oligonucleotides for cloning the corresponding gene from an appropriate 
cDNA library. 

Alternatively, compounds may be identified which exhibit similar properties to 
the ligand for the NgR of the invention, but which are smaller and exhibit a longer half 

15 time than the endogenous ligand in a human or animal body. When an organic 
compound is designed, a molecule according to the invention is used as a "lead" 
compound. The design of mimetics to known pharmaceutically active compounds is a 
well-known approach in the development of pharmaceuticals based on such "lead" 
compounds. Mimetic design, synthesis and testing are generally used to avoid 

20 randomly screening a large number of molecules for a target property. Furthermore, 
structural data deriving from the analysis of the deduced amino acid sequences 
encoded by the DNAs of the present invention are useful to design new drugs, more 
specific and therefore with a higher pharmacological potency. 

Comparison of the protein sequence of the present invention with the 

25 sequences present in all the available databases showed a significant homology with the 
transmembrane portion of G protein coupled receptors. Accordingly, computer 
modeling can be used to develop a putative tertiary structure of the proteins of the 
invention based on the available information of the transmembrane domain of other 
proteins. Thus, novel ligands based on the predicted structure of NgR can be 

30 designed. 

This invention further pertains to novel agents identified by the above-described 
screening assays and uses thereof for treatments as described herein. 
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Compositions and Pharmaceutical Compositions 

In a particular embodiment, the novel molecules identified by the screening 
methods according to the invention are low molecular weight organic molecules, in 
which case a composition or pharmaceutical composition can be prepared thereof for 
5 oral or parenteral administration. The compositions, or pharmaceutical compositions, 
comprising the nucleic acid molecules, vectors, polypeptides, antibodies and 
compounds identified by the screening methods described herein, typically comprise 
the nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable 
carrier. As used herein, "pharmaceutically acceptable carrier" is intended to include 

10 any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, 

isotonic and absorption delaying agents, and the like, compatible with pharmaceutical 
administration. The nature of the carrier or other ingredients will depend on the 
specific route of administration and particular embodiment of the invention to be 
administered. Examples of techniques and protocols that are useful in this context are, 

15 inter alia , found in Remington's PHARMACEUTICAL SCIENCES, 16th ed., (1980) Osol, 
A (Ed.), which is incorporated herein by reference in its entirety. Preferred examples 
of such carriers or diluents include, but are not limited to, water, saline, Ringer's 
solution, dextrose solution and 5% human serum albumin. Liposomes and 
non-aqueous vehicles such as fixed oils may also be used. The use of such media and 

20 agents for pharmaceutically active substances is well known in the art. Except insofar 
as any conventional media or agent is incompatible with the active compound, use 
thereof in the compositions is contemplated. Supplementary active compounds can 
also be incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 

25 with its intended route of administration. Examples of routes of administration include 
oral and parenteral (e.g. y intravenous, intradermal, subcutaneous, inhalation, 
transdermal (topical), transmucosal and rectal administration). Solutions or 
suspensions used for parenteral, intradermal or subcutaneous application can include 
the following components: a sterile diluent such as water for injection, saline solution, 

30 fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
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acid; buffers such as acetates, citrates or phosphates, and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. The pH can be adjusted with acids or 
bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation can 
be enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
5 plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion. For 
intravenous administration, suitable carriers include physiological saline, bacteriostatic 

0 water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline (PBS). 
In all cases, the composition must be sterile and should be fluid to the extent that easy 
syringeability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 

5 example, water, ethanol, polyol (for example, glycerol, propylene glycol and liquid 
polyethylene glycol, and the like), and suitable mixtures thereof The proper fluidity 
can be maintained, for example, by the use of a coating such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action of microorganisms can be achieved by various 

) antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 
ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium 
chloride in the composition. Prolonged absorption of the injectable compositions can 
be brought about by including in the composition an agent which delays absorption, for 

> example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
compound (e.g., a NgR protein or anti-NgR antibody) in the required amount in ah 
appropriate solvent with one or a combination of ingredients enumerated above, as 
required, followed by filtered sterilization. Generally, dispersions are prepared by 

► incorporating the active compound into a sterile vehicle that contains a basic dispersion 
medium and the required other ingredients from those enumerated above. In the case 
of sterile powders for the preparation of sterile injectable solutions, methods of 
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preparation are vacuum drying and freeze-drying that yields a powder of the active 
ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof 

Oral compositions generally include an inert diluent or an edible carrier. They 
5 can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients 
and used in the form of tablets, troches or capsules. Oral compositions can also be 
prepared using a fluid carrier for use as a mouthwash, wherein the compound in the 
fluid carrier is applied orally and swished and expectorated or swallowed. 

10 Pharmaceutically compatible binding agents, and/or adjuvant materials can be included 
as part of the composition. The tablets, pills, capsules, troches and the like can contain 
any of the following ingredients, or compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel or corn starch; a lubricant 

15 such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as 
peppermint, methyl salicylate or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable 

20 propellant, e.g., a gas such as carbon dioxide or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. 
For transmucosal or transdermal administration, penetrants appropriate to the barrier 
to be permeated are used in the formulation. Such penetrants are generally known in 
the art, and include, for example, for transmucosal administration, detergents, bile 

25 salts, and fiisidic acid derivatives. Transmucosal administration can be accomplished 
through the use of nasal sprays or suppositories. For transdermal administration, the 
active compounds are formulated into ointments, salves, gels or creams as generally 
known in the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
30 conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 
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In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
5 polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 
Methods for preparation of such formulations will be apparent to those skilled in the 
art. The materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutically 
10 acceptable carriers. These can be prepared according to methods known to those 
skilled in the art, for example, as described in U.S. Patent No. 4,522,81 1. It is 
especially advantageous to formulate oral or parenteral compositions in dosage unit 
form for ease of administration and uniformity of dosage. Dosage unit form as used 
herein refers to physically discrete units suited as unitary dosages for the subject to be 
15 treated; each unit containing a predetermined quantity of active compound calculated 
to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active 
compound and the particular therapeutic effect to be achieved^ 
20 The nucleic acid molecules of the invention can be inserted into vectors and 

used as gene therapy vectors. Gene therapy vectors can be delivered to a subject by 
any of a number of routes, e.g., as described in U.S. Patent No. 5,703,055. Delivery 
can thus also include, e.g., intravenous injection, local administration (see U.S. Patent 
No. 5,328,470) or stereotactic injection (see e.g., Chen etal. (1994) Proc. Natl. Acad. 
25 Set USA 91, 3054-3057). The pharmaceutical preparation of the gene therapy vector 
can include the gene therapy vector in an acceptable diluent, or can comprise a slow 
release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the 
complete gene delivery vector can be produced intact from recombinant cells, e.g., 
retroviral vectors, the pharmaceutical preparation can include one or more cells that 
30 produce the gene debvery system. 

The pharmaceutical compositions can be included in a container, pack or 
dispenser together with instructions for administration. 
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The dosage of these low molecular weight compounds will depend on the 
disease state or condition to be treated and other clinical factors such as weight and 
condition of the human or animal and the route of administration of the compound. 
For treating human or animals, between approximately 0.5 mg/kg of body weight to 
5 500 mg/kg of body weight of the compound can be administered. Therapy is typically 
administered at lower dosages and is continued until the desired therapeutic outcome is 
observed. 

Another aspect of the present invention is the use of the NgR nucleotide 
sequences disclosed herein for identifying homologs of the Nogo-R, in other animals, 

10 including but not limited to humans and other mammals and invertebrates. Any of the 
nucleotide sequences disclosed herein, or any portion thereof can be used, for 
example, as probes to screen databases or nucleic acid libraries, such as, for example, 
genomic or cDNA libraries, to identify homologs using screening procedures well 
known to those skilled in the art. Accordingly, homologs having at least 50%, more 

1 5 preferably at least 60%, more preferably at least 70%, more preferably at least 80%, 
more preferably at least 90%, more preferably at least 95%, and most preferably at 
least 100% homology with NgR sequences can be identified. 

The present compounds and methods, including nucleic acid molecules, 
polypeptides, antibodies, compounds identified by the screening methods described 

20 herein, have a variety of pharmaceutical applications and may be used, for example, to 
treat or prevent unregulated cellular growth, such as cancer cell and tumor growth. In 
a particular embodiment, the present molecules are used in gene therapy. For a review 
of gene therapy procedures, see e.g. Anderson Science (1992) 256, 808-813, which is 
incorporated herein by reference in its entirety. 

25 The present invention also encompasses a method of agonizing (stimulating) or 

antagonizing a NgR natural binding partner associated activity in a mammal comprising 
administering to said mammal an agonist or antagonist to one of the above disclosed 
polypeptides in an amount sufficient to effect said agonism or antagonism. One 
embodiment of the present invention, then, is a method of treating diseases in a 

30 mammal with an agonist or antagonist of the protein of the present invention 
comprising administering the agonist or antagonist to a mammal in an amount 
sufficient to agonize or antagonize NgR-associated functions. 
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Methods of detennining the dosages of compounds to be administered to a 
patient and modes of administering compounds to an organism are disclosed in U.S. 
Application Serial No. 08/702,282, filed August 23, 1996, and International patent 
publication number WO 96/22976, published August 1, 1996, both of which are 
5 incorporated herein by reference in their entirety, including any drawings, figures or 
tables. Those skilled in the art will appreciate that such descriptions are applicable to 
the present invention and can be easily adapted to it. 

The proper dosage depends on various factors such as the type of disease being 
treated, the particular composition being used and the size and physiological condition 
10 of the patient. Therapeutically effective doses for the compounds described herein can 
be estimated initially from cell culture and animal models. For example, a dose can be 
formulated in animal models to achieve a circulating concentration range that initially 
takes into account the IC 50 as determined in cell culture assays. The animal model data 
can be used to more accurately determine useful doses in humans. 
1 5 Plasma half-life and biodistribution of the drug and metabolites in the plasma, 

tumors and major organs can also be determined to facilitate the selection of drugs 
most appropriate to inhibit a disorder. Such measurements can be carried out. For 
example, HPLC analysis can be performed on the plasma of animals treated with the 
drug and the location of radiolabeled compounds can be determined using detection 
20 methods such as X-ray, CAT scan and MRI. Compounds that show potent inhibitory 
activity in the screening assays, but have poor pharmacokinetic characteristics, can be 
optimized by altering the chemical structure and retesting. In this regard, compounds 
displaying good pharmacokinetic characteristics can be used as a model. 

Toxicity studies can also be carried out by measuring the blood cell 
25 composition. For example, toxicity studies can be carried out in a suitable animal 
model as follows: (1) the compound is administered to mice (an untreated control 
mouse should also be used); (2) blood samples are periodically obtained via the tail 
vein from one mouse in each treatment group; and (3) the samples are analyzed for red 
and white blood cell counts, blood cell composition and the percent of lymphocytes 
30 versus polymorphonuclear cells. A comparison of results for each dosing regime with 
the controls indicates if toxicity is present. 
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At the termination of each toxicity study, further studies can be carried out by 
sacrificing the animals (preferably, in accordance with the American Veterinary 
Medical Association guidelines Report of the American Veterinary Medical Assoc. 
' Panel on Euthanasia, (1993) J. Am. Vet. Med Assoc. 202:229-249). Representative 
5 animals from each treatment group can then be examined by gross necropsy for 

immediate evidence of metastasis, unusual illness or toxicity. Gross abnormalities in 
tissue are noted and tissues are examined histologically. Compounds causing a 
reduction in body weight or blood components are less preferred, as are compounds 
having an adverse effect on major organs. In general, the greater the adverse effect the 
10 less preferred the compound. 

For the treatment of cancers the expected daily dose of a hydrophobic 
pharmaceutical agent is between 1 to 500 mg/day, preferably 1 to 250 mg/day, and 
most preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided 
plasma levels of the active moiety are sufficient to maintain therapeutic effectiveness. 
15 Plasma levels should reflect the potency of the drug. Generally, the more potent the 
compound the lower the plasma levels necessary to achieve efficacy. 

NgR mRNA transcripts have been found in the brain and heart. SEQ ID NOs. 
1 and/or, 3 will, as detailed above, enable screening the endogenous 
neurotransmitters/hormones/ligands which activate, agonize, or antagonize NgR and 
20 for compounds with potential utility in treating disorders including CNS disorders 

(e.g., stroke) and degenerative disorders such as those associated with demyelination. 

For example, NgR receptor activation may mediate the prevention of neurite 
outgrowth. Inhibition would be beneficial in both chronic and acute brain injury. See, 
e.g., Donovan et aL, (1997) J. Neuvosci. 17, 53 16-5326; Turgeon et aL, (1998) J. 
25 Neurosci. 18, 6882-6891; Smith-Swintosky et aL, (1997) J. Neurochem. 69, 

1890-1896; Gill et aL, (1998) Brain Res. 797, 321-327; Suidan^a/., (1996)Semin. 
Thromb. Hemost. 22, 125-133. 
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Pharmacogenoraics 

Agents, or modulators that have a stimulatory or inhibitory effect on Ngjl 
activity (e.g., NgR gene expression), as identified by a screening assay described herein 
can be administered to individuals to treat (prophylactically or therapeutically) 
5 disorders (e.g., a disease condition such as a demyelination disorder) associated with 
aberrant NgR activity. In conjunction with such treatment, the pharmacogenomics 
(i.e., the study of the relationship between an individual's genotype and that individual's 
response to a foreign compound or drug) of the individual may be considered. 
Differences in metabolism of therapeutics can lead to severe toxicity or therapeutic 

10 failure by altering the relation between dose and blood concentration of the 

pharmacologically active drug. Thus, the pharmacogenomics of the individual permits 
the selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments 
based on a consideration of the individual's genotype. Such pharmacogenomics can 
further be used to determine appropriate dosages and therapeutic regimens. 

15 Accordingly, the activity of NgR protein, expression of NgR nucleic acid or mutation 
content of NgR genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected 

20 persons. See e.g., Eichelbaum (1996) Clin. Exp. Pharmacol. Physiol. 23, 983-985 and 
Linder (1997) Clin. Chem. 43, 254-266. In general, two types of phannacogenetic 
conditions can be differentiated. Genetic conditions transmitted as a single factor 
altering the way drugs act on the body (altered drug action) or genetic conditions 
transmitted as single factors altering the way the body acts on drugs (altered drug 

25 metabolism). These pharmacogenetic conditions can occur either as rare defects or as 
polymorphisms. For example, glucose-6-phosphate dehydrogenase (G6PD) deficiency 
is a common inherited enzymopathy in which the main clinical complication is 
haemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics, 
nitrofurans) and consumption of fava beans. 

30 As an illustrative embodiment, the activity of drug metabolizing enzymes is a 

major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 
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(NAT 2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an 
explanation as to why some patients do not obtain the expected drug effects or show 
exaggerated drug response and serious toxicity after taking the standard and safe dose 
of a Srug. These polymorphisms are expressed in two phenotypes in the population, 
5 the extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is 
different among different populations. For example, the gene coding for CYP2D6 is 
highly polymorphic and several mutations have been identified in PM, which all lead to 
the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 
quite frequently experience exaggerated drug response and side effects when they 
10 receive standard doses. If a metabolite is the active therapeutic moiety, PM show no 
therapeutic response, as demonstrated for the analgesic effect of codeine mediated by 
its CYP2D6-formed metabolite morphine. At the other extreme are the so called ultra- 
rapid metabolizers who do not respond to standard doses. Recently, the molecular 
basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene 
15 amplification. 

Thus, the activity of NgR protein, expression of NgR nucleic acid, or mutation 
content of NgR genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping of polymorphic alleles 
20 encoding drug-metabolizing enzymes to the identification of an individual's drug 

responsiveness phenotype. This knowledge, when applied to dosing or drug selection, 
can avoid adverse reactions or therapeutic failure and thus enhance therapeutic or 
prophylactic efficiency when treating a subject with a NgR modulator, such as a 
modulator identified by one of the exemplary screening assays described herein. 

25 

Monitoring Clinical Efficacy 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression 
or activity of NgR (e.g., the ability to modulate aberrant cell proliferation and/or 
differentiation) can be applied not only in basic drug screening, but also in clinical 
30 trials. For example, the effectiveness of an agent determined by a screening assay as 
described herein to increase NgR gene expression, protein levels or upregulate NgR 
activity, can be monitored in clinical trials of subjects exhibiting decreased NgR gene 
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expression, protein levels, or downregulated NgR activity. Alternatively, the 
effectiveness of an agent determined by a screening assay to decrease NgR gene 
expression, protein levels, or downregulate NgR activity, can be monitored in clinical 
trials of subjects exhibiting increased NgR gene expression, protein levels, or 
5 upregulated NgR activity. In such clinical trials, the expression or activity of NgR and, 
preferably, other genes that have been implicated in, for example, a disease or disorder, 
can be used as a "read out" or markers of the immune responsiveness of a particular 
cell. 

For example, genes, including NgR, that are modulated in cells by treatment 
10 with an agent (e.g. , compound, drug or small molecule) that modulates NgR activity 
(e.g., identified in a screening assay as described herein) can be identified. Thus, to 
study the effect of agents on demyelination disorders, for example, in a clinical trial, 
cells can be isolated and RNA prepared and analyzed for the levels of expression of 
NgR and other genes implicated in the disorder. The levels of gene expression (i.e., a 
15 gene expression pattern) can be quantified by Northern blot analysis or RT-PCR, as 

described herein, or alternatively by measuring the amount of protein produced by one 
of the methods as described herein or by measuring the levels of activity of NgR or 
other genes. In this way, the gene expression pattern can serve as a marker, indicative 
of the physiological response of the cells to the agent. Accordingly, this response state 
20 may be determined before, and at various points during, treatment of the individual 
with the agent. 

In one embodiment, the invention provides a method for monitoring the 
effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, 
protein, peptide, peptidomimetic, nucleic acid, small molecule, or other drug candidate 

25 identified by the screening assays described herein) comprising the steps of (z) 

obtaining a pre-administration sample from a subject prior to administration of the 
agent; (ii) detecting the level of expression of a NgR protein, mRNA, or genomic 
DNA in the preadministration sample; (in) obtaining one or more post-administration 
samples from the subject; (/v) detecting the level of expression or activity of the NgR 

30 protein, mRNA, or genomic DNA in the post-administration samples; (v) comparing 
the level of expression or activity of the NgR protein, mRNA or genomic DNA in the 
pre-administration sample with the NgR protein, mRNA or genomic DNA in the post 
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administration sample or samples; and (vi) altering the administration of the agent to 
the subject accordingly. For example, increased administration of the agent may be 
desirable to increase the expression or activity of NgR to higher levels than detected, 
te. 9 to increase the effectiveness of the agent. Alternatively, decreased administration 
5 of the agent may be desirable to decrease expression or activity of NgR to lower levels 
than detected, i.e., to decrease the effectiveness of the agent. 

Methods of Treatment 

The present invention provides for both prophylactic and therapeutic methods 

10 of treating a subject at risk of (or susceptible to) a disorder or having a disorder 
associated with aberrant NgR expression or activity. 

Diseases and disorders that are characterized by increased (relative to a subject 
not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that 

15 antagonize activity may be administered in a therapeutic or prophylactic manner. 

Therapeutics that may be utilized include, but are not limited to, (/) a NgR polypeptide, 
or analogs, derivatives, fragments or homologs thereof; (//) antibodies to a NgR 
peptide; (Hi) nucleic acids encoding a NgR peptide; (rv) administration of antisense 
nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous 

20 insertion within the coding sequences of coding sequences to a NgR peptide) are 
utilized to "knockout" endogenous function of a NgR peptide by homologous 
recombination (see, e.g., Capecchi (1989) Science 244, 1288-1292); or (v) modulators 
(i.e., inhibitors, agonists and antagonists, including additional peptide mimetic of the 
invention or antibodies specific to a peptide of the invention) that alter the interaction 

25 between a NgR peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject 
not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that 
upregulate activity may be administered in a therapeutic or prophylactic maimer. 

30 Therapeutics that may be utilized include, but are not limited to, a NgR peptide, or 
analogs, derivatives, fragments or homologs thereof, or an agonist that increases 
bioavailability. 
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Increased or decreased levels can be readily detected by quantifying peptide 
and/or RNA, by obtaining a patient tissue sample {e.g., from biopsy tissue) and 
assaying it in vitro for RNA or peptide levels, structure and/or activity of the expressed 
peptides (or mRNAs of a NgR peptide). Methods that are well-known within the art 
include, but are not limited to, immunoassays (e.g., by Western blot analysis, 
immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel 
electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect 
expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, etc.). 

In one aspect, the invention provides a method for preventing, in a subject, a 
disease or condition associated with an aberrant NgR expression or activity, by 
administering to the subject an agent that modulates NgR expression or at least one 
NgR activity. Subjects at risk for a disease that is caused or contributed to by aberrant 
NgR expression or activity can be identified by, for example, any or a combination of 
diagnostic or prognostic assays as described herein. Administration of a prophylactic 
agent can occur prior to the manifestation of symptoms characteristic of the NgR 
aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its 
progression. Depending on the type of NgR aberrancy, for example, a NgR agonist or 
NgR antagonist agent can be used for treating the subject. The appropriate agent can 
be determined based on screening assays described herein. 

Another aspect of the invention pertains to methods of modulating NgR 
expression or activity for therapeutic purposes. The modulatory method of the 
invention involves contacting a cell with an agent that modulates one or more of the 
activities of NgR protein activity associated with the cell. An agent that modulates 
NgR protein activity can be an agent as described herein, such as a nucleic acid or a 
protein, a naturally-occurring cognate ligand of a NgR protein, a peptide, a NgR 
peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one 
or more NgR protein activity. Examples of such stimulatory agents include active 
NgR protein and a nucleic acid molecule encoding NgR that has been introduced into 
the cell. In another embodiment, the agent inhibits one or more NgR protein activity. 
Examples of such inhibitory agents include antisense NgR nucleic acid molecules and 
anti-NgR antibodies. These modulatory methods can be performed in vitro (e.g., by 
culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the 
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agent to a subject). As such, the present invention provides methods of treating an 
individual afflicted with a disease or disorder characterized by aberrant expression or 
activity of a NgR protein or nucleic acid molecule. In one embodiment, the method 
involves administering an agent (e.g., an agent identified by a screening assay described 
5 herein), or combination of agents that modulates (e.g., upregulates or downregulates) 
NgR expression or activity. In another embodiment, the method involves 
administering a NgR protein or nucleic acid molecule as therapy to compensate for 
reduced or aberrant NgR expression or activity. 

10 Gene Therapy 

Mutations in the NgR gene that result in loss of normal function of the NgR 
gene product underlie NgR human disease states. The invention comprehends gene 
therapy to restore Ngjt activity to treat those disease states. Delivery of a functional 
NgR gene to appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, 

15 and more particularly viral vectors (e.g., adenovirus, adeno-associated virus, or a 
retrovirus), or ex vivo by use of physical DNA transfer methods (e.g., liposomes or 
chemical treatments). See, for example, Anderson (1998) Nature, supplement to 
392(6679):25-20. For additional reviews of gene therapy technology see Friedmann 
(1989) Science 244, 1275-1281; Verma (1990) Sci. Am. 68-84; and Miller (1992) 

20 Nature 357, 455-460. Alternatively, it is contemplated that in other human disease 

states, preventing the expression of, or inhibiting the activity of, NgR will be useful in 
.'treating disease states. It is contemplated that antisense therapy or gene therapy could 
be applied to negatively regulate the expression of NgR. 

The present invention provides for both prophylactic and therapeutic methods 

25 of treating a subject at risk of (or susceptible to) a disorder or having a disorder 
associated with aberrant NgR expression or activity. 

Diseases and disorders that are characterized by increased (relative to a subject 
not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that 

30 antagonize activity may be administered in a therapeutic or prophylactic manner. 

Therapeutics that may be utilized include, but are not limited to, (i) a NgR polypeptide, 
or analogs, derivatives, fragments or homologs thereof; (ii) antibodies to a NgR 
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peptide; (Hi) nucleic acids encoding a NgR peptide; (zv) administration of antisense 
nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous 
insertion within the coding sequences of coding sequences to a NgR peptide) are 
utilized to "knockout 11 endogenous function of a NgR peptide by homologous 

5 recombination (see, e.g., Capecchi (1989) , above); or (v) modulators (i.e., inhibitors, 
agonists and antagonists, including additional peptide mimetic of the invention or 
antibodies specific to a peptide of the invention) that alter the interaction between a 
NgR peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject 

10 not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that 
upregulate activity may be administered in a therapeutic or prophylactic manner. 
Therapeutics that may be utilized include, but are not limited to, a NgR peptide, or 
analogs, derivatives, fragments or homologs thereof; or an agonist that increases 

15 bioavailability. 

Increased or decreased levels can be readily detected by quantifying peptide 
and/or RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and 
assaying it in vitro for RNA or peptide levels, structure and/or activity of the expressed 
peptides (or mRNAs of a NgR peptide). Methods that are well-known within the art 

20 include, but are not limited to, immunoassays (e.g., by Western blot analysis, 

immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel 
electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect 
expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, etc.). 

In one aspect, the invention provides a method for preventing, in a subject, a 

25 disease or condition associated with an aberrant NgR expression or activity, by 

administering to the subject an agent that modulates NgR expression or at least one 
NgR activity. Subjects at risk for a disease that is caused or contributed to by aberrant 
NgR expression or activity can be identified by, for example, any or a combination of 
diagnostic or prognostic assays as described herein. Administration of a prophylactic 

30 agent can occur prior to the manifestation of symptoms characteristic of the NgR 

aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its 
progression. Depending on the type of NgR aberrancy, for example, a NgR agonist or 
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NgR antagonist agent can be used for treating the subject. The appropriate agent can 
be determined based on screening assays described herein. 

Another aspect of the invention pertains to methods of modulating NgR 
expression or activity for therapeutic purposes. The modulatory method of the 
5 invention involves contacting a cell with an agent that modulates one or more of the 
activities of NgR protein activity associated with the cell. An agent that modulates 
NgR protein activity can be an agent as described herein, such as a nucleic acid or a 
protein, a naturally-occurring cognate Iigand of a NgR protein, a peptide, a NgR 
peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one 

10 or more NgR protein activity. Examples of such stimulatory agents include active 

NgR protein and a nucleic acid molecule encoding NgR that has been introduced into 
the cell. In another embodiment, the agent inhibits one or more NgR protein activity. 
Examples of such inhibitory agents include antisense NgR nucleic acid molecules and 
anti-NgR antibodies. These modulatory methods can be performed in vitro (e.g. , by 

15 culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the 
agent to a subject). As such, the present invention provides methods of treating an 
individual afflicted with a disease or disorder characterized by aberrant expression or 
activity of a NgR protein or nucleic acid molecule. In one embodiment, the method 
involves administering an agent (e.g., an agent identified by a screening assay described 

20 herein), or combination of agents that modulates (e.g., upregulates or downregulates) 
NgR expression or activity. In another embodiment, the method involves 
administering a NgR protein or nucleic acid molecule as therapy to compensate for 
reduced or aberrant NgR expression or activity. 

The present invention is not to be limited in scope by the specific embodiments 

25 described herein. Indeed, various modifications of the invention in addition to those 
described herein will become apparent to those skilled in the art from the foregoing 
description and accompanying figure. Such modifications are intended to fall within 
the scope of the appended claims. 

The following Table 5 contains the sequences of exemplary polynucleotides 

30 and polypeptides of the invention. 
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TABLES 

The following DNA sequence NgR2 <SEQ ID NO. 1> was identified in humans: 



10 



15 



20 



25 



30 



35 



40 



ATGCTGCCCGGGCTCAGGCGCCTGCTGCAAGCTCCCGCCTCGGCCTGCCTCCTGCTGATG 
CTCCTGGCCCTGCCCCTGGCGGCCCCCAGCTGCCCCATGCTCTGCACCTGCTACTCATCC 
CCGCCCACCGTGAGCTGCCAGGCCAACAACTTCTCCTCTGTGCCGCTGTCCCTGCCACCC 
AGCACTCAGCGACTCTTCCTGCAGAACAACCTCATCCGCACGCTGCGGCCAGGCACCTTT 
GGGTCCAACCTGCTCACCCTGTGGCTCTTCTCCAACAACCTCTCCACCATCTACCCGGGC 
ACTTTCCGCCACTTGCAAGCCCTGGAGGAGCTGGACCTCGGTGACAACCGGCACCTGCGC 
TCGCTGGAGCCCGACACCTTCCAGGGCCTGGAGCGGCTGCAGTCGCTGCATTTGTACCGC 
TGCCAGCTCAGCAGCCTGCCCGGCAACATCTTCCGAGGCCTGGTCAGCCTGCAGTACCTC 
TACCTCCAGGAGAACAGCCTGCTCCACCTACAGGATGACTTGTTCGCGGACCTGGCCAAC 
CTGAGCCACCTCTTCCTCCACGGGAACCGCCTGCGGCTGCTCACAGAGCACGTGTTTCGC 
GGCCTGGGCAGCCTGGACCGGCTGCTGCTGCACGGGAACCGGCTGCAGGGCGTGCACCGC 
GCGGCCTTCCGCGGCCTCAGCCGCCTCACCATCCTCTACCTGTTCAACAACAGCCTGGCC 
TCGCTGCCCGGCGAGGCGCTCGCCGACCTGCCCTCGCTCGAGTTCCTGCGGCTCAACGCT 
AACCCCTGGGCGTGCGACTGCCGCGCGCGGCCGCTCTGGGCCTGGTTCCAGCGCGCGCGC 
GTGTCCAGCTCCGACGTGACCTGCGCCACCCCCCCGGAGCGCCAGGGCCGAGACCTGCGC 
GCGCTCCGCGAGGCCGACTTCCAGGCGTGTCCGCCCGCGGCACCCACGCGGCCGGGCAGC 
CGCGCCCGCGGCAACAGCTCCTCCAACCACCTGTACGGGGTGGCCGAGGCCGGGGCGCCC 
CCAGCCGATCCCTCCACCCTCTACCGAGATCTGCCTGCCGAAGACTCGCGGGGGCGCCAG 
GGCGGGGACGCGCCTACTGAGGACGACTACTGGGGGGGCTACGGGGGTGAGGACCAGCGA 
GGGGAGCAGATGTGCCCCGGCGCTGCCTGCCAGGCGCCCCCGGACTCCCGAGGCCCTGCG 
CTCTCGGCCGGGCTCCCCAGCCCTCTGCTTTGCCTCCTGCTCCTGGTGCCCCACCACCTC 

The following amino acid sequence <SEQ ID NO. 2> is the predicted amino acid 

sequence derived from the DNA sequence of SEQ ID NO. 1 : 

MLPGLRRLLQAPASACLLLMLLALPLAAPSC 

PMLCTCYSSPPTVSCQANNFSSVPLSLPPST 

QRLFLQNNLIRTLRPGTFGSNLLTLWLFSNN 

LSTIYPGTFRHLQALEELDLGDNRHLRSLEP 

DTFQGLERLQSLHLYRCQLSSLPGNIFRGLV 

SLQYLYLQENSLLHLQDDLFADLANLSHLFL . 

HGNRLRLLTEHVFRGLGSLDRLLLHGNRLQG 

VHRA AFRGLSRLTILYLFNNSLASLPGEALA 

DLPSLEFLRLNANPWACDCRARPLWAWFQRA 

RVSSSDVTCATPPERQGRDLRALREADFQAC 

PPAAPTRPGSRARGNSS SNHLYGVAEAGAPP 

ADPSTLYRDLPAEDSRGRQGGDAPTEDDYWG 

GYGGEDQRGEQMCPGAACQAPPDSRGPALSA 

GLP SPLLCLLLLVPHHL 
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The following DNA sequence NgR3 <SEQ ID NO. 3> was identified in mouse: 

ATGTCTTGGCAGTCTGGAACCACAGTGACACAATCTCCCGTGCAGGCTGCTCAGGTCTCA 
GGGTGCTGTGTGGAATTGCTGCTGTTGCTGCTCGCTGGAGAGCTACCTCTGGGTGGTGGT 
TGTCCTCGAGACTGTGTGTGCTACCCTGCGCCCATGACTGTCAGCTGCCAGGCACACAAC 
TTTGCTGCCATCCCGGAGGGCATCCCAGAGGACAGTGAGCGCATCTTCCTGCAGAACAAT 
CGCATCACCTTCCTCCAGCAGGGCCACTTCAGCCCCGCCATGGTCACCCTCTGGATCTAC 
TCCAACAACATCACTTTCATTGCTCCCAACACCTTCGAGGGCTTTGTGCATCTGGAGGAG 
CTAGACCTTGGAGACAACCGACAGCTGCGAACGCTGGCACCCGAGACCTTCCAAGGCCTG 
GTGAAGCTTCACGCCCTCTACCTCTATAAGTGTGGACTGAGCGCCCTGCCCGCAGGCATC 
TTTGGTGGCCTGCACAGCCTGCAGTATCTCTACTTGCAGGACAACCATATCGAGTACCTC 
CAAGATGACATCTTTGTGGACCTGGTCAATCTCAGTCACTTGTTTCTCCATGGTAACAAG 
CTATGGAGCCTGGGCCAAGGCATCTTCCGGGGCCTGGTGAACCTGGACCGGTTGCTGCTG 
CATGAGAACCAGCTACAGTGGGTTCACCACAAGGCTTTCCATGACCTCCACAGGCTAACC 
ACC CTCTTTCTCTTCAACAACAGCCTC ACTGAGCTGCAGGGTGACTGTCTGGC CCCC CTG 
GTGGCCTTGGAGTTCCTTCGCCTCAATGGGAATGCTTGGGACTGTGGCTGCCGGGCACGT 
TCCCTGTGGGAATGGCTGCGAAGGTTCCGTGGCTCTAGCTCTGCTGTCCCCTGCGCGACC 
CCCGAGCTGCGGCAAGGCCAGGATCTGAAGCTGCTGAGGGTGGAGGACTTCCGGAACTGC 
ACAGGACCAGTGTCTCCTCACCAGATCAAGTCTCACACGCTTACCACCTCTGACAGGGCT 
GCCCGCAAGGAGCACCATCCGTCCCATGGGGCCTCCAGGGACAAAGGCCACCCACATGGC 
CATCCGCCTGGCTCCAGGTCAGGTTACAAGAAGGCAGGCAAGAACTGCACCAGCCACAGG 
AACCGGAACCAGATCTCTAAGGTGAGCTCTGGGAAAGAGCTTACCGAACTGCAGGACTAT 
GCCCCCGACTATCAGCACAAGTTCAGCTTTGACATCATGCCCACCGCACGACCCAAGAGG 
AAGGGCAAGTGTGCTCGCAGGACCCCCATCCGTGCCCCCAGTGGGGTGCAGCAGGCATCC 
TCAGGCACGGCCCTTGGGGCCCCACTCCTGGCCTGGATACTGGGGCTGGCAGTCACTCTC 
CGC 



30 



35 



40 



The following protein sequence <SEQ ID NO. 4> is deduced protein of SEQ ID 

NO:3: 

MSWQSGTTVTQSPVQAAQVSGCCVELLLLLL 
A GELPLGGGCPRDC VCYP APMTVS CQAHNF A 
AIPEGIPEDSERIFLQNNRITFLQQGHFSPA 
MVTLWI YSNNITFIAPNTFEGFVHLEELDLG 
DNRQLRTL APETFQGL VKLH ALYLYKCGL S A 
LPAGIFGGLHSLQYLYLQDNHIEYLQDDIF V 
DLVNLSHLFLHGNKLWSLGQGIFRGLVNLDR 
LLLHENQLQWVHHKAFHDLHRLTTLFLFNNS 
LTELQGDCLAPLVALEFLRLNGNAWDCGCRA 
RSLWEWLRRFRGSSSAVPCATPELRQGQDLK 
LLRVEDFRNCTGPVSPHQIKSHTLTTSDRAA 
RKEHHPSHGASRDKGHPHGHPPGSRSGYKKA 
GKNCTSHRNRNQISKVSSGKELTELQDYAPD 
YQHKFSFDIMPT ARPKRKGKCARRTPIRAPS 
GVQQASSGTALGAPLLAWILGLAVTLR 
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The following protein sequence <SEQ ID NO. 5> is NgRl from humans: 

MKRAS AGGSRLLAWVLWLQAWQVAAPCPGA 
C 

CYNEPKVTTSCPQQGLQAVPVGIPAASQRI 
FLHGNRISHVPAASFRACRNLTILWLHSNVL 
ARIDAAAFTGLALLEQLDLSDNAQLRSVDPA 
TFHGLGRLHTLHLDRCGLQELGPGLFRGLAA 
LQYLYLQDNALQALPDDTFRDLGNLTHLFLH 
GNRIS SVPERAFRGLHSLDRLLLHQNRVAHV 
HPHAFRDLGRLMTLYLFANNLS ALPTEALAP 
LRALQYLRLNDNPWVCDCRARPLWAWLQKFR 
GSSSEVPCSLPQRLAGRDLKRLAANDLQGCA 
VATGPYHPIWTGRATDEEPLGLPKCCQPDAA 
DKASVLEPGRPASAGNALKGRVPPGDSPPGN 
GSGPRHINDSPFGTLPGSAEPPLTAVRPEGS 
EPPGFPTSGPRRRPGCSRKNRTRSHCRLGQA 
GSGGGGTGDSEGSGALPSLTCSLTPLGLALV 
LWTVLGPC 

The following amino acid sequence <SEQ ID NO:6> is a Consensus Sequence of 
NgR based on homology with NgRl 

CPXXCXCYXXPXXTXSCXXXXXXXXPX 
XXPXXXXRXFLXXNXIXXXXXXXFXXXXXXXXL WX 
XSNXXXXIXXXXFXXXXXLEXLDLXDNXXLR 
XXXPXTFXGLXXLXLXLXXCXLXXLXXXXFX 
GLXXLQYLYLQXNXXXXLXDDXFXDLXNLXH 
LFLHGNXXXXXXXXXFRGLXXLDRLLLHXNX 
XXXVHXXAFXXLXRLXXLXLFXNXLXXLXXX 
XLAXLXXLXXLRLNXNXWXCXCRARXLWXWX 
XXXRXSSSXVXCXXPXXXXGXDLXXLXXXDX 
XXCXXXXXPXXPXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXGXXXXXXXXXXXX 
PPXXXSXXXXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXRXXX 
XXXXXXXXXXXXXXXXXXXXXXXXLXXXXX 
XX X XX L 



The following protein sequence <SEQ ID NO:7> is the 66 amino acid active domain 
ofNogo: 

RIYKGVIQAIQKSDEGHPFRAYLESEVAISE 
ELVQKYSNSALGHVNCTIKELRRLFLVDDLV 
DS L K 
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The following protein sequence <SEQ ID NO:8> is the amino acid sequence of the 
mature NgR2: 

CPMLCTCYSSPPTVSC Q A NNFSSVPLSLPPS 
TQRLFLQNNLIRTLRP GTFG SNLLTLWL F-S N 
NLSTIYPGTFRHLQALEELDLGDNRHLRSLE 
PDTFQGLERLQ SLHLYRCQLSSLPGNIFRGL 
VSLQYLYLQENSLLHLQDDLFADLANLSHLF 
LHGNRLRLLTEHVFRGLGSLDRLLLHGNRLQ 
GVHRAAFRGLSRLTILYLFNNSLASLPGEAL 
ADLPSLEFLRLNANPWACDCRARPLWAWFQR 
ARVSSSDVTCATPPERQGRDLRALREADFQA 
CPPAAPTRPGSRARGNSSSNHLYGVAEAGAP 
PADPSTLYRDLPAEDSRGRQGGDAPTEDDYW 
GGYGGEDQRGEQMCPGAACQAPPDSRGPALS 
AGLPSPLLCLLLL VP H H L 



The following protein sequence <SEQ ID NO:9> is the amino acid sequence of the 
mature NgR3 : 

CPRDCVCYPAPMTVS CQAHNFAAIPEGIPED 
SERIFLQNNRITFLQQGHFSPAMVTLWIYSN 
NITFIAPNTFEGFVHLEELDLGDNRQLRTLA 
PETFQGLVKLHALYLYKCGLSALPAGIFGGL 
HSLQYLYLQDNHIEYLQDDIFVDLVNLSHLF 
LHGNKLWSLGQGIFRGLVNLDRLLLHENQLQ 
WVHHKAFHDLHRLTTLFLFNNSLTELQGDCL 
APLVALEFLRLNGNAWDCGCRARSLWEWLRR 
FRGSSSAVPCATPELRQGQDLKLLRVEDFRN 
CTGPVSPHQIKSHTLTTSDRAARKEHHP SHG 
ASRDKGHPHGHPPGSRSGYKKAGKNCTSHRN 
RNQISKVSSGKELTELQDYAPDYQHKFSFDI 
MPTARPKRKGKCARRTPIRAPSGVQQASSGT 
ALGA PL L AW I L G L A VT LR 



The following amino acid sequence <SEQ ID NO:10> is a conserved cysteine motif 
(Cysteine domain 1) of the NgR and homologs based on the Consensus Sequence: 
CPXXCXCYXXPXXTXSC 
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The following amino acid sequence <SEQ ID NO: 1 1> is a conserved cysteine motif 

(Cysteine domain 2) of the NgR and homologs based on the Consensus Sequence: 

NXWXCXCRARXLWXWXXXXRXSSSXVXCXXP 
XXX XGXDLXXLXXXDXXXC 

5 



The following amino acid sequence <SEQ ID NO:12> is a conserved Leucine-rich 

domain of the NgR and homologs based on the Consensus Sequence: 

RXFLXXNXIXXXXXXXFXXXXXXXXLWXXSN 
10 XXXXIXXXXFXXXXXLEXLDLXDNXXLRXXX 
PXTFXGLXXLXLXLXXCXLXXLXXXXFXGLX 
XLQYLYLQXNXXXXLXDDXFXDLXNLXHLFL 
HGNXXXXXXXXXFRGLXXLDRLLLHXNXXXX 
VHXXAFXXLXRLXXLXLFXNXLXXLXXXXLA 
15 XLXXL XX LR L 



Unless otherwise indicated, X is any amino acid. For example, X where 
indicated may be no amino acid. Additional features of the invention will be apparent 
from the following Examples. Examples 1-5 are actual, while the remaining Examples 

20 are prophetic. 

As shown by the following Examples, a gene encoding novel NgRs have been 
identified by computational analysis of DNA sequence data. The proteins encoded by 
NgR2 and NgR3 have a putative signal sequence, eight leucine-rich repeat domains in 
a conserved leucine-rich region (SEQ ID NO: 12), a conserved cysteine-rich region 

25 (SEQ ID NO: 10) N-terminal to the leucine-rich region, a second cysteine-rich domain 
(SEQ ID NO: 1 1) C-terminal to the leucine-rich region, and a putative 
glycophosphatidylinositol-linkage (GPI-linkage) site. NgR2 and NgR3 differ from the 
previously identified NgR sequence. The NgR homologs, when compared to known 
NgRs, show a consensus sequence (SEQ ID NOs:6). The putative mature NgR2 and 

30 NgR3 are shown in Table 5 as SEQ ID NOs: 8 and 9, respectively. 



Example 1: Tblastn query of the HTG database 

The protein sequence for the human NgR (NgRl) (SEQ ID NO: 5) was used to 
query the high throughput genomic (HTG) database the use of which is familiar to 
35 those skilled in the art. The HTG database is a part of GenBank, a comprehensive 
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NIH genetic sequence database, which includes an annotated collection of all publicly 
available DNA sequences (Nucleic Acids Res. (2000) 28, 15-8). The HTG database 
includes sequences obtained from genomic DNA. Within genomic DNA, genes are 
typically encoded by multiple segments of DNA called exons. Thus wh en one aligns a 
5 cDNA sequence (or a protein sequence encoded by a cDNA sequence) to a genomic 
sequence, the sequence will be broken up into segments depending on the number of 
exons in the gene. 

The BLAST algorithm, which stands for Basic Local Alignment Search Tool is 
suitable for determining sequence similarity (Altschul et al, (1990) J. Mol Biol 215, 

10 403-410, which is incorporated herein by reference in its entirety). Software for 
performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information (http://www.ncbi.nlm.nih.gov/). The basic BLAST 
algorithm involves first identifying high scoring sequence pair (HSPs) by identifying 
short words of length W in the query sequence that either match or satisfy some 

1 5 positive-valued threshold score T when aligned with a word of the same length in a 
database sequence. T is referred to as the neighborhood word score threshold 
(Altschul et al, supra). These initial neighborhood word hits act as seeds for initiating 
searches to find HSPs containing them. The word hits are extended in both directions 
along each sequence for as far as the cumulative alignment score can be increased. 

20 Extension for the word hits in each direction are halted when: 1) the cumulative 

alignment score falls off by the quantity X from its maximum achieved value; 2) the 
cumulative score goes to zero or below, due to the accumulation of one or more 
negative-scoring residue alignments; or 3) the end of either sequence is reached. The 
Blast algorithm parameters W, T and X determine the sensitivity and speed of the 

25 alignment. The Blast program uses as defaults a word length (W) of 1 1, the 

BLOSUM62 scoring matrix (seeHenikoff etal, (1992) Proc. Natl Acad Sci. USA 
89, 10915-10919, which is incorporated herein by reference in its entirety) alignments 
(B) of 50, expectation (E) of 10, M=5, N=4, and a comparison of both strands. 

The BLAST algorithm (Karlin et al, (1993) Proc. Natl Acad Set USA 90, 

30 5873-5787, which is incorporated herein by reference) and Gapped BLAST perform a 
statistical analysis of the similarity between two sequences. One measure of similarity 
provided by the BLAST algorithm is the smallest sum probability CP(N))> which 
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provides an indication of the probability by which a match between two nucleotide or 
amino acid sequences would occur by chance. For example, a nucleic acid is 
considered similar to a NgR gene or cDNA if the smallest sum probability in 
comparison of the test nucleic acid to a NgR nucleic acid is less than about 1, 
5 preferably less than about 0. 1, more preferably less than about 0.01, and most 
preferably less than about 0.001. 

To query the HTG database with the NgR protein sequence, we used a 
variation of the BLAST algorithm known as the tblastn program, which compares a 
protein query sequence against a nucleotide sequence database dynamically translated 

10 in all reading frames (J. Mol Biol (1990) 215, 403-410: Nucleic Acids Res. (1997) 
25, 3389-3402). The results of the tblastn search indicated the presence of genes in 
the database with a significant identity to the NgR. In addition to finding hits to 
genomic clones which contain the human and mouse NgR genes, we found hits to 
clones where the identity was not as high, but still very significant. Three human 

1 5 clones were found (Accession numbers: AC0685 14, AC01 6869, AC013606) with an 
e-value of 4e-43 and one mouse clone was found (Accession No. AC021768) with an 
e-value of le-78. The three human clones all appeared to encode the same gene, so 
further analysis was confined to AC013606. 

20 Example 2: Prediction of the human NgR2 protein sequence (AC013606) 

The human NgR protein sequence aligned with two regions of translated 
sequence from nucleotide sequence AC013606, indicating that the new gene was 
encoded by at least two exons. In order to define the complete gene, we used the 
computer program GENSCAN™ (J. Mol Biol (1997) 268, 78-94) which can identify 

25 complete exon/intron structures of genes in genomic DNA. The gene prediction by 
GENESCAN™ contained seven exons. By comparing these predicted exons to the 
NgR, it was concluded that the new human gene contains two of these exons and a 
part of another (containing the initiating methionine). The predicted cDNA (mRNA) 
encoded by these three exons was assembled from AC013606 (HTG1 1; deposited 

30 March 2000; length = 143899; GenBank release 1 18.0; SEQ ID NO: 15) by combining 
nucleotides from the three exons whose coordinates are: 123292-123322 (exon 1); 
130035-130516 (exon 2); and 138589-139335 (exon 3). The sequence for this cDNA 
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sequence is SEQ ID NO:l (nucleotide sequence of human NgR2; AC013606). The 
translation of this cDNA provides the protein sequence of human NgR2 (SEQ ID 
NO:2). 

We used the protein sequence of human NgR2 as a query sequence against the 
5 human EST database. A number of hits of high significance were found indicating that 
the NgR2 mRNA is expressed in a number of tissues including fetal brain. 
Furthermore, two of these ESTs provided support for the exon structure that we 
deduced. One EST (Accession No: GB_EST19:AI346757) contains 565 nucleotides 
corresponding to amino acids 84-271 of the human NgR2 (SEQ ID No:4). This spans 

10 the second intron located between amino acids 171 and 172, and provides positive 
evidence for the splicing of exons 2 and 3 at the mRNA level. Another EST 
(GB_EST26:AI929019) contains 545 nucleotides, part of which corresponds to amino 
acids 1-75 of the human NgR2 (SEQ ID NO:2). This spans the first intron located 
between amino acids 10 and 1 1, and provides positive evidence for the splicing of 

15 exons 1 and 2 at the mRNA level. 

Example 3: Prediction of the mouse NgR3 protein sequence (AC021768) 

The human NgR protein sequence aligned with only one region of translated 
sequence from nucleotide sequence AC021768, indicating that most of the new mouse 
gene was encoded by one large exon. However, upon inspection, the protein encoded 
by this exon was missing an initiating methionine. In order to define the complete 
gene, we used the computer program GENSCAN as described above. The gene 
prediction by GENSCAN contained two exons; the large one found by visual 
inspection arid a short one at the 5' end which provided an initiating methionine. The 
predicted cDNA (mRNA) encoded by these two exons was assembled from AC021768 
(HTG14; deposited March 2000; length = 215980; GenBank release 1 18.0; SEQ ID 
NO: 16) by combining nucleotides from the two exons whose coordinates are: the 
complement of 164265-164325 (exon 1); and the complement of 155671-156992 
(exon 2). The sequence for this cDNA sequence is SEQ ID NO:3 (nucleotide 
sequence of mouse NgR3; AC021768). The translation of this cDNA provides the 
protein sequence of mouse NgR3 (SEQ ID NO:4). 
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We used the protein sequence of mouse NgR3 as a query sequence against the 
mouse EST database. One hit of high significance was found indicating that the NgR2 
mRNA is expressed in the heart. This EST (GB_EST20: AI428334) contains 463 
nucleotides, part of which correspond to amino acids 45-193 of mouse NgR3 (SEQ ID 
5 NO:4). 

Example 4: Similarity between the NgRs 

An alignment between NgRl and the two new receptors is shown in Fig. 
1 A-1B. The similarities between these proteins include: 
10 (!) The SignalP program, which locates the signal sequence cleavage 

position, predicts a cleavage before the first conserved cysteine in all the proteins. 
Thus the mature protein in all cases will have a cysteine at the N-terminus. 

(2) All proteins contain eight Leucine Rich Repeats (LRR). LRRs are short 
sequence motifs present in a number of proteins with diverse functions and cellular 

15 locations. These repeats are usually involved in protein-protein interactions. Each 
LRR is composed of a beta-alpha unit. 

(3) All three proteins contain a leucine rich repeat N-terminal domain 
(LRRNT), in which four cysteines are conserved. LRRs are often flanked by cysteine 
rich domains at both their N and C termini. 

20 ( 4 ) All three proteins contain a LRR C-terminal domain (LRRCT). The 

LRRCTs of the three NgR proteins can be distinguished from those of other LRR 
containing proteins, by the pattern of typtophans and cysteines which are completely 
conserved in this domain. 

(5) All three proteins contain a conserved cysteine in the fourth LRR 
25 domain. 

(6) All three proteins contain a conserved potential glycosylation site in the 
eighth LRR domain. 

(7) NgR2 and NgR3 have a hydrophobic C-terminus, as does NgRl, an 
indication that they probably also undergo a modification similar to NgRl, where a 

30 GPI moiety is covalently linked to a C-terminal amino acid. This allows the protein to 
remain tethered to the cell. 
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Example 5: Preparation of Nogo Proteins 

A Nogo binding assay was developed which utilizes a method widely used in 
examining semaphorin and ephrin axonal guidance function (Flanagan & 
Vanderhaeghen (1998) Anmi. Rev. Neurosci. 21,3 09-345; Takahashi etal, (19^9) 
5 Cell 99, 59-69). It involves fusing a secreted placental alkaline phosphatase (AP) 

moiety to the ligand in question to provide a biologically active receptor binding agent 
which can be detected with an extremely sensitive colorimetric assay. For Nogo, an 
expression vector is created encoding a signal peptide, a His6 tag for purification, AP, 
and the 66 amino acid active domain of Nogo. The fusion protein can be purified from 

10 . the conditioned medium of transfected cells in milligram amounts. This protein is 
biologically active as a growth cone collapsing agent with an EC 50 of 1 nM. 

Alternatively, a glutathione-S-transferase Nogo (GST-Nogo) fusion protein 
may be prepared. For GST-Nogo, an expression vector (e.g., a pGEX vector) is 
created encoding a signal peptide, GST, and the 66 amino acid active domain of Nogo. 

15 GST-Nogo may be purified from the culture medium and used as a GST fusion 

protein, or GST may be cleaved from the Nogo portion of the fusion protein with an 
enzyme that recognizes the specific amino acid cleavage sit engineered between the 
GST portion and the Nogo portion of the fusion protein. Such sites are part of the 
commercially available GST vectors. The specific cleavage sites and enzymes may be 

20 used in accordance with the Manufacturer's specifications. 

It has been found that AP-Nogo is actually slightly more potent than 
GST-Nogo, perhaps because the protein is synthesized in a eukaryotic rather than a 
prokaryotic cell. 

Binding of Nogo to immobilized NgR homologs may be performed in an 
25 ELISA-type assay in which AP-Nogo is allowed to react with an immobilized receptor 
homolog. Specificity of binding may be demonstrated in a competitive binding assay 
using increasing amounts of GST-Nogo in the type of assay to show a decreasing 
amount of binding of AP-Nogo (as judged in the colorimetric assay). 



30 Example 6: Transfected COS Cell binding Assays 

The homologs of the present invention may be used in transfection studies in 
COS cells to demonstrate binding of Nogo. Specifically, nucleotide sequences 
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encoding NgR2 and NgR3 may be transfected into COS cells using a suitable vector. 
Non-transfected COS-7 cells do not bind AP-Nogo. However, transfection of COS 
cells with nucleic acid sequences encoding NgRs will make them capable of binding 
Nogo. AP alone does not bind with any stable affinity to these transfected cells, 
5 indicating that any affinity of Nogo for NgR2 or NgR3 would be due to the 66 amino 
acids derived from Nogo. Furthemore, specific affinity of Nogo for the NgR2 or 
NgR3 proteins may be tested in displacement of AP-Nogo assays using GST-Nogo. 
NgR2 and/or NgR3 may also bind homologs of Nogo, which may also be tested using 
this assay. 

10 

Example 7: Expression of NgR in Human Cell Lines using Northern Blot and a 
Random-Primed Probe 

A Northern blot is purchased from a commercial source, or RNA samples from 
cells of interest are run on an agarose gel and blotted to a membrane using any of the 

1 5 well known techniques for Northern blotting. The blot is probed with a fragment of 
NgR2 (SEQ ID NO;l) or NgR3 (SEQ ID NO:3). The probe is prepared from 50 ng 
of cDNA labeled by a random-primed method (Feinberg and Vogelstein (1983) Anal 
Biochem. 132, 6-13). Hybridization is carried out at 68°C for 1 hour in 
ExpressHybTM solution (Clontech, Cat. No. 8015-1) followed by washing with 2X 

20 SSC/0.05% SDS at room temperature and two washes with 0. IX SSC/0. 1% SDS at 
50°C. Expression of NgR2 and/or NgR3 can be assessed by the presence of an 
appropriately sized band on the blot. 

Example 8: Cloning of cDNA corresponding to NgRs 

25 

To obtain the full-length clone corresponding to NgR2 from a cDNA library, 
the following method may be used. A cDNA library is generated using standard 
methods from a tissue known to contain NgR2. Such a tissue was identified in 
Example 2. 1 x 10 6 plaque forming units from the cDNA library may be screened in 
30 duplicate on OPTITRAN™ filters. The filters are hybridized with 32 P-labeled 

oligonucleotides that are generated from the ESTs corresponding to portions of NgR2. 
The hybridization reaction may consist of 400 mis plaque screen buffer (50mM Tris pH 
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7.5, IMNaCl, 0.1% Sodium pyrophosphate, 0.2% Polyvinylpryolidine and 0.2% 
Ficoll) containing 10% Dextran sulfate and 100p,g/ml tRNA and 80 pmol each 
32 P-labeled oligonucleotide at 65°C overnight. The filters are washed twice with 2X 
SSC/1%SDS and twice with IX SSC/1%SDS and exposed to film. Duplicate positives 
5 are purified. DNA from each of these clones is analyzed by restriction enzyme digest 
followed by agarose gel electrophoresis and Southern blotting. The filters are 
hybridized to the 32 P-labeled oligonucleotides used for the original hybridization to 
confirm that inserts hybridize to the probe. The insert is then sequenced to confirm that 
it represents the cDNA for NgR2. Similar methods may be used to generate a 
10 full-length clone corresponding to NgR3 . 

Alternatively, a full-length clone of NgR2 or NgR3 can be obtained by a 
person of ordinary skill in the art employing conventional PCR techniques. 

Example 9: Hybridization Analysis to demonstrate NgR expression in the brain 

15 The expression of NgR in mammals, such as the rat, may be investigated by in 

situ hybridization histochemistry. To investigate expression in the brain, for example, 
coronal and sagittal rat brain cryosections (20 p,m thick) are prepared using a Reichert- 
Jung cryostat. Individual sections are thaw-mounted onto silanized, nuclease-free 
slides (CEL Associates, Inc., Houston, TX), and stored at -80°C. Sections are 

20 processed starting with post-fixation in cold 4% paraformaldehyde, rinsed in cold 

phosphate-buffered saline (PBS), acetylated using acetic anhydride in triethanolamine 
buffer, and dehydrated through a series of alcohol washes in 70%, 95%, and 100% . 
alcohol at room temperature. Subsequently, sections are delipidated in chloroform, 
followed by rehydration through successive exposure to 100% and 95% alcohol at 

25 room temperature. Microscope slides containing processed cryosections are allowed 
to air dry prior to hybridization. Other tissues may be assayed in a similar fashion. 

A NgR-specific probe may be generated using PCR. Following PCR 
amplification, the fragment is digested with restriction enzymes and cloned into 
pBluescript II cleaved with the same enzymes. For production of a probe specific for 

30 the sense strand of NgR, a cloned NgR fragment cloned in pBluescript II may be 
linearized with a suitable restriction enzyme, which provides a substrate for labeled 
run-off transcripts (i.e., cRNA riboprobes) using the vector-borne T7 promoter and 
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commercially available T7 RNA polymerase. A probe specific for the antisense strand 
of NgR may also be readily prepared using the NgR clone in pBluescript II by cleaving 
the recombinant plasmid with a suitable restriction enzyme to generate a linearized 
substrate for the production of labeled run-off cRNA transcripts using the T3 promoter 
5 and cognate polymerase. The riboprobes may be labeled with [ 35 S]-UTP to yield a 
specific activity of about 0.40 x 10 6 cpm/pmol for antisense riboprobes and about 0.65 
x 10 6 cpm/pmol for sense-strand riboprobes. Each riboprobe may be subsequently 
denatured and added (2 pmol/ml) to hybridization buffer which contains 50% 
formamide, 10% dextran, 0.3 MNaCl, 10 mM Tris (pH 8.0), 1 mMEDTA, IX 

10 Denhardt's Solution, and 10 mM dithiothreitol. Microscope slides containing 

sequential brain cryosections may be independently exposed to 45 pi of hybridization 
solution per slide and silanized cover slips may be placed over the sections being 
exposed to hybridization solution. Sections are incubated overnight (15-18 hours) at 
52°C to allow hybridization to occur. Equivalent series of cryosections are then 

15 exposed to sense or antisense NgR-specific cRNA riboprobes. 

Following the hybridization period, coverslips are washed off the slides in IX 
SSC, followed by RNase A treatment involving the exposure of slides to 20 p.g/ml 
KNase A in a buffer containing 10 mM Tris-HCl (pH 7.4), 0.5 M EDTA, and 0.5 M 
NaCl for 45 minutes at 37°C. The cryosections are then subjected to three 

20 high-stringency washes in 0. 1 X SSC at 52°C for 20 minutes each. Following the 

series of washes, cryosections are dehydrated by consecutive exposure to 70%, 95%, 
and 100% ammonium acetate in alcohol, followed by air drying and exposure to 
Kodak BioMax™ MR-1 film. After 13 days of exposure, the film is developed, and 
any significant hybridization signal is detected. Based on these results, slides containing 

25 tissue that hybridized, as shown by film autoradiograms, are coated with Kodak NTB- 
2 nuclear track emulsion and the slides are stored in the dark for 32 days. The slides 
are then developed and counterstained with hematoxylin. Emulsion-coated sections 
are analyzed microscopically to determine the specificity of labeling. The signal is 
determined to be specific if autoradiographic grains (generated by antisense probe 

30 hybridization) are clearly associated with cresyl violate-stained cell bodies. 

Autoradiographic grains found between cell bodies indicate non-specific binding of the 
probe. 
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In some cases, such as using a probe to detect a NgR homolog in a 
heterologous species, in order to achieve optimal hybridization, it may be necessary to 

s 

decrease the stringency conditions. Such conditions are well known to those of 
ordinary skill in the art and examples are provided above. 

5 Expression of NgR in the brain provides an indication that modulators of NgR 

activity have utility for treating neurological disorders. Some other diseases for which 
modulators of NgR may have utility include depression, anxiety, bipolar disease, 
epilepsy, neuritis, neurasthenia, neuropathy, neuroses, and the like. Use of NgR 
modulators, including NgR ligands and anti-NgR antibodies, to treat individuals having 

10 such disease states is intended as an aspect of the invention. 

Example 10: Northern Blot Analysis of NgR-RNA with a PCR-generated Probe 

Northern blot hybridizations may be performed to examine the expression of 
NgR mRNA. A clone containing at least a portion of the sequence of SEQ ID NO: 1 
15 may be used as a probe. Vector-specific primers are used in PCR to generate a 
hybridization probe fragment for 32 P-labeling. The PCR is performed as follows: 

Mix: lpl 
2\il 

20 2\l\ 

IOjiI 

ini 

0.5|il 

25 

83.5*il 



NgR-containing plasmid 
fwd primer (10-50 pM) 
rev primer (10-50 pM) 

lOxPCR buffer (such as that provided with the enzyme, 
Amersham Pharmacia Biotech) 

lOmM dNTP (such as #1 969 064 from Boehringer Mannheim) 
Taq polymerase (such as #27-0799-62, Amersham Pharmacia 
Biotech) 
water 



BNSDOCID: <WO 0229O5SA3JA> 



WO 02/029059 



PCT/US01/31488 



- 117- 



PCR is performed in a Thermocycler using the following program: 



94°C 



5min 



94°C 
55°C 
72°C 



lmin 
Imin 
lmin 



30 cycles 



72°C 



lOmin 



10 



15 



20 



25 



The PCR product may be purified using QIAquick PCR Purification Kit 
(#28104) from Qiagen, and radictively labeled with 32 P-dCTP (#AA0005/250, 
Amersham Pharmacia Biotech)) may be done by random priming using "Ready-to-go 
DNA Labeling Beads" (#27-9240-01) from Amersham Pharmacia Biotech. 
Hybridization is carried out on Human Multiple Tissue Northern Blot from Clontech as 
described in manufacturer's protocol, or on a Northern Blot prepared by running RNA 
samples from cells of interest on an agarose gel and blotting to a membrane using any 
of the known Northern blotting protocols. After exposure overnight on Molecular 
Dynamics Phosphor Imager screen (#MD146-814) bands of an appropriate size are 
visualized. 

Example 11: Recombinant Expression of NgR in Eukaryotic Host Cells 
A. Expression of NgR in Mammalian Cells 

To produce NgR protein, a NgR-encoding polynucleotide is expressed in a 
suitable host cell using a suitable expression vector and standard genetic engineering 
techniques. For example, a NgR-encoding sequence described in Table 4 is subcloned 
into the commercial expression vector pzeoSV2 (Invitrogen, San Diego, CA) and 
transfected into Chinese Hamster Ovary (CHO) cells using the transfection reagent 
FuGENE6™ (Boehringer-Mannheim) and the transfection protocol provided in the 
product insert. Other eukaryotic cell lines, including human embryonic kidney (HEK 
293) and COS cells, are suitable as well. Cells stably expressing NgR are selected by 
growth in the presence of 100 jig/ml zeocin (Stratagene, LaJolla, CA). As an 
alternative to FuGENE6™, the expression vector may carry the gene for dihydrofolate 
reductase (dhfr) and selection of clones with methotrexate (MTX) drug pressure 
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allows for stable transformation of CHO cells. Optionally, NgR may be purified from 
the cells using standard chromatographic techniques. To facilitate purification, antisera 
is raised against one or more synthetic peptide sequences that correspond to portions 
of the NgR amino acid sequence, and the antisera is used to affinity purify Nogo-R. 
5 The NgR also may be expressed in-frame with a tag sequence (e.g., polyhistidine, 
hemaglutinin, FLAG) to facilitate purification. Moreover, it will be appreciated that 
many of the uses for NgR polypeptides, such as assays described below, do not require 
purification of NgR from the host cell. 

10 B. Expression of NgR in CHO cells 

For expression of NgR in Chinese hamster ovary (CHO) cells, a plasmid 
bearing the relevant NgR coding sequence is prepared, using a vector which also bears 
the selectable marker dihydrofolate reductase (DHFR). The plasmid is transfected into 
CHO cells. Selection under MTX drug pressure allows for preparation of stable 
15 transformants of a NgR (NgR2 or NgR3) in an expression plasmid carrying a 
selectable marker such as DHFR. 

C. Expression of NgR in 293 cells 

For expression of NgR in mammalian cells 293 (transformed human, primary 
20 embryonic kidney cells), a plasmid bearing the relevant NgR coding sequence is 
prepared, using vector pSecTag2A (Invitrogen). Vector pSecTag2A contains the 
murine IgK chain leader sequence for secretion, the c-myc epitope for detection of the 
recombinant protein with the anti-myc antibody, a C-terminal polyhistidine for 
purification with nickel chelate chromatography, and a Zeocin resistant gene for 
25 selection of stable transfectants. The forward primer for amplification of this NgR 
cDNA is determined by routine procedures and preferably contains a 5' extension of 
nucleotides to introduce the Hindin cloning site and nucleotides matching the NgR 
sequence. The reverse primer is also determined by routine procedures and preferably 
contains a 5* extension of nucleotides to introduce an Xhol restriction site for cloning 
30 and nucleotides corresponding to the reverse complement of the NgR sequence. The 
PCR conditions are 55 6 C as the annealing temperature. The PCR product is gel 
purified and cloned into the HindJU-XhoI sites of the vector. 
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The DNA is purified using Qiagen chromatography columns and transfected 
into 293 cells using DOTAP™ transfection media (Boehringer Mannheim, 
Indianapolis, IN). Transiently transfected cells are tested for expression after 24 hours 
of transfection, using western blots probed with anti-His and anti-NgR peptide 
5 antibodies. Permanently transfected cells are selected with Zeocin and propagated. 
Production of the recombinant protein is detected from both cells and media by 
Western blots probed with anti-His, anti-Myc or anti-NgR peptide antibodies. 

D. Transient Expression of Nogo-R in COS cells 

For expression of the NgR in COS7 cells, a polynucleotide molecule having a 
nucleotide sequence of SEQ ID NO: 1, for example, can be cloned into vector p3-CI. 
This vector is a pUC18-derived plasmid that contains the HCMV (human 
cytomegalovirus) promoter-intron located upstream from the bGH (bovine growth 
hormone) polyadenylation sequence and a multiple cloning site. 

The forward primer is determined by routine procedures and preferably 
contains a 5 ! extension which introduces anXbal restriction site for cloning, followed 
by nucleotides which correspond to a nucleotide sequence of SEQ ID NO: 1 . The 
reverse primer is also determined by routine procedures and preferably contains 5'- 
extension of nucleotides which introduces a Sail cloning site followed by nucleotides 
which correspond to the reverse complement of a nucleotide sequence of SEQ ID 
NO:l. 

The PCR consists of an initial denaturation step of 5 min at 95 °C, 30 cycles of 
30 sec denaturation at 95°C, 30 sec annealing at 58°C and 30 sec extension at 72°C, 
followed by 5 min extension at 72°C. The PCR product is gel purified and ligated into 
the Xbal and Sail sites of vector p3-CI. This construct is transformed into E. coli cells 
for amplification and DNA purification. The DNA is purified with Qiagen 
chromatography columns and transfected into COS 7 cells using Lipofectamine™ 
reagent from BRL, following the manufacturer's protocols. Forty-eight and 72 hours 
after transfection, the media and the cells are tested for recombinant protein 
expression. 

NgR expressed from a COS cell culture can be purified by concentrating the 
cell-growth media to about 10 mg of protein/ml, and purifying the protein by, for 

BNSDOCID: <WO Q22905SA3_IA> 



10 



15 



20 



25 



30 



WO 02/029059 



PCTYUS01/31488 



- 120- 

example, chromatography. Purified NgR is concentrated to 0.5 mg/ml in an Amicon 
concentrator fitted with a YM-10 membrane and stored at -80°C. NgR3 may also be 
expressed using this method and the nucleotide sequence of SEQ ID NO:3 or SEQ ID 
- NO:13. 

5 

E. Expression of NgR in Insect Cells 

For expression of NgR in a baculovirus system, a polynucleotide molecule 
having a nucleotide sequence of SEQ ID NO : 1, 3 or 13 can be amplified by PCR. The 
forward primer is determined by routine procedures and preferably contains a 5* 

10 extension which adds the Ndel cloning site, followed by nucleotides which correspond 
to a nucleotide sequence of SEQ ID NO: 1 (or SEQ ID NO:3 or SEQ ID NO: 13, 
respectively). The reverse primer is also determined by routine procedures and 
preferably contains a 5 1 extension which introduces the Kpnl cloning site, followed by 
nucleotides which correspond to the reverse complement of a nucleotide sequence of 

1 5 SEQ ID NO: 1 (or SEQ ID NO:3 or SEQ ID NO: 13, respectively). 

The PCR product is gel purified, digested with Ndel and Kpnl, and cloned into 
the corresponding sites of vector pACHTL-A (Pharmingen, San Diego, CA). The 
pAcHTL expression vector contains the strong polyhedrin promoter of the 
Autographa californica nuclear polyhedrosis virus (AcMNPV), and a 6XHis tag 

20 upstream from the multiple cloning site. A protein kinase site for phosphorylation and 
a thrombin site for excision of the recombinant protein precede the multiple cloning 
site is also present. Of course, many other baculovirus vectors could be used in place 
of pAcHTL-A, such as pAc373, pVL941 and pAcIMl. Other suitable vectors for the 
expression of NgR polypeptides can be used, provided that the vector construct 

25 includes appropriately located signals for transcription, translation, and trafficking, 

such as an in-frame AUG and a signal peptide, as required. Such vectors are described 
inLuckowe*<z/., Virology 170:31-39, among others. 

The virus is grown and isolated using standard baculovirus expression methods, 
such as those described in Summers et al (1987) A MANUAL OF METHODS FOR 

30 Baculovirus Vectors and Insect Cell Culture Procedures, Texas Agricultural 
Experimental Station Bulletin No. 1555. 
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In a preferred embodiment, pAcHLT-A containing NgR gene is introduced into 
baculovirus using the "BaculoGold™" transfection kit (Pharmingen, San Diego, CA) 
using methods established by the manufacturer. Individual virus isolates are analyzed 
for protein production by radiolabeling infected cells with 35 S-methionine at 24 hours 
5 post infection. Infected cells are harvested at 48 hours post infection, and the labeled 
proteins are visualized by SDS-PAGE. Viruses exhibiting high expression levels can 
be isolated and used for scaled up expression. 

For expression of a NgR polypeptide in a Sf9 cells, a polynucleotide molecule 
having the nucleotide sequence of SEQ ID NO: 1 (or SEQ ID NO:3 or SEQ ID 

10 NO: 1 3) can be amplified by PCR using the primers and methods described above for 
baculovirus expression. The NgR cDNA is cloned into vector pAcHLT-A 
(Pharmingen) for expression in Sf9 insect. The insert is cloned into the Ndel and Kpnl 
sites, after elimination of an internal Ndel site (using the same primers described above 
for expression in baculovirus). DNA is purified with Qiagen chromatography columns 

1 5 and expressed in Sf9 cells. Preliminary Western blot experiments from non-purified 
plaques are tested for the presence of the recombinant protein of the expected size 
which reacted with the NgR-specific antibody. These results are confirmed after 
further purification and expression optimization in EGG5 cells. 

20 F - Expression of soluble forms of NgR2 and NgR3 as NgR-Ig fusion 

proteins. 

To generate a NgR2-Ig fusion protein, standard methods may be used as 
described in the literature (e.g. Sanicola et al. (1997) Proc. Natl. Acad. Sci. USA. 94, 
6238-6243). For example, a DNA fragment encoding NgR2 without the sequence 

25 encoding the hydrophobic C-terminus (GPI anchor signal) may be ligated to a DNA 
fragment encoding the Fc domain of IgGl (which may be human IgGl), and the 
chimeric fragment may be cloned into an expression vector to generate a plasmid. The 
plasmid may then be transfected into Chinese hamster ovary cells to generate a stable 
cell line producing the fusion protein. The fusion protein is then purified from 

30 conditioned media using standard methods. For example, clarified conditioned media 
from the cell line may be loaded by gravity directly onto Protein A Sepharose. The 
column may then be washed with five column volumes each of PBS, PBS containing 
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0.5 M NaCl, and 25 mM sodium phosphate, 100 mM NaCl (pH 5.0). The bound 
protein may then be eluted with 25 mM NaH 2 P0 4 , 100 mM NaCl (pH 2.8) and 
immediately neutralized with 1/10 fraction volume of 0.5 M Na2HP0 4 (pH 8.6). 
Similar methods may be used to generate a NgR3-Ig fusion protein. 

5 

Example 12: Interaction Trap/Two-Hybrid System 

In order to assay for NgR-interacting proteins, the interaction trap/two-hybrid 
library screening method can be used. This assay was first described in Fields et al 
(1989) Nature 340, 245, which is incorporated herein by reference in its entirety. A 

1 0 protocol is published in CURRENT PROTOCOLS IN MOLECULAR BIOLOGY 1 999, John 

Wiley & Sons, NY and Ausubel, F. M. etal 1992, SHORT PROTOCOLS IN MOLECULAR 
BIOLOGY, fourth edition, Greene and Wiley-interscience, NY, which is incorporated 
herein by reference in its entirety. Kits are available from Clontech, Palo Alto, CA 
(Matchmaker Two-Hybrid System 3). 

15 A fusion of the nucleotide sequences encoding all or partial NgR and the yeast 

transcription factor GAL4 DNA-binding domain (DNA-BD) is constructed in an 
appropriate plasmid (i.e., pGBKT7) using standard subcloning techniques. Similarly, a 
GAL4 active domain (AD) fusion library is constructed in a second plasmid (i.e., 
pGADT7) from cDNA of potential NgR-binding proteins (for protocols on forming 

20 cDNA libraries, see Sambrook et al 1989, MOLECULAR CLONING: ALABORATORY 

MANUAL, second edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY), which is incorporated herein by reference in its entirety. The DNA-BD/NgR 
fusion construct is verified by sequencing, and tested for autonomous reporter gene 
activation and cell toxicity, both of which would prevent a successful two-hybrid 

25 analysis. Similar controls are performed with the AD/library fusion construct to ensure 
expression in host cells and lack of transcriptional activity. Yeast cells are transformed 
(ca. 105 transformants/mg DNA) with both the NgR and library fusion plasmids 
according to standard procedure (Ausubel, et al, 1992, SHORT PROTOCOLS IN 
Molecular Biology, fourth edition, Greene and Wiley-interscience, NY, which is 

30 incorporated herein by reference in its entirety). In vivo binding of DNA-BD/NgR 
with AD/library proteins results in transcription of specific yeast plasmid reporter 
genes (i.e., lacZ t HIS3, ADE2, LEU2). Yeast cells are plated on nutrient-deficient 
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media to screen for expression of reporter genes. Colonies are dually assayed for 
P-galactosidase activity upon growth in Xgal 

(5-bromo-4-chloro-3-indolyl-b-D-galactoside) supplemented media (filter assay for 
b-galactosidase activity is described in Breeden et al. y (1985) Cold Spring Harb. Symp. 
5 Quant. BioL, 50, 643, which is incorporated herein by reference in its entirety). 

Positive AD-library plasmids are rescued from transformants and reintroduced into the 
original yeast strain as well as other strains containing unrelated DNA-BD fusion 
proteins to confirm specific NgR/library protein interactions. Insert DNA is sequenced 
to verify the presence of an open reading frame fused to GAL4 AD and to determine 
1 0 the identity of the NgR-binding protein. 

Example 13: Antibodies to Nogo-R 

Standard techniques are employed to generate polyclonal or monoclonal 
antibodies to the NgR receptor, and to generate useful antigen-binding fragments 

15 thereof or variants thereof including "humanized" variants. Such protocols can be 
found, for example, in Sambrook et ah (1989), above, and Harlow et al (Eds.), 
Antibodies A Laboratory manual; Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY (1988). In one embodiment, recombinant NgR polypeptides (or 
cells or cell membranes containing such polypeptides) are used as antigen to generate 

20 the antibodies. In another embodiment, one or more peptides having amino acid 

sequences corresponding to an immunogenic portion of NgR (e.g., 6, 7, 8, 9, 10, 11, 
12, 13, 14, 15, 16, 17, 18, 19, 20, or more amino acids) are used as antigen. Peptides 
corresponding to extracellular portions of Nogo-R, especially hydrophilic extracellular 
portions, are preferred. The antigen may be mixed with an adjuvant or linked to a 

25 hapten to increase antibody production. 

A. Polyclonal or Monoclonal antibodies 

As one exemplary protocol, recombinant NgR or a synthetic fragment thereof 
is used to immunize a mouse for generation of monoclonal antibodies (or larger 
30 mammal, such as a rabbit, for polyclonal antibodies). To increase antigenicity, 

peptides are conjugated to Keyhole Limpet Hemocyanin (Pierce), according to the 
manufacturer's recommendations. For an initial injection, the antigen is emulsified with 
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Freund f s Complete Adjuvant and injected subcutaneously. At intervals of two to three 
weeks, additional aliquots of NgR antigen are emulsified with Freund's Incomplete 
Adjuvant and injected subcutaneously. Prior to the final booster injection, a serum 
sampler is taken from the immunized mice and assayed by western blot to confirm the 
5 presence of antibodies that immunoreact with NgR Serum from the immunized 

animals may be used as polyclonal antisera or used to isolate polyclonal antibodies that 
recognize NgR. Alternatively, the mice are sacrificed and their spleen removed for 
generation of monoclonal antibodies. 

To generate monoclonal antibodies, the spleens are placed in 10 ml serum-free 

10 RPMI 1640, and single cell suspensions are formed by grinding the spleens in serum- 
free RPMI 1640, supplemented with 2 mM L-glutamine, 1 mM sodium pyruvate, 100 
units/ml penicillin, and 100 |ig/ml streptomycin (RPMI) (Gibco, Canada). The cell 
suspensions are filtered and washed by centrifugation and resuspended in serum-free 
RPMI. Thymocytes taken from three naive Balb/c mice are prepared in a similar 

15 manner and used as a Feeder Layer. NS-1 myeloma cells, kept in log phase in RPMI 
with 10% fetal bovine serum (FBS) (Hyclone Laboratories, Inc., Logan, UT) for three 
days prior to fusion, are centrifiiged and washed as well. 

To produce hybridoma fusions, spleen cells from the immunized mice are 
combined with NS-1 cells and centrifiiged, and the supernatant is aspirated. The cell 

20 pellet is dislodged by tapping the tube, and 2 ml of 37°C PEG 1500 (50% in 75 mM 
HEPES, pH 8.0) (Boehringer-Mannheim) is stirred into the pellet, followed by the 
addition of serum-free RPMI. Thereafter, the cells are centrifiiged, resuspended in 
RPMI containing 15% FBS, 100 pM sodium hypoxanthine, 0.4 \iM aminopterin, 16 
jxM thymidine (HAT) (Gibco), 25 units/ml IL-6 (Boehringer-Mannheim) and 1.5 x 10 6 

25 thymocytes/ml, and plated into 10 Corning flat-bottom 96-well tissue culture plates 
(Corning, Corning, NY). 

On days 2, 4, and 6 after the fusion, 100 |xl of medium is removed from the 
wells of the fusion plates and replaced with fresh medium. On day 8, the fusions are 
screened by ELIS A, testing for the presence of mouse IgG that binds to NgR. 

30 Selected fusion wells are further cloned by dilution until monoclonal cultures 
producing anti-NgR antibodies are obtained. 
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B. Humanization of anti-NgR monoclonal antibodies 

The expression pattern of NgR as reported herein and the potential of NgRs as 
targets for therapeutic intervention suggest therapeutic indications for NgR inhibitors 
(antagonists). NgR-neutralizing antibodies comprise one class of therapeutics useful as 
5 NgR antagonists. Following are protocols to improve the utility of anti-NgR 

monoclonal antibodies as therapeutics in humans by "humanizing" the monoclonal 
antibodies to improve their serum half-life and render them less immunogenic in human 
hosts {i.e., to prevent human antibody response to non-human anti-NgR antibodies). 
The principles of humanization have been described in the literature and are 
10 facilitated by the modular arrangement of antibody proteins. To minimize the 
possibility of binding complement, a humanized antibody of the IgG4 isotype is 
preferred. 

For example, a level of humanization is achieved by generating chimeric 
antibodies comprising the variable domains of non-human antibody proteins of interest 

1 5 with the constant domains of human antibody molecules. (See, e.g., Morrison et al, 
(1989) Adv. Immunol, 44, 65-92). The variable domains of NgR-neutralizing anti- 
NgR antibodies are cloned from the genomic DNA of a B-cell hybridoma or from 
cDNA generated from mRNA isolated from the hybridoma of interest. The V region 
gene fragments are linked to exons encoding human antibody constant domains, and 

20 the resultant construct is expressed in suitable mammalian host cells (e.g., myeloma or 
CHO cells). 

To achieve an even greater level of humanization, only those portions of the 
variable region gene fragments that encode antigen-binding complementarity 
determining regions ("CDR") of the non-human monoclonal antibody genes are cloned 

25 into human antibody sequences. (See, e.g., Jones et al, (1986) Nature 321, 522-525; 
Riechmann et al, (1988) Nature 332, 323-327; Verhoeyen et al, (1988) Science 239, 
1534-1536 ; and Tempest etal, (1991) Bio/Technology 9, 266-271). If necessary, the 
B-sheet framework of the human antibody surrounding the CDR3 regions also is 
modified to more closely mirror the three dimensional structure of the antigen-binding 

30 domain of the original monoclonal antibody. (See Kettleborough et al, (1991) Protein 
Engin. 4, 773-783; and Foote et al, (1992) 7. Mol Biol 224, 487-499). 
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In an alternative approach, the surface of a non-human monoclonal antibody of 
interest is humanized by altering selected surface residues of the non-human antibody, 
e.g., by site-directed mutagenesis, while retaining all of the interior and contacting 
residues of the non-human antibody. See Padlan (1991) Mol Immunol 28, 489-498 . 
5 The foregoing approaches are employed using NgR-neutralizing anti-NgR 

monoclonal antibodies and the hybridomas that produce them to generate humanized 
NgR-neutralizing antibodies useful as therapeutics to treat or palliate conditions 
wherein NgR expression or ligand-mediated NgR signaling is detrimental. 

10 C. Human NgR-Neutralizing Antibodies from Phage Display 

Human NgR-neutralizing antibodies are generated by phage display techniques 
such as those described in Aujame et al (1997) Hitman Antibodies 8, 155-168; 
Hoogenboom (1997) TIBTECH 15, 62-70; and Rader et al (1997), Curr. Opin. 
Biotechnol 8, 503-508, all of which are incorporated by reference. For example, 

15 antibody variable regions in the form of Fab fragments or linked single chain Fv 

fragments are fused to the amino terminus of filamentous phage minor coat protein 
pDI. Expression of the fusion protein and incorporation thereof into the mature phage 
coat results in phage particles that present an antibody on their surface and contain the 
genetic material encoding the antibody. A phage library comprising such constructs is 

20 expressed in bacteria, and the library is screened for NgR-specific phage-antibodies 
using labeled or immobilized NgR as antigen-probe. 

D. Human NgR-neutralizing antibodies from transgenic mice 

Human NgR-neutralizing antibodies are generated in transgenic mice essentially 
25 as described in Bruggemann et al (1996) Immunol Today 17, 391-397 and 

Bruggemann et al (1997) Curr. Opin, Biotechnol 8, 455-458. Transgenic mice 
carrying human V-gene segments in germline configuration and that express these 
transgenes in their lymphoid tissue are immunized with a NgR composition using 
conventional immunization protocols, hybridomas are generated using B cells from the 
30 immunized mice using conventional protocols and screened to identify hybridomas 
secreting anti-NgR human antibodies (e.g., as described above). 
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Example 14: Assays to Identify Modulators of NgR Activity 

Set forth below are several nonlimiting assays for identifying modulators 
(agonists and antagonists) of NgR activity. Among the modulators that can be 
identified by these assays are natural ligand compounds of the receptor; synthetic 
5 analogs and derivatives of natural ligands; antibodies, antibody fragments, and/or 
antibody-like compounds derived from natural antibodies or from antibody-like 
combinatorial libraries; and/or synthetic compounds identified by high-throughput 
screening of libraries; and the like. All modulators that bind NgR are useful for 
identifying NgR in tissue samples (e.g., for diagnostic purposes, pathological purposes, 
10 and the like). Agonist and antagonist modulators are useful for up-regulating and 
down-regulating NgR activity, respectively, to treat disease states characterized by 
abnormal levels of NgR activity. The assays may be performed using single putative 
modulators, and/or may be performed using a known agonist in combination with 
candidate antagonists (or visa versa). 

15 

A. cAMP Assays 

In one type of assay, levels of cyclic adenosine monophosphate (cAMP) are 
measured in NgR-transfected cells that have been exposed to candidate modulator 
compounds. Protocols for cAMP assays have been described in the literature. (See, 
20 e.g., Sutherland et al, (1968) Circulation 37, 279; Frandsen et al, (1976) Life 

Sciences 18, 529-541; Dooley etal, (1997) J. Pharmacol Exp. Therap. 283, 735-41; 
and George et al, (1997) J. Biomol. Screening 2, 235-40). An exemplary protocol for 

(g) TM 

such an assay, using an Adenylyl Cyclase Activation FlashPlate Assay from NPEN 
Life Science Products, is set forth below. 

25 Briefly, the NgR coding sequence (e.g., a cDNA or intronless genomic DNA) 

is subcloned into a commercial expression vector, such as pzeoSV2 (Invitrogen), and 
transiently transfected into Chinese Hamster Ovary (CHO) cells using known methods, 
such as the transfection protocol provided by Boehringer-Mannheim when supplying 
the FuGENE 6 transfection reagent. Transfected CHO cells are seeded into 96-well 

30 microplates from the FlashPlate® assay kit, which are coated with solid scintillant to 
which antisera to cAMP has been bound. For a control, some wells are seeded with 
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wild type (untransfected) CHO cells. Other wells in the plate receive various amounts 
of a cAMP standard solution for use in creating a standard curve. 

One or more test compounds (i.e., candidate modulators) are added to the cells 
in each well, with water and/or compound-free medium/diluent serving as a control or 

5 controls. After treatment, cAMP is allowed to accumulate in the cells for exactly 15 
minutes at room temperature. The assay is terminated by the addition of lysis buffer 
containing [ 125 I]-labeled cAMP, and the plate is counted using a Packard Topcount™ 
96-well microplate scintillation counter. Unlabeled cAMP from the lysed cells (or 
from standards) and fixed amounts of [ 125 I]-cAMP compete for antibody bound to the 

10 plate. A standard curve is constructed, and cAMP values for the unknowns are 

obtained by interpolation. Changes in intracellular cAMP levels of cells in response to 
exposure to a test compound are indicative of NgR modulating activity. Modulators 
that act as agonists of receptors which couple to the G s subtype of G proteins will 
stimulate production of cAMP, leading to a measurable 3-10 fold increase in cAMP 

1 5 levels. Agonists of receptors which couple to the Gy 0 subtype of G proteins will inhibit 
forskolin-stimulated cAMP production, leading to a measurable decrease in cAMP 
levels of 50-100%. Modulators that act as inverse agonists will reverse these effects at 
receptors that are either constitutively active or activated by known agonists. 

20 B. Aequorin Assays 

In another assay, cells (e.g., CHO cells) are transiently co-transfected with both 
a NgR expression construct and a construct that encodes the photoprotein apoaquorin. 
In the presence of the cofactor coelenterazine, apoaquorin will emit a measurable 
luminescence that is proportional to the amount of intracellular (cytoplasmic) free 

25 calcium. (See generally, Cobbold, et ah "Aequorin measurements of cytoplasmic free 
calcium," In: McCormack J.G. and Cobbold P.H., eds., CELLULAR CALCIUM: A 
Practical APPROACH. Oxford:IRL Press (1991); Stables et aL, (1997) Anal. 
Biochem. 252, 115-26; andHaugland, HANDBOOK OF FLUORESCENT PROBES AND 
Research Chemicals. Sixth edition. Molecular Probes, Eugene, OR (1996)). 

30 In one exemplary assay, NgR is subcloned into the commercial expression 

vector pzeoSV2 (Invitrogen) and transiently co-transfected along with a construct that 
encodes the photoprotein apoaquorin (Molecular Probes, Eugene, OR) into CHO cells 
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using the transfection reagent FuGENE 6 (Boehringer-Mannheim) and the transfection 
protocol provided in the product insert. 

The cells are cultured for 24 hours at 37°C in MEM (Gibco/BRL, 
Gaithersburg, MD) supplemented with 10% fetal bovine sen am, 2 mM glutamine, 10 
5 U/ml penicillin and 10 jig/ml streptomycin, at which time the medium is changed to 
serum-free MEM containing 5 \iM coelenterazine (Molecular Probes, Eugene, OR). 
Culturing is then continued for two additional hours at 37°C. Subsequently, cells are 
detached from the plate using VERSEN (Gibco/BRL), washed, and resuspended at 
200,000 cells/ml in serum-free MEM. 

10 Dilutions of candidate NgR modulator compounds are prepared in serum-free 

MEM and dispensed into wells of an opaque 96-well assay plate at 50 pl/well. Plates 
are then loaded onto an MLX microliter plate luminometer (Dynex Technologies, Inc., 
Chantilly, VA). The instrument is programmed to dispense 50 |xl cell suspensions into 
each well, one well at a time, and immediately read luminescence for 15 seconds. 

15 Dose-response curves for the candidate modulators are constructed using the area 
under the curve for each light signal peak. Data are analyzed with SlideWrite, using 
the equation for a one-site ligand, and EC 50 values are obtained. Changes in 
luminescence caused by the compounds are considered indicative of modulatory 
activity. Modulators that act as agonists at receptors which couple to the G q subtype 

20 of G proteins give an increase in luminescence of up to 100 fold. Modulators that act 
as inverse agonists will reverse this effect at receptors that are either constitutive!/ 
active or activated by known agonists. 

C. Luciferase Reporter Gene Assay 

25 The photoprotein luciferase provides another useful tool for assaying for 

modulators of NgR activity. Cells (e.g., CHO cells or COS 7 cells) are transiently co- 
transfected with both a NgR expression construct (e.g., NgR in pzeoSV2) and a 
reporter construct which includes a gene for the luciferase protein downstream from a 
transcription factor binding site, such as the cAMP-response element (CRE), AP-1, or 

30 NF-kappa B. Expression levels of luciferase reflect the activation status of the 

signaling events. (See generally, George et al. (1997) J. Biomol. Screening 2, 235- 
240; and Stratowa et al. (1995) Cvrr. Opin. Biotechnol. 6, 574-581). Luciferase 
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activity may be quantitatively measured using, e.g., luciferase assay reagents that are 
commercially available from Promega (Madison, WI). 

In one exemplary assay, CHO cells are plated in 24-well culture dishes at a 
density of 100,000 ceils/well one day prior to transfection and cultured at 37°C in 

5 MEM (Gibco/BRL) supplemented with 10% fetal bovine serum, 2 raM glutamine, 10 
U/ml penicillin and 10 p.g/ml streptomycin. Cells are transiently co-transfected with 
both a NgR expression construct and a reporter construct containing the luciferase 
gene. The reporter plasmids CRE-luciferase, AP-1 -luciferase and NF-kappaB- 
luciferase may be purchased from Stratagene (Legally, CA). Transfections are 

10 performed using the FuGENE 6 transfection reagent (Boehringer-Mannheim) 

according to the supplier's instructions. Cells transfected with the reporter construct 
alone are used as a control. Twenty-four hours after transfection, cells are washed 
once with PBS pre-warmed to 37°C. Serum-free MEM is then added to the cells 
either alone (control) or with one or more candidate modulators and the cells are 

15 incubated at 37°C for five hours. Thereafter, cells are washed once with ice-cold PBS 
and lysed by the addition of 100 |xl of lysis buffer per well from the luciferase assay kit 
supplied by Promega. After incubation for 15 minutes at room temperature, 15 |xl of 
the lysate is mixed with 50 jil of substrate solution (Promega) in an opaque-white, 
96-well plate, and the luminescence is read immediately on a Wallace model 1450 

20 MicroBeta scintillation and luminescence counter (Wallace Instruments, Gaithersburg, 
MD). 

Differences in luminescence in the presence versus the absence of a candidate 
modulator compound are indicative of modulatory activity. Receptors that are either 
constitutively active or activated by agonists typically give a 3-20-fold stimulation of 
25 luminescence compared to cells transfected with the reporter gene alone. Modulators 
that act as inverse agonists will reverse this effect. 



D. Intracellular calcium measurement using FLEPR 

Changes in intracellular calcium levels are another recognized indicator of 
30 receptor activity, and such assays can be employed to screen for modulators of NgR 
activity. For example, CHO cells stably transfected with a NgR expression vector are 
plated at a density of 4 x 10 4 cells/well in Packard black-walled, 96-well plates 
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specially designed to discriminate fluorescence signals emanating from the various 
wells on the plate. The cells are incubated for 60 minutes at 37°C in modified 
Dulbecco's PBS (D-PBS) containing 36 mg/L pyruvate and 1 g/L glucose with the 
addition of 1% fetal bovine serum and one of four calcium indicator dyes (Fluo-3™ 
5 AM, Fluo-4™ AM, Calcium Green™-1 AM, or Oregon Green™ 488 BAPTA-1 AM), 
each at a concentration of 4 \iM. Plates are washed once with modified D-PBS 
without 1% fetal bovine serum and incubated for 10 minutes at 37°C to remove 
residual dye from the cellular membrane. In addition, a series of washes with modified 
D-PBS without 1% fetal bovine serum is performed immediately prior to activation of 

10 the calcium response. 

A calcium response is initiated by the addition of one or more candidate 
receptor agonist compounds, calcium ionophore A23 187 (10 pM; positive control), or 
ATP (4 pM; positive control). Fluorescence is measured by Molecular Device's 
FLIPR with an argon laser (excitation at 488 nm). (See, e.g., Kuntzweiler et al 

15 (1998) Drug Dev. Res. 44, 14-20). The F-stop for the detector camera is set at 2.5 and 
the length of exposure is 0.4 milliseconds. Basal fluorescence of cells is measured for 
20 seconds prior to addition of candidate agonist, ATP, or A23187, and the basal 
fluorescence level is subtracted from the response signal. The calcium signal is 
measured for approximately 200 seconds, taking readings every two seconds. Calcium 

20 ionophore A23 1 87 and ATP increase the calcium signal 200% above baseline levels. 
In general, activated NgRs increase the calcium signal at least about 10-15% above 
baseline signal. 

E. [ 35 S]GTPyS Binding Assay 

25 It is also possible to evaluate whether NgR signals through a G 

protein-mediated pathway. Because G protein-coupled receptors signal through 
intracellular G proteins whose activity involves GTP binding and hydrolysis to yield 
bound GDP, measurement of binding of the non-hydrolyzable GTP analog [ 35 S]- 
GTPyS in the presence and absence of candidate modulators provides another assay 

30 for modulator activity. (See, e.g., Kowal et aL 9 (1998) Neuropharmacology 37, 179- 
187.). 
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In one exemplary assay, cells stably transfected with a NgR expression vector 
are grown in 10 cm tissue culture dishes to subconfluence, rinsed once with 5 ml of 
ice-cold Ca 2+ /Mg 2+ -free phosphate-buffered saline, and scraped into 5 ml of the same 
buffer. Cells are pelleted by centrifugation (500 x g, 5 minutes), resuspended in TEE 
5 buffer (25 mM Tris, pH 7.5 , 5 mM EDTA, 5 mM EGTA), and frozen in liquid 

nitrogen. After thawing, the cells are homogenized using a Dounce homogenizer (1 ml 
TEE per plate of cells), and centrifuged at 1,000 x g for 5 minutes to remove nuclei 
and unbroken cells. 

The homogenate supernatant is centrifuged at 20,000 x g for 20 minutes to 

10 isolate the membrane fraction, and the membrane pellet is washed once with TEE and 
resuspended in binding buffer (20 mM HEPES, pH 7.5, 150 mM NaCl, 10 mM MgCl 2 , 
1 mM EDTA). The resuspended membranes can be frozen in liquid nitrogen and 
stored at -70°C until use. 

Aliquots of cell membranes prepared as described above and stored at -70°C 

15 are thawed, homogenized, and diluted into buffer containing 20 mM HEPES, 10 mM 
MgCl 2 , 1 mM EDTA, 120 mM NaCl, 10 pM GDP, and 0.2 mM ascorbate, at a 
concentration of 10-50 |ig/ml. In a final volume of 90 pi, homogenates are incubated 
with varying concentrations of candidate modulator compounds or 100 pM GTP for 
30 minutes at 30°C and then placed on ice. To each sample, 10 \il guanosine 

20 5 f -0-(3[ 35 S]thio) triphosphate (NEN, 1200 Ci/mmol; [ 35 S]-GTPyS), was added to a 

final concentration of 100-200 pM. Samples are incubated at 30°C for an additional 30 
minutes, 1 ml of 10 mM HEPES, pH 7.4, 10 mM MgCl 2 , at 4°C is added and the 
reaction is stopped by filtration. 

Samples are filtered over Whatman GF/B filters and the filters are washed with 

25 20 ml ice-cold 10 mM HEPES, pH 7.4, 10 mM MgCl 2 - Filters are counted by liquid 
scintillation spectroscopy. Nonspecific binding of [ 35 S]-GTPyS is measured in the 
presence of 100 pM GTP and subtracted from the total. Compounds are selected that 
modulate the amount of [ 35 S]-GTPyS binding in the cells, compared to untransfected 
control cells. Activation of receptors by agonists gives up to a five-fold increase in 

30 [ 35 S]-GTPyS binding. This response is blocked by antagonists. 

F. [ 3 H] Arachidonic Acid Release 
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The activation of NgRs may also potentiate arachidonic acid release in cells, 
providing yet another useful assay for modulators of NgR activity. (See, e.g., 
Kanterman etal, (1991) Mol Pharmacol 39,364-369.) For example, CHO cells that 
are stably transfected with a NgR expression vector are plated in 24-well plates at a 
5 density of 15,000 cells/well and grown in MEM medium supplemented with 10% fetal 
bovine serum, 2 mM glutamine, 10 U/ml penicillin and 10 p.g/ml streptomycin for 48 
hours at 37°C before use. Cells of each well are labeled by incubation with 
[ 3 H]-arachidonic acid (Amersham Corp., 210 Ci/mmol) at 0.5 p,Ci/ml in 1 ml MEM 
supplemented with 10 mM HEPES, pH 7.5, and 0.5% fatty-acid-free bovine serum 
10 albumin for 2 hours at 37°C. The cells are then washed twice with 1 ml of the same 
buffer. 

Candidate modulator compounds are added in 1 ml of the same buffer, either 
alone or with 10 pM ATP and the cells are incubated at 37°C for 30 minutes. Buffer 
alone and mock-transfected cells are used as controls. Samples (0.5 ml) from each 
15 well are counted by liquid scintillation spectroscopy. Agonists which activate the 
receptor will lead to potentiation of the ATP-stimulated release of [^-arachidonic 
acid. This potentiation is blocked by antagonists. 

G. Extracellular Acidification Rate 

20 In yet another assay, the effects of candidate modulators of NgR activity are 

assayed by monitoring extracellular changes in pH induced by the test compounds (see, 
e.g., Dunlop et al (1998) J. Pharmacol Toxicol Meth. 40, 47-55). In one 
embodiment, CHO cells transfected with a NgR expression vector are seeded into 12 
mm capsule cups (Molecular Devices Corp.) at 4 x 10 5 cells/cup in MEM 

25 supplemented with 10% fetal bovine serum, 2 mM L-glutamine, 10 U/ml penicillin, and 
10 |ig/ml streptomycin. The cells are incubated in this medium at 37°C in 5% C0 2 for 
24 hours. 

Extracellular acidification rates are measured using a Cytosensor 
microphysiometer (Molecular Devices Corp.). The capsule cups are loaded into the 
30 sensor chambers of the microphysiometer and the chambers are perfused with running 
buffer (bicarbonate-free MEM supplemented with 4 mM L-glutamine, 10 units/ml 
penicillin, 10 ng/ml streptomycin, 26 mM NaCl) at a flow rate of 100 jil/minute. 
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Candidate agonists or other agents are diluted into the running buffer and perfused 
through a second fluid path. During each 60-second pump cycle, the pump is run for 
38 seconds and is off for the remaining 22 seconds. The pH of the running buffer in 
the sensor chamber is recorded during the cycle from 43-58 seconds, and the pump is 
5 re-started at 60 seconds to start the next cycle. The rate of acidification of the running 
buffer during the recording time is calculated by the Cytosoft program. Changes in the 
rate of acidification are calculated by subtracting the baseline value (the average of 4 
rate measurements immediately before addition of a modulator candidate) from the 
highest rate measurement obtained after addition of a modulator candidate. The 
10 selected instrument detects 61 mV/pH unit. Modulators that act as agonists of the 

receptor result in an increase in the rate of extracellular acidification compared to the 
rate in the absence of agonist. This response is blocked by modulators which act as 
antagonists of the receptor. 

15 Example 15: mNgR3 does not bind hNogo-A(1055-1120) 

To functionally test the mouse NgR3 (hereinafter, mNgR3) for its ability to 
bind hNogo-A(1055-l 120), a cDNA expression vector for a myc epitope-tagged 
mNgR3 protein was created. The mouse NgR3 cDNA was amplified by PCR from 
mouse adult brain cDNA, from the signal sequence to the stop codon, and ligated into 

20 the pSecTag2 vector such that the vector encodes a signal sequence followed by a myc 
tag followed by the mature mNgR3 sequence. This plasmid was transfected into 
COS07cells, and expression of a myc-tagged protein of the predicted size was verified 
by immunoblot analysis. Alkaline phosphatase-hNogo-A(1055-1120) binding studies 
and myc immunohistology were conducted as described (Fournier et al., supra). 

25 The cells expressing mNgR3 express the myc-tagged protein but binding to 

AP-hNogo-A(1055-l 120) was not observed under the conditions employed (Fig. 8). 

Example 16: Identification of partial human NgR3 cDNA and protein sequences 
The tblastn program was used to search for the human homolog of mouse 
30 NgH3. The mouse NgR3 protein sequence (SEQ ID NO:4) was used to query a 

proprietary human expressed sequence tag (EST) database from Incyte yielding one 
highly significant hit: Incyte Template ID 190989.1. This sequence (937 nucleotides) 
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contains an open reading frame of 3 12 amino acids in the second reverse frame that 
exhibits 88% identity with residues 66 to 381 of mouse NgR3 (SEQ ID NO:4), 
strongly indicating that it is part of the human NgR3 homolog. 

A query of SEQ ID NO:4 against the public human EST database in Genbank 
5 also produced a hit with a 465-bp EST (Accession number: R35699; Version number: 
R3 5699.1; GI: 792600). There are a number of single nucleotide deletions and 
insertions within this sequence which cause frame shift errors. All of the reliable 
sequence contained in this public EST is present in the Incyte EST (Template ID 
190989.1). 

10 To obtain more nucleotide sequence that would extend the amino acid 

sequence at that carboxy terminal end, the I.M. A.G.E. Consortium clone No. 383 19, 
which corresponds to Genbank accession No. R35699, was purchased from Incyte 
Genomics Inc. and subjected to further DNA sequence analysis. This clone consists of 
a Notl/HinD HI fragment containing the sequence of interest, cloned into the 

1 5 Notl/HinD m sites of the vector Lafinid B A 

(http://image.llnl.govAimage/htm^bs/lafinidBAshtml). The clone was received as an 
agar stab, which was streaked out on LB agar plates containing 50ug/ml ampicillin to 
isolate individual colonies. Six colonies were grown in LB medium with antibiotic, and 
plasmid DNA was prepared using the Promega Wizard Plus Miniprep DNA 

20 Purification System (Promega #A7500). These DNAs were subsequently digested 
with NotI and HinD IH restriction enzymes to confirm that the clones contained an 
insert. The insert of one isolate was sequenced using a combination of vector specific 
and gene specific primers yielding a partial nucleotide sequence of human NgR3 of 
1 176 nucleotides (SEQ ID NO: 13). A translation of this sequence provides a partial 

25 sequence for human NgR3 of 392 amino acids (SEQ ID NO: 14). 

The nucleotide sequence of SEQ ID NO: 13 differs from the Incyte EST 
sequence at three positions. Nucleotide positions 12-13 in SEQ ID NO: 13 are CG, 
whereas the corresponding nucleotides in the Incyte Template ID 190989. 1 are GT 
(i.e., positions 12-13 of the complement of Incyte Template ID 190989.1). In 

30 addition, position 641 in SEQ ID NO: 13 is a C, whereas the corresponding nucleotide 
in the Incyte Template ID 190989.1 sequence is an A (i.e., position 641 of the 
complement of Incyte Template ID 190989. 1). This results in two changes in amino 
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acids when comparing SEQ ID NO: 14 to the ORF encoded by Incyte Template 
190989. 1: SEQ ID NO: 14 contains a valine at position 5, whereas the ORF encoded 
by Incyte Template ID 190989.1 contains a leucine; SEQ ID NO: 14 contains an 
alanine at position 214, whereas the ORF encoded by Incyte Template ID 190989. 1 
5 contains a glutamic acid. 

The nucleotide sequence of SEQ ID NO: 13 differs from the public EST 
(Accession number: R35699; Version number: R35699.1; GI: 792600) sequence at 
two positions (within the first 200 nucleotides of reliable sequence). Nucleotide 
positions 12-13 in SEQ ID NO: 13 are CG, whereas the corresponding nucleotides in 
10 the public EST are GT (i.e., positions 12-13 of the public EST; Accession no: R35699; 
Version no: R35699. 1; GI: 792600) This leads to a single amino acid change when 
comparing SEQ ID NO: 14 to the ORF encoded by the public EST: SEQ ID NO: 14 
contains a valine at position 5, while the ORF encoded by the public EST contains a 
leucine. 

15 A Bestfit analysis of the partial human amino acid sequence with the full-length 

mouse amino acid sequence indicates that the human NgR3 amino acid sequence is 
complete at the carboxy terminal end and that they share 89.54% identity. An 
alignment of all the NgR proteins is shown in Figure 9. Although the human NgR3 
amino acid sequence is missing the first 25 amino acids, it can be determined that the 

20 human NgR3 protein contains the following features in common with the other NgR 
sequences: (1) eight Leucine Rich Repeat (LRR) domains; (2) an LRR carboxy- 
terminal (LRR-CT) domain; (3) a conserved cysteine in the fourth LRR domain; (4) a 
conserved potential glycosylation site in the eighth LRR domain; and (5) a 
hydrophobic carboxyl terminus. 

25 As those skilled in the art will appreciate, numerous changes and modifications 

may be made to the preferred embodiments of the invention without departing from the 
spirit of the invention. It is intended that all such variations fall within the scope of the 
invention. 

The entire disclosure of each publication cited herein is hereby incorporated by 
30 reference. This application claims benefit from United States provisional application 
60/238,361, filed October 6, 2000, which is incorporated by reference herein in its 
entirety. 
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Key for Sequence Listing: 



SEQIDNO:l 


human NgR2 cDNA sequence derived from genomic sequence 




AC013606 


SEQIDNO:2 


human NgR2 amino acid sequence 


SEQIDNO.3 


mouse NgR3 cDNA sequence derived from AC021768 


SEQ ID NO:4 


a mouse NgR3 amino acid sequence 


SEO ID NO 5 


a human NgRl amino acid sequence 


SEO ID NO 6 


a consensus amino acid sequence for NgRs 


SEO ID NO 7 


#1055-1120 amino acid residues of hNogoA (Nogo-66) 


SEQ IDNO:8 


a mature human NgR2 amino acid sequence 


SEO ID NO 9 


a mature mouse NgR3 amino acid sequence 


SEO ID NO 10 


a consensus NgR LLRNT amino acid sequence 


SEO ID NO 11 


a consensus NgR LRRCT domain amino acid sequence 


SEQ ID NO: 12 


a consensus NgR LRR domain amino acid sequence 


SEO ID NO * 13 


a nartial human NeR3 nucleotide seauence 


SEO ID NO 14 


a partial human NgR3 amino acid sequence 


ott/^ ttp^ XTW 1 ^ 
cjJiQ JLD JNO. ID 


a genomic sequence encoumg d numan lNgivZf auqucnuc. 


SEQ ID NO: 16 


a genomic sequence (complementary strand) encoding a mouse 




NgR3 


SEQIDNO:17 


a mouse NgRl amino acid sequence 


SEQ ID NO: 18 


a consensus sequence for the NTLRRCT domain of NgR 


SEQ ID NO: 19 


an consensus NgR LRRCT domain amino acid sequence 



25 



30 
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CLAIMS 

What is claimed is: 

5 

1. An isolated nucleic acid comprising a nucleotide sequence encoding 
a polypeptide comprising an LRRCT domain consisting of the amino acid sequence: 

NX I WX 2 CX 3 CRARX 4 LWX 5 WX 6 X 7 X 8 X 9 RX 10 SSSX 1I V 

10 

X 12 C X 13 X 14 P X 15 X 16 X 17 X Ig X 19 X 20 D L X 21 X22 L X23 X 24 X^ D 
X 26 X 27 X 28 C [SEQ ID NO: 19] 

1 5 wherein X is any amino acid or a gap and the polypeptide does not comprise 

the amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 (human NgRl) or 
SEQ ID NO: 17 (mouse NgRl). 

2. The isolated nucleic acid according to claim 1, wherein X 17 and X^ 
20 are independently selected from the group consisting of: arginine and lysine. 

3. The isolated nucleic acid according to claim 2 5 wherein the amino 
acid sequence of the LRRCT domain is selected from the group consisting of: residues 
#261-3 10 of SEQ ID NO:2 and residues 261-3 10 of SEQ ID NO: 2 with up to 10 

25 conservative amino acid substitutions. 

4. An isolated nucleic acid encoding the polypeptide of SEQ ID NO: 2. 

5. An isolated nucleic acid encoding the polypeptide of SEQ ID NO: 4 
30 (mouse NgR3) or SEQ ID NO: 14 (human NgR3). 

6. The isolated nucleic acid according to claim 1, wherein the 
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polypeptide comprises: (a) aNTLRRCT domain, and (b) less than a complete CTS 
domain, provided that a partial CTS domain, if present, consists of no more than the 
first 39 amino acids of the CTS domain. 

5 7. The isolated nucleic acid to claim 1, wherein the polypeptide does 

not comprise an intact GPI domain. 

8. An isolated nucleic acid consisting essentially of a nucleotide 
sequence complementary to a nucleotide sequence encoding a polypeptide selected 
10 from the group consisting of: a polypeptide consisting of residues 3 1 1-395 of SEQ ID 
NO: 2, a polypeptide consisting of residues 256-396 of SEQ ID NO: 14 and a 
polypeptide consisting of residues 321-438 of SEQ ID NO: 4, wherein the nucleic acid 
is from 8 to 100 nucleotides in length. 

15 9. A vector comprising the nucleic acid of any one of claims 1, 4 or 5. 

10. A host cell comprising a vector according to claim 9. 

11. A polypeptide comprising a LRRCT amino acid sequence: 

20 

NX 1 WX 2 CX 3 CRA-RX 4 LWX 5 WX 6 X 7 X 8 X 9 RX 10 S S SX n V 

X 12 C X 13 X 14 P X 15 X 16 X 17 X 18 X 19 Xjq D L X 21 X22 L X23 X 24 X 25 D 

25 X 26 X^ X 28 C [SEQ ID NO: 19] 

wherein X is any amino acid residue or a gap and the polypeptide does not 
comprise the amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 
(human NgRl) or SEQ ID NO: 17 (mouse NgRl). 

30 

12. The polypeptide according to claim 11, wherein X 17 and X^ is 
selected from the group consisting of arginine and lysine. 
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13. The polypeptide according to claim 1 1, wherein X 19 is glycine. 
[SEQ ID NO: 11] 

14. The polypeptide according to claim 11, wherein the amino acid 

5 sequence is selected from the group consisting of residues 261— 3 10 of SEQ ID NO:2, 
residues 206-255 of SEQ ID NO: 14, residues 271-320 of SEQ ID NO:4 and amino 
acid sequences thereof comprising a conservative substitution. 



15. A polypeptide comprising a NTLRRCT amino acid sequence: 



10 



15 





112 



20 




25 



wherein X is any amino acid residue or a gap and wherein the polypeptide is 
not the polypeptide of SEQ ID NO: 5 (human NgRl) or SEQ ID NO: 17 
(mouse NgRl). 



16. The polypeptide according to claim 15, wherein Xg, X 37 and X, 



^38 



30 



represents a gap. 



17. A polypeptide comprising an amino sequence selected from the 
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group consisting of: SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO: 14. 

18. The polypeptide according any one of claims 11, 15 or 17, wherein 
the polypeptide comprises: (a) an NTLRRCT domain, and (b) less than a complete 

5 CTS domain, provided that a partial CTS domain, if present, consists of no more than 
the first 39 amino acids of the CTS domain. 

19. The polypeptide according to any one of claims 1 1, 15 or 17, 
wherein the polypeptide does not comprise an intact GPI domain. 

10 

20. The polypeptide according to any one of claims 1 1, 15 or 17, 
wherein the amino acid sequence of the polypeptide further comprises an amino acid 
sequence of a heterologous polypeptide. 

15 21. The polypeptide according to claim 20, wherein the heterologous 

polypeptide is an Fc portion of an antibody. 

22. A method of producing a polypeptide according to any one of 
claims 11, 15 or 17, comprising the steps of introducing an isolated nucleic acid 

20 according to any one of claims 1, 4, 5 or 8 or a vector according to claim 9 into a host 
cell, culturing said host cell under conditions suitable for expression of said 
polypeptide, and recovering said polypeptide. 

23. An antibody that binds to a polypeptide of any one of claims 1 1, 15 

25 or 17. 

24. A composition comprising the polypeptide of claim 11, 15 or 17 
and a phannaceutically acceptable carrier. 

30 25. A composition comprising the antibody of claim 23 and a 

phannaceutically acceptable carrier. 
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26. A method of decreasing inhibition of axonal growth of a CNS 
neuron, comprising the step of contacting the neuron with an effective amount of the 
polypeptide of claim 11, 15 or 17. 

5 27. A method of treating a central nervous system disease, disorder or 

injury, comprising administering to a mammal an effective amount of the polypeptide 
of claim 11, 15 or 17. 

28. A method of decreasing inhibition of axonal growth of a CNS 
10 neuron comprising the step of contacting the neuron with an effective amount of the 

antibody according to claim 23. 

29. A method of treating a central nervous system disease, disorder or 
injury, comprising administering to a mammal an effective amount of the antibody 

1 5 according to claim 23 . 

30. A method for identifying a molecule that binds a polypeptide of 
claim 11, 15 or 17 comprising the steps of: 

(a) providing a polypeptide of claim 1 1, 15 or 17; 
20 (b) contacting the polypeptide with the candidate molecule; 

and 

(c) detecting binding of the candidate molecule to the 
polypeptide. 



25 
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FIG. 3 
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FIG . 3 , cont • 
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SEQUENCE LISTING 



<110> BIOGEN, INC. 

YALE UNIVERSITY 
STRITTMATTER, STEPHEN M. 
CATE, RICHARD L. 
SAH, DINAH W.Y. 

<120> NOGO RECEPTOR HOMOLOGS 

<130> A116PCT 

<140> 
<141> 

<150> 60/238,361 
<151> 2000-10-06 

<160> 16 

<170> Patentln Ver. 2.1 

<210> 17 

<211> 1260 

<212> DNA 

<213> Homo sapiens 

<400> 1 

atgctgcccg ggctcaggcg cctgctgcaa gctcccgcct cggcctgcct cctgctgatg 60 
ctcctggccc tgcccctggc ggcccccagc tgccccatgc tctgcacctg ctactcatcc 120 
ccgcccaccg tgagctgcca ggccaacaac ttctcctctg tgccgctgtc cctgccaccc 180 
agcactcagc gactcttcct gcagaacaac. ctcatccgca cgctgcggcc aggcaccttt 240 
gggtccaacc tgctcaccct gtggctcttc tccaacaacc tctccaccat ctacccgggc 300 
actttccgcc acttgcaagc cctggaggag ctggacctcg gtgacaaccg gcacctgcgc 360 
tcgctggagc ccgacacctt ccagggcctg gagcggctgc agtcgctgca tttgtaccgc 420 
tgccagctca gcagcctgcc cggcaacatc ttccgaggcc tggtcagcct gcagtacctc 480 
tacctccagg agaacagcct gctccaccta caggatgact tgttcgcgga cctggccaac 540 
ctgagccacc tcttcctcca cgggaaccgc ctgcggctgc tcacagagca cgtgtttcgc 600 
ggcctgggca gcctggaccg gctgctgctg cacgggaacc ggctgcaggg cgtgcaccgc 660 
gcggccttcc gcggcctcag ccgcctcacc atcctctacc tgttcaacaa cagcctggcc 720 
tcgctgcccg gcgaggcgct cgccgacctg ccctcgctcg agttcctgcg gctcaacgct 780 
aacccctggg cgtgcgactg ccgcgcgcgg' ccgctctggg cctggttcca gcgcgcgcgc 840 
gtgtccagct ccgacgtgac ctgcgccacc cccccggagc gccagggccg agacctgcgc 900 
gcgctccgcg aggccgactt ccaggcgtgt ccgcccgcgg cacccacgcg gccgggcagc 960 
cgcgcccgcg gcaacagctc ctccaaccac ctgtacgggg tggccgaggc cggggcgccc 1020 
ccagccgatc cctccaccct ctaccgagat ctgcctgccg aagactcgcg ggggcgccag 1080 
ggcggggacg cgcctactga ggacgactac tgggggggct acgggggtga ggaccagcga 1140 
ggggagcaga tgtgccccgg cgctgcctgc caggcgcccc cggactcccg aggccctgcg 1200 
ctctcggccg ggctccccag ccctctgctt tgcctcctgc tcctggtgcc ccaccacctc 1260 



<210> 2 
<211> 420 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Leu Pro Gly Leu Arg Arg Leu Leu Gin Ala Pro Ala Ser Ala Cys 
1 5 10 15 

Leu Leu Leu Met Leu Leu Ala Leu Pro Leu Ala Ala Pro Ser Cys Pro 
20 25 30 
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Met Leu Cys Thr Cys Tyr Ser Ser Pro Pro Thr Val Ser Cys Gin Ala 
35 40 45 

Asn Asn Phe Ser Ser Val Pro Leu Ser Leu Pro Pro Ser Thr Gin Arg 
50 55 60 

Leu Phe Leu Gin Asn Asn Leu He Arg Thr Leu Arg Pro Gly Thr Phe 
65 70 75 B0 

Glv Ser Asn Leu Leu Thr Leu Trp Leu Phe Ser Asn Asn Leu Ser Thr 
85 90 95 

He Tyr Pro Gly Thr Phe Arg His Leu Gin Ala Leu Glu Glu Leu Asp 
100 105 HO 

Leu Gly Asp Asn Arg His Leu Arg Ser Leu Glu Pro Asp Thr Phe Gin 
115 120 125 

Gly Leu Glu Arg Leu Gin Ser Leu His Leu Tyr Arg Cys Gin Leu Ser 
130 135 140 

Ser Leu Pro Gly Asn He Phe Arg Gly Leu Val Ser Leu Gin Tyr Leu 
145 ^ 150 155 160 

Tvr Leu Gin Glu Asn Ser Leu Leu His Leu Gin Asp Asp Leu Phe Ala 
165 170 175 

Asp Leu Ala Asn Leu Ser His Leu Phe Leu His Gly Asn Arg Leu Arg 
180 185 190 

Leu Leu Thr Glu His Val Phe Arg Gly Leu Gly Ser Leu Asp Arg Leu 
195 200 205 

Leu Leu His Gly Asn Arg Leu Gin Gly Val His Arg Ala Ala Phe Arg 
210 ~ 215 220 

Gly Leu Ser Arg Leu Thr He Leu Tyr Leu Phe Asn Asn Ser Leu Ala 
225 230 235 240 

Ser Leu Pro Gly Glu Ala Leu Ala Asp Leu Pro Ser Leu Glu Phe Leu 
245 250 255 

Arg Leu Asn Ala Asn Pro Trp Ala Cys Asp Cys Arg Ala Arg Pro Leu 
260 265 270 

Trp Ala Trp Phe Gin Arg Ala Arg Val Ser Ser Ser Asp Val Thr Cys 
275 280 285 

Ala Thr Pro Pro Glu Arg Gin Gly Arg Asp Leu Arg Ala Leu Arg Glu 
290 295 300 

Ala Asp Phe Gin Ala Cys Pro Pro Ala Ala Pro Thr Arg Pro Gly Ser 
305 310 315 320 

Ara Ala Arg Gly Asn Ser Ser Ser Asn His Leu Tyr Gly Val Ala Glu 
325 330 335 

Ala Gly Ala Pro Pro Ala Asp Pro Ser Thr Leu Tyr Arg Asp Leu Pro 
340 345 350 

Ala Glu Asp Ser Arg Gly Arg Gin Gly Gly Asp Ala Pro Thr Glu Asp 
355 360 365 

Asp Tyr Trp Gly Gly Tyr Gly Gly Glu Asp Gin Arg Gly Glu Gin Met 
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370 375 380 

Cys Pro Gly Ala Ala Cys Gin Ala Pro Pro Asp Ser Arg Gly Pro Ala 
385 " 390 395 400 

Leu Ser Ala Gly Leu Pro Ser Pro Leu Leu Cys Leu Leu Leu Leu Val 
405 410 415 

Pro His His Leu 
420 



<210> 3 
<211> 1383 
<212> DNA 
<213> Mus sp. 



<400> 3 

atgtcttggc 

gggtgctgtg 

tgtcctcgag 

tttgctgcca 

cgcatcacct 

tccaacaaca 

ctagaccttg 

gtgaagcttc 

tttggtggcc 

caagatgaca 

ctatggagcc 

catgagaacc 

accctctttc 

gtggccttgg 

tccctgtggg 

cccgagctgc 

acaggaccag 

gcccgcaagg 

catccgcctg 

aaccggaacc 

gcccccgact 

aagggcaagt 

tcaggcacgg 

cgc 



agtctggaac 
tggaattgct 
actgtgtgtg 
tcccggaggg 
tcctccagca 
tcactttcat 
gagacaaccg 
acgccctcta 
tgcacagcct 
tctttgtgga 
tgggccaagg 
agctacagtg 
tcttcaacaa 
agttccttcg 
aatggctgcg 
ggcaaggcca 
tgtctcctca 
agcaccatcc 
gctccaggtc 
agatctctaa 
atcagcacaa 
gtgctcgcag 
cccttggggc 



cacagtgaca 
gctgttgctg 
ctaccctgcg 
catcccagag 
gggccacttc 
tgctcccaac 
acagctgcga 
cctctataag 
gcagtatctc 
cctggtcaat 
catcttccgg 
ggttcaccac 
cagcctcact 
cctcaatggg 
aaggttccgt 
ggatctgaag 
ccagatcaag 
gtcccatggg 
aggttacaag 
ggtgagctct 
gttcagcttt 
gacccccatc 
cccactcctg 



caatctcccg 
ctcgctggag 
cccatgactg 
gacagtgagc 
agccccgcca 
accttcgagg 
acgctggcac 
tgtggactga 
tacttgcagg 
ctcagtcact: 
ggcctggtga 
aaggctttcc 
gagctgcagg 
aatgcttggg 
ggctctagct 
ctgctgaggg 
tctcacacgc 
gcctccaggg 
aaggcaggca 
gggaaagagc 
gacatcatgc 
cgtgccccca 
gcctggatac 



tgcaggctgc 
agctacctct 
tcagctgcca 
gcatcttcct 
tggtcaccct 
gctttgtgca 
ccgagacctt 
gcgccctgcc 
acaaccatat 
tgtttctcca 
acctggaccg 
atgacctcca 
gtgactgtct 
actgtggctg 
ctgctgtccc 
tggaggactt 
ttaccacctc 
acaaaggcca 
agaactgcac 
ttaccgaact 
ccaccgcacg 
gtggggtgca 
tggggctggc 



tcaggtctca 
gggtggtggt 
ggcacacaac 
gcagaacaat 
ctggatctac 
tctggaggag 
ccaaggcctg 
cgcaggcatc 
cgagtacctc 
tggtaacaag 
gttgctgctg 
caggctaacc 
ggcccccctg 
ccgggcacgt 
ctgcgcgacc 
ccggaactgc 
tgacagggct 
cccacatggc 
cagccacagg 
gcaggactat 
acccaagagg 
gcaggcatcc 
agtcactctc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1383 



<210> 4 
<211> 461 
<212> PRT 
<213> Mus sp. 

<400> 4 

Met Ser Trp Gin Ser Gly Thr Thr Val Thr Gin Ser Pro Val Gin Ala 
15 10 15 

Ala Gin Val Ser Gly Cys Cys Val Glu Leu Leu Leu Leu Leu Leu Ala 
20 25 30 

Gly Glu Leu Pro Leu Gly Gly Gly Cys Pro Arg Asp Cys Val Cys Tyr 
35 40 45 

Pro Ala Pro Met Thr Val Ser Cys Gin Ala His Asn Phe Ala Ala He 
50 55 60 

Pro Glu Gly He Pro Glu Asp Ser Glu Arg He Phe Leu Gin Asn Asn 
65 70 75 80 
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Ara He Thr Phe Leu Gin Gin Gly His Phe Ser Pro Ala Met Val Thr 
y 85 90 95 

Leu Trp He Tyr Ser Asn Asn He Thr Phe He Ala Pro Asn Thr Phe 
100 1° 5 110 

Glu Glv Phe Val His Leu Glu Glu Leu Asp Leu Gly Asp Asn Arg Gin 
115 120 125 

Leu Arg Thr Leu Ala Pro Glu Thr Phe Gin Gly Leu Val Lys Leu His 
130 135 140 

Ala Leu Tyr Leu Tyr Lys Cys Gly Leu Ser Ala Leu Pro Ala Gly lie 
145 ^ 150 155 160 

Phe Gly Gly Leu His Ser Leu Gin Tyr Leu Tyr Leu Gin Asp Asn His 

170 1/5 



165 



He Glu Tyr Leu Gin Asp Asp He Phe Val Asp Leu Val Asn Leu Ser 
180 185 190 

His Leu Phe Leu His Gly Asn Lys Leu Trp Ser Leu Gly Gin Gly He 
195 200 205 

Phe Arg Gly Leu Val Asn Leu Asp Arg Leu Leu Leu His Glu Asn Gin 
210 215 220 

Leu Gin Trp Val His His Lys Ala Phe His Asp Leu His Arg Leu Thr 
225 ~ 230 235 240 

Thr Leu Phe Leu Phe Asn Asn Ser Leu Thr Glu Leu Gin Gly Asp Cys 

250 255 



245 



Leu Ala Pro Leu Val Ala Leu Glu Phe Leu Arg Leu Asn Gly Asn Ala 
260 265 270 

Trp Asp Cys Gly Cys Arg Ala Arg Ser Leu Trp Glu Trp Leu Arg Arg 
275 280 285 

Phe Arg Gly Ser Ser Ser Ala Val Pro Cys Ala Thr Pro Glu Leu Arg 
290 295 300 

Gin Gly Gin Asp Leu Lys Leu Leu Arg Val Glu Asp Phe Arg Asn Cys 
305 310 315 320 

Thr Glv Pro Val Ser Pro His Gin He Lys Ser His Thr Leu Thr Thr 
325 330 335 

Ser Asp Arg Ala Ala Arg Lys Glu His His Pro Ser His Gly Ala Ser 
340 345 350 

Arg Asp Lys Gly His Pro His Gly His Pro Pro Gly Ser Arg Ser Gly 
355 360 365 

Tyr Lys Lys Ala Gly Lys Asn Cys Thr Ser His Arg Asn Arg Asn Gin 
370 375 380 

He Ser Lys Val Ser Ser Gly Lys Glu Leu Thr Glu Leu Gin Asp Tyr 
385 390 " ~ 395 400 

Ala Pro Asp Tyr Gin His Lys Phe Ser Phe Asp He Met Pro Thr Ala 
405 410 415 

Arg Pro Lys Arg Lys Gly Lys Cys Ala Arg Arg Thr Pro He Arg Ala 
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420 425 430 

Pro Ser Gly Val Gin Gin Ala Ser Ser Gly Thr Ala Leu Gly Ala Pro 
435 440 445 

Leu Leu Ala Trp lie Leu Gly Leu Ala Val Thr Leu Arg 
450 455 460 



<210> 5 
<211> 473 
<212> PRT 

<213> Homo sapiens 
<400> 5 

Met Lys Arg Ala Ser Ala Gly Gly Ser Arg Leu Leu Ala Trp Val Leu 
15 10 15 

Trp Leu Gin Ala Trp Gin Val Ala Ala Pro Cys Pro Gly Ala Cys Val 
20 ~ 25 30 

Cys Tyr Asn Glu Pro Lys Val Thr Thr Ser Cys Pro Gin Gin Gly Leu 
35 " 40 45 

Gin Ala Val Pro Val Gly lie Pro Ala Ala Ser Gin Arg lie Phe Leu 
50 55 60 

His Gly Asn Arg lie Ser His Val Pro Ala Ala Ser Phe Arg Ala Cys 
65 70 75 80 

Arg Asn Leu Thr lie Leu Trp Leu His Ser Asn Val Leu Ala Arg He 
85 90 95 

Asp Ala Ala Ala Phe Thr Gly Leu Ala Leu Leu Glu Gin Leu Asp Leu 
100 105 HO 

Ser Asp Asn Ala Gin Leu Arg Ser Val Asp Pro Ala Thr Phe His Gly 
115 120 125 

Leu Gly Arg Leu His Thr Leu His Leu Asp Arg Cys Gly Leu Gin Glu 
130 135 140 

Leu Gly Pro Gly Leu Phe Arg Gly Leu Ala Ala Leu Gin Tyr Leu Tyr 
145 150 " 155 160 

Leu Gin Asp Asn Ala Leu Gin Ala Leu Pro Asp Asp Thr Phe Arg Asp 
165 170 175 

Leu Gly Asn Leu Thr His Leu Phe Leu His Gly Asn Arg He Ser Ser 
180 185 190 

Val Pro Glu Arg Ala Phe Arg Gly Leu His Ser Leu Asp Arg Leu Leu 
195 ~ 200 205 

Leu His Gin Asn Arg Val Ala His Val His Pro His Ala Phe Arg Asp 
210 215 220 

Leu Gly Arg Leu Met Thr Leu Tyr Leu Phe Ala Asn Asn Leu Ser Ala 
225 " 230 ~ 235 240 

Leu Pro Thr Glu Ala Leu Ala Pro Leu Arg Ala Leu Gin Tyr Leu Arg 
245 250 255 

Leu Asn Asp Asn Pro Trp Val Cys Asp Cys Arg Ala Arg Pro Leu Trp 
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260 



265 



270 



Ala Trp Leu Gin Lys Phe Arg Gly Ser Ser Ser Glu Val Pro Cys Ser 
275 280 285 

Leu Pro Gin Arg Leu Ala Gly Arg Asp Leu Lys Arg Leu Ala Ala Asn 



Asp Leu Gin Gly Cys Ala Val Ala Thr Gly Pro Tyr His Pro He Trp 
305 " 310 315 320 

Thr Gly Arg Ala Thr Asp Glu Glu Pro Leu Gly Leu Pro Lys Cys Cys 
325 ' 330 335 

Gin Pro Asp Ala Ala Asp Lys Ala Ser Val Leu Glu Pro Gly Arg Pro 
340 345 350 

Ala Ser Ala Gly Asn Ala Leu Lys Gly Arg Val Pro Pro Gly Asp Ser 
355 360 365 

Pro Pro Gly Asn Gly Ser Gly Pro Arg His He Asn Asp Ser Pro Phe 
370 ~ 375 380 

Gly Thr Leu Pro Gly Ser Ala Glu Pro Pro Leu Thr Ala Val Arg Pro 
385 390 395 400 

Glu Gly Ser Glu Pro Pro Gly Phe Pro Thr Ser Gly Pro Arg Arg Arg 
405 410 415 

Pro Gly Cys Ser Arg Lys Asn Arg Thr Arg Ser His Cys Arg Leu Gly 
420 425 430 

Gin Ala Gly Ser Gly Gly Gly Gly Thr Gly Asp Ser Glu Gly Ser Gly 
435 440 445 

Ala Leu Pro Ser Leu Thr Cys Ser Leu Thr Pro Leu Gly Leu Ala Leu 
450 455 460 

Val Leu Trp Thr Val Leu Gly Pro Cys 
465 470 



<210> 6 
<211> 440 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> M0D_RES 

<222> (3) . . (4) 

<223> Variable amino acid 

<220> 

<221> M0D_RES 
<222> (6) 

<223> Variable amino acid 
<220> 

<221> M0D_RES 
<222> (9).. (10) 



290 



295 



300 
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<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (12) . . (13) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (15) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (18) . . (25) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (27) . . (29) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (31).. (34) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (36) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (39).. (40) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (42) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (44) . . (50) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (52) . . (59) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (62).. (63) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (66) . . (69) 

<223> Variable amino acid 

<220> 

<221> MOD RES 
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<222> (71) . . (74) 

<223> Variable amino acid 

<220> 

<221> MODJRES 

<222> (76) . . (80) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (83) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (87) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (90) . . (91) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (94) . . (96) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (98) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (101) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (104) . . (105) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (107) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (109) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (111) . . (112) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (114) 

<223> Variable amino acid 
<220> 
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<221> MOD_RES 

<222> (116) . . (117) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (119) . . (122) 

<223> Variable amino acid 

<220> 

<221> MODJRES 
<222> (124) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (127) . . (128) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (136) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (138) . . (141) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (143) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (146) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (148) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (151) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (154) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (162) • . (170) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (175) . . (176) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 
<222> (184) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (186) . . (189) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (192) . . (193) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (196) . . (197) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (199) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (202) . . (203) 

<223> Variable amino acid 

<220> 

<221> MODJRES 
<222> (205) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (208) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (210) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (212) . . (213) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (215) . . (218) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (221) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (223) • . (224) 

<223> Variable amino acid 
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<22 0> 

<221> MOD_RES 

<222> (226) . • (227) 

<22 3> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (232) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (234) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (236) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (238) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (243) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (246) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (248) . . (251) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (253) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (257) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (259) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (261).. (262) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (264) . . (267) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 
<222> (269) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (272) . . (273) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (275) . . (277) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (279) . . (281) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (283) . . (287) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (289) . . (290) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (292) . • (328) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (330) . . (341) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (344) . . (346) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (348) . . (399) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (401) . . (428) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (431) . . (439) 

<223> Variable amino acid 

<400> 6 

Cys Pro Xaa Xaa Cys Xaa Cys Tyr Xaa Xaa Pro Xaa Xaa Thr Xaa Ser 
15 10 15 
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Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Xaa Xaa Xaa Pro Xaa Xaa 
20 25 30 

Xaa Xaa Arg Xaa Phe Leu Xaa Xaa Asn Xaa lie Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Trp Xaa Xaa Ser 
50 55 60 

Asn Xaa Xaa Xaa Xaa He Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 

Leu Glu Xaa Leu Asp Leu Xaa Asp Asn Xaa Xaa Leu Arg Xaa Xaa Xaa 
85 90 95 

Pro Xaa Thr Phe Xaa Gly Leu Xaa Xaa Leu Xaa Leu Xaa Leu Xaa Xaa 
100 105 HO 

Cys Xaa Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Phe Xaa Gly Leu Xaa Xaa 
115 120 125 

Leu Gin Tyr Leu Tyr Leu Gin Xaa Asn Xaa Xaa Xaa Xaa Leu Xaa Asp 
130 " 135 140 

Asp Xaa Phe Xaa Asp Leu Xaa Asn Leu Xaa His Leu Phe Leu His Gly 
145 150 155 160 

Asn Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Arg Gly Leu Xaa Xaa 
165 17 0 175 

Leu Asp Arg Leu Leu Leu His Xaa Asn Xaa Xaa Xaa Xaa Val His Xaa 
180 185 190 

Xaa Ala Phe Xaa Xaa Leu Xaa Arg Leu Xaa Xaa Leu Xaa Leu Phe Xaa 
195 200 205 

Asn Xaa Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Leu Ala Xaa Leu Xaa Xaa 
210 215 220 

Leu Xaa Xaa Leu Arg Leu Asn Xaa Asn Xaa Trp Xaa Cys Xaa Cys Arg 
225 230 235 240 

Ala Arg Xaa Leu Trp Xaa Trp Xaa Xaa Xaa Xaa Arg Xaa Ser Ser Ser 
245 250 255 

Xaa Val Xaa Cys Xaa Xaa Pro Xaa Xaa Xaa Xaa Gly Xaa Asp Leu Xaa 
260 265 270 

Xaa Leu Xaa Xaa Xaa Asp Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Pro 
275 280 285 

Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
290 295 300 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
305 310 315 320 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
325 330 335 

Xaa Xaa Xaa Xaa Xaa Pro Pro Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Xaa 
340 345 350 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
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355 360 365 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
370 375 380 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg 
385 390 395 400 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
405 410 415 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa 
420 425 430 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu 
435 440 



<210> 7 
<211> 66 
<212> PRT 

<213> Homo sapiens 

<400> 7 , _ 

Arg He Tyr Lys Gly Val He Gin Ala He Gin Lys Ser Asp Glu Gly 
1 5 10 15 

His Pro Phe Arg Ala Tyr Leu Glu Ser Glu Val Ala He Ser Glu Glu 
20 25 30 

Leu Val Gin Lys Tyr Ser Asn Ser Ala Leu Gly His Val Asn Cys Thr 
35 * 40 45 

He Lys Glu Leu Arg Arg Leu Phe Leu Val Asp Asp Leu Val Asp Ser 
50 55 60 

Leu Lys 
65 



<210> 8 
<211> 390 
<212> PRT 

<213> Homo sapeins 
<400> 8 

Cys Pro Met Leu Cys Thr Cys Tyr Ser Ser Pro Pro Thr Val Ser Cys 
I 5 10 15 

Gin Ala Asn Asn Phe Ser Ser Val Pro Leu Ser Leu Pro Pro Ser Thr 
20 25 30 

Gin Arg Leu Phe Leu Gin Asn Asn Leu He Arg Thr Leu Arg Pro Gly 
35 40 45 

Thr Phe Gly Ser Asn Leu Leu Thr Leu Trp Leu Phe Ser Asn Asn Leu 
50 55 60 

Ser Thr He Tyr Pro Gly Thr Phe Arg His Leu Gin Ala Leu Glu Glu 
65 70 75 80 

Leu Asp Leu Gly Asp Asn Arg His Leu Arg Ser Leu Glu Pro Asp Thr 
85 90 95 
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Phe Gin Gly Leu Glu Arg Leu Gin Ser Leu His Leu Tyr Arg Cys Gin 
100 105 110 

Leu Ser Ser Leu Pro Gly Asn lie Phe Arg Gly Leu Val Ser Leu Gin 
115 ~ 120 125 

Tyr Leu Tyr Leu Gin Glu Asn Ser Leu Leu His Leu Gin Asp Asp Leu 
130 ~ 135 140 

Phe Ala Asp Leu Ala Asn Leu Ser His Leu Phe Leu His Gly Asn Arg 
145 150 155 160 

Leu Arg Leu Leu Thr Glu His Val Phe Arg Gly Leu Gly Ser Leu Asp 
165 170 175 

Arg Leu Leu Leu His Gly Asn Arg Leu Gin Gly Val His Arg Ala Ala 
180 "* 185 190 

Phe Arg Gly Leu Ser Arg Leu Thr lie Leu Tyr Leu Phe Asn Asn Ser 
195 200 205 

Leu Ala Ser Leu Pro Gly Glu Ala Leu Ala Asp Leu Pro Ser Leu Glu 
210 " 215 220 

Phe Leu Arg Leu Asn Ala Asn Pro Trp Ala Cys Asp Cys Arg Ala Arg 
225 230 235 240 

Pro Leu Trp Ala Trp Phe Gin Arg Ala Arg Val" Ser Ser Ser Asp Val 
245 250 255 

Thr Cys Ala Thr Pro Pro Glu Arg Gin Gly Arg Asp Leu Arg Ala Leu 
260 265 270 

Arg Glu Ala Asp Phe Gin Ala Cys Pro Pro Ala Ala Pro Thr Arg Pro 
275 280 285 

Gly Ser Arg Ala Arg Gly Asn Ser Ser Ser Asn His Leu Tyr Gly Val 
290 295 300 

Ala Glu Ala Gly Ala Pro Pro Ala Asp Pro Ser Thr Leu Tyr Arg Asp 
305 310 315 320 

Leu Pro Ala Glu Asp Ser Arg Gly Arg Gin Gly Gly Asp Ala Pro Thr 
325 330 335 

Glu Asp Asp Tyr Trp Gly Gly Tyr Gly Gly Glu Asp Gin Arg Gly Glu 
340 345 350 

Gin Met Cys Pro Gly Ala Ala Cys Gin Ala Pro Pro Asp Ser Arg Gly 
355 360 365 

Pro Ala Leu Ser Ala Gly Leu Pro Ser Pro Leu Leu Cys Leu Leu Leu 
370 375 380 

Leu Val Pro His His Leu 
385 390 



<210> 9 
<211> 421 
<212> PRT 
<213> Mus sp. 

<400> 9 
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Cvs Pro Arq Asp Cys Val Cys Tyr Pro Ala Pro Met Thr Val Ser Cys 
1 5 10 15 

Gin Ala His Asn Phe Ala Ala He Pro Glu Gly He Pro Glu Asp Ser 
20 25 30 

Glu Arg He Phe Leu Gin Asn Asn Arg He Thr Phe Leu Gin Gin Gly 
35 40 45 

His Phe Ser Pro Ala Met Val Thr Leu Trp He Tyr Ser Asn Asn He 
50 55 60 

Thr Phe He Ala Pro Asn Thr Phe Glu Gly Phe Val His Leu Glu Glu 
65 70 75 80 

Leu Asp Leu Gly Asp Asn Arg Gin Leu Arg Thr Leu Ala Pro Glu Thr 
85 90 95 

Phe Gin Gly Leu Val Lys Leu His Ala Leu Tyr Leu Tyr Lys Cys Gly 
100 105 HO 

Leu Ser Ala Leu Pro Ala Gly He Phe Gly Gly Leu His Ser Leu Gin 
115 120 125 

Tvr Leu Tyr Leu Gin Asp Asn His He Glu Tyr Leu Gin Asp Asp He 
130 135 140 

Phe Val Asp Leu Val Asn Leu Ser His Leu Phe Leu His Gly Asn Lys 
145 150 155" 160 

Leu Trp Ser Leu Gly Gin Gly He Phe Arg Gly Leu Val Asn Leu Asp 
165 170 175 

Ara Leu Leu Leu His Glu Asn Gin Leu Gin Trp Val His His Lys Ala 
180 185 190 

Phe His Asp Leu His Arg Leu Thr Thr Leu Phe Leu Phe Asn Asn Ser 
195 200 205 

Leu Thr Glu Leu Gin Gly Asp Cys Leu Ala Pro Leu Val Ala Leu Glu 
210 215 220 

Phe Leu Arg Leu Asn Gly Asn Ala Trp Asp Cys Gly Cys Arg Ala Arg 
225 230 235 240 

Ser Leu Trp Glu Trp Leu Arg Arg Phe Arg Gly Ser Ser Ser Ala Val 
245 250 255 

Pro Cys Ala Thr Pro Glu Leu Arg Gin Gly Gin Asp Leu Lys Leu Leu 
260 265 270 

Arg Val Glu Asp Phe Arg Asn Cys Thr Gly Pro Val Ser Pro His Gin 
275 280 285 

He Lys Ser His Thr Leu Thr Thr Ser Asp Arg Ala Ala Arg Lys Glu 
290 295 300 

His His Pro Ser His Gly Ala Ser Arg Asp Lys Gly His Pro His Gly 
305 310 315 320 

His Pro Pro Gly Ser Arg Ser Gly Tyr Lys Lys Ala Gly Lys Asn Cys 
325 330 335 

Thr Ser His Arg Asn Arg Asn Gin He Ser Lys Val Ser Ser Gly Lys 
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340 



345 350 



Glu Leu Thr Glu Leu Gin Asp Tyr Ala Pro Asp Tyx Gin His Lys Phe 
355 360 365 

Ser Phe Asp lie Met Pro Thr Ala Arg Pro Lys Arg Lys Gly Lys Cys 
370 375 380 

Ala Arg Arg Thr Pro lie Arg Ala Pro Ser Gly Val Gin Gin Ala Ser 
385 " ^ 390 395 400 

Ser Gly Thr Ala Leu Gly Ala Pro Leu Leu Ala Trp lie Leu Gly Leu 
405 410 415 

Ala Val Thr Leu Arg 
420 



<210> 10 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 

<222> (3) . . (4) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (6) 

<223> Variable amino acid 
<220> 

<221> M0D_RES 

<222> (9).. (10) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (12).. (13) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (15) 

<223> Variable amino acid 



Cys Pro Xaa Xaa Cys Xaa Cys Tyr Xaa Xaa Pro Xaa Xaa Thr Xaa Ser 
1 5 10 15 

Cys 



<210> 11 
<211> 50 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 
<222> (2) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (4) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (6) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (11) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (14) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (16) . . (19) 

<22 3> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (21) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (25) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (27) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (29) . . (30) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (32) . . (35) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (37) 

<223> Variable amino acid 
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<220> 

<221> M0D_RES 

<222> (40) . . (41) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (43) . . (45) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (47) . . (49) 

<223> Variable amino acid 

<400> 11 

Asn Xaa Trp Xaa Cys Xaa Cys Arg Ala Arg Xaa Leu Trp Xaa Trp Xaa 
15 10 15 

Xaa Xaa Xaa Arg Xaa Ser Ser Ser Xaa Val Xaa Cys Xaa Xaa Pro Xaa 
20 25 30 

Xaa Xaa Xaa Gly Xaa Asp Leu Xaa Xaa Leu Xaa Xaa Xaa Asp Xaa Xaa 
35 "* 40 45 

Xaa Cys 
50 



<210> 12 
<211> 196 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 
<222> (2) 

<223> Variable amino acid 
<220> 

<221> MOD RES 

<222> (5) . . (6) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (8) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (10) • . (16) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (18) . . (25) 

<223> Variable amino acid 

<220> 
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<221> MOD_RES 

<222> (28) . . (29) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (32) . . (35) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (37) . . (40) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (42) . . (46) 

<22 3> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (49) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (53) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (56) . . (57) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (60) . . (62) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (64) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (67) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (70) . . (71) 

<223> Variable amino acid 

<220> 

<221> MODJRES 
<222> (73) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (75) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 

<222> (77).. (78) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (80) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (82).. (83) 

<223> Variable amino acid 

<22 0> 

<221> MOD_RES 

<222> (85).. (88) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (90) 

<223> Variable amino acid 
<220> 

<221> M0D_RES 

<222> (93).. (94) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (102) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (104) . . (107) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (109) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (112) 

<223> Variable- amino acid 
<220> 

<221> MOD_RES 
<222> (114) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (117) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (120) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 

<222> (128) - . (136) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (141) . . (142) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (150) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (152) . . (155) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (158) . . (159) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (162) . . (163) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (165) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (168) . . (169) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (171) 
<223> Variable 



amino acid 



<220> 

<221> MOD_RES 
<222> (174) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (176) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (178) . . (179) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (181) . . (184) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 
<222> (187) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (189) . . (190) 

<223> Variable amino acid 



<220> 

<221> MODRES 

<222> (192) . . (193) 

<223> Variable amino acid 



Arg°Xai 2 phe Leu Xaa Xaa Asn Xaa He Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 * 5 

Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Trp Xaa Xaa Ser Asn Xaa 
20 25 30 

Xaa Xaa Xaa He Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Leu Glu 
35 40 45 

Xaa Leu Asp Leu Xaa Asp Asn Xaa Xaa Leu Arg Xaa Xaa Xaa Pro Xaa 
50 55 60 

Thr Phe Xaa Gly Leu Xaa Xaa Leu Xaa Leu Xaa Leu Xaa Xaa Cys Xaa 
65 70 75 80 

Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Phe Xaa Gly Leu Xaa Xaa Leu Gin 
85 90 95 

Tvr Leu Tvr Leu Gin Xaa Asn Xaa Xaa Xaa Xaa Leu Xaa Asp Asp Xaa 
Y 100 105 HO 

Phe Xaa Asp Leu Xaa Asn Leu Xaa His Leu Phe Leu His Gly Asn Xaa 
115 120 125 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Arg Gly Leu Xaa Xaa Leu Asp 
130 135 140 

Arg Leu Leu Leu His Xaa Asn Xaa Xaa Xaa Xaa Val His Xaa Xaa Ala 
145 150 155 160 

Phe Xaa Xaa Leu Xaa Arg Leu Xaa Xaa Leu Xaa Leu Phe Xaa Asn Xaa 
165 170 175 

Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Leu Ala Xaa Leu Xaa Xaa Leu Xaa 
180 185 190 



Xaa Leu Arg Leu 
195 



<210> 13 

<211> H76 

<212> DNA 

<213> Homo sapiens 



<400> 13 

gagggcatcc ccgrtggacag cgagcgcgtc 
cagcccggcc acttcagccc cgccatggtc 



ttcctgcaga acaaccgcat cggcctcctc 60 
accctgtgga tctactcgaa caacatcacc 120 
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tacatccacc ccagcacctt cgagggcttc 
aaccggcagc tgcggacgct ggcacccgag 
ctctacctct acaagtgtgg gctcagcgcc 
agcctgcagt acctctacct gcaggacaac 
gtggacctgg tcaacctcag ccacctgttt 
ccgggcacct tccggggcct ggtgaacctg 
cagtgggtcc accacaaggc attccacgac 
aacaacagcc tctcggagct gcagggtgag 
ctccgcctca acggcaaccc ctgggactgt 
ctgcagaggt tccggggctc cagctccgct 
ggccaggacc tgaagctgct gagggccgag 
ccgcaccaga tcaagtcaca cacgctcacc 
cactcacccc acggccccac caggagcaag 
aggaagccgg ggaagaactg caccaacccc 
gccgggaaac aggcccccga gctgccagac 
tttgacatca tgcctacggc ccggcccaag 
atccgtgccc ccagcggggt gcagcaggcc 
ctggcctgga cactggggct ggcggtcact 



gtgcacctgg aggagctgga cctcggcgac 180 
accttccagg gcctggtgaa gcttcacgcc 240 
ttgccggccg gcgtctttgg cggcctgcac 300 
cacatcgagt acctccagga cgacatcttc 360 
ctccacggca acaagctgtg gagtctgggc 420 
gaccgtcttt tgctgcacga gaaccagctg 480 
ctccgcaggc tgaccaccct cttcctcttc 54 0 
tgcctggccc cgctgggggc cctggagttc 600 
ggttgtcgcg cgcgctccct gtgggaatgg 660 
gtcccctgtg tgtcccctgg gctgcggcac 720 
gacttccgga actgcacggg accagcgtcc 780 
accaccgaca gggccgcccg caaggaacac 840 
ggccacccgc acggcccccg gcccggccac 900 
aggaaccgca atcagatctc taaggcgggc 960 
tatgccccag actaccagca caagttcagt 1020 
aggaagggca agtgtgcccg caggaccccc 1080 
tcctcggcca gttccctggg ggcctccctc 1140 
ctccgc 1176 



<210> 14 
<211> 392 
<212> PRT 

<213> Homo sapiens 
<400> 14 

Glu Glv He Pro Val Asp Ser Glu Arg Val Phe ■ Leu Gin Asn Asn Arg 
! 5 10 15 

He Gly Leu Leu Gin Pro Gly His Phe Ser Pro Ala Met Val Thr Leu 
20 25 30 

Trp He Tyr Ser Asn Asn lie Thr Tyr He His Pro Ser Thr Phe Glu 
35 40 45 

Gly Phe Val His Leu Glu Glu Leu Asp Leu Gly Asp Asn Arg Gin Leu 
50 55 60 

Arg Thr Leu Ala Pro Glu Thr Phe Gin Gly Leu Val Lys Leu His Ala 
65 70 75 80 

Leu Tyr Leu Tyr Lys Cys Gly Leu Ser Ala Leu Pro Ala Gly Val Phe 
85 90 95 

Gly Gly Leu His Ser Leu Gin Tyr Leu Tyr Leu Gin Asp Asn His He 
100 105 HO 

Glu Tyr Leu Gin Asp Asp He Phe Val Asp Leu Val Asn Leu Ser His 
115 120 125 

Leu Phe Leu His Gly Asn Lys Leu Trp Ser Leu Gly Pro Gly Thr Phe 
130 135 140 

Arg Gly Leu Val Asn Leu Asp Arg Leu Leu Leu His Glu Asn Gin Leu 
145 150 155 160 

Gin Trp Val His His Lys Ala Phe His Asp Leu Arg Arg Leu Thr Thr 
165 170 175 

Leu Phe Leu Phe Asn Asn Ser Leu Ser Glu Leu Gin Gly Glu Cys Leu 
180 185 190 

Ala Pro Leu Gly Ala Leu Glu Phe Leu Arg Leu Asn Gly Asn Pro Trp 
195 200 205 
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Asp Cys Gly Cys Arg Ala Arg Ser Leu Trp Glu Trp Leu Gin Arg Phe 
210 " 215 220 

Ara Gly Ser Ser Ser Ala Val Pro Cys Val Ser Pro Gly Leu Arg His 
225 230 235 240 

Glv Gin Asp Leu Lys Leu Leu Arg Ala Glu Asp Phe Arg Asn Cys Thr 
245 " 250 255 

Gly Pro Ala Ser Pro His Gin He Lys Ser His Thr Leu Thr Thr Thr 
260 265 270 

Asp Arg Ala Ala Arg Lys Glu His His Ser Pro His Gly Pro Thr Arg 
275 280 285 

ser Lys Gly His Pro His Gly Pro Arg Pro Gly His Arg Lys Pro Gly 
290 ** 295 300 

Lys Asn Cys Thr Asn Pro Arg Asn Arg Asn Gin He Ser Lys Ala Gly 
305 310 315 320 

Ala Gly Lys Gin Ala Pro Glu Leu Pro Asp Tyr Ala Pro Asp Tyr Gin 
325 330 335 

His Lys Phe Ser Phe Asp He Met Pro Thr Ala Arg Pro Lys Arg Lys 
340 345 350 

Gly Lys Cys Ala Arg Arg Thr Pro He Arg Ala Pro Ser Gly Val Gin 
355 360 365 

Gin Ala Ser Ser Ala Ser Ser Leu Gly Ala Ser Leu Leu Ala Trp Thr 
370 375 380 

Leu Gly Leu Ala Val Thr Leu Arg 
385 390 



<210> 15 

<211> 143899 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modified_base 

<222> (2044) . . (2144) 

<223> a, t, c f g, other or unknown 

<220> 

<221> modif ied_base 
<222> (6609) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (6625) - . (6724) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (14153) . . (14252) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 
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<222> (19512) . . (19611) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (22595) (22694) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (27825) (27924) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied__base 

<222> (34953) . . (35052) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified__base 

<222> (40783) . . (40882) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (49000) . . (49099) 

<223> a, t, c # g, other or unknown 

<220> 

<221> modified_base 

<222> (62884) . . (62983) 

<223> a, t, c, g f other or unknown 

<220> 

<221> modif ied_base 

<222> (75528) . . (75627) 

<223> a, t, c, g, other or unknown 



<220> 

<221> modified_base 

<222> (87944) (B8043) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (111030) . . (111129) 

<223> a, t, c, g, other or unknown 



<400> 15 

aagcacatac 

ataccatcct 

tacagggaac 

tggacgcagg 

agcccaagtg 

ccctgagcca 

tcaatgccgg 

gaaggtgggc 

gatcaagctg 

tgtgaccacc 

tgccttcatg 

tctggtcaag 

tcatggttga 

ggggcatgct 

gattcagggt 



aggtgacatt 

caaacagctg 

aaagtgactt 

ctgtctgact 

aaaaacagct 

agctctgagc 

gtaaaggagt 

tctgtgtgac 

agtgggctgg 

atgggccaga 

tcaccccagg 

gtgccggcct 

gtatttattc 

cctcatttac 

taccctgctg 



acagaactga 

tatgatgtag 

tccaaaggtc 

ctggagtctg 

gatccactct 

aacagaaaca 

gctggaaacc 

tggttcctct 

ccctggacca 

ggctcagcca 

atgttcgggg 

tgggggatag 

gcacccctct 

agatgggcac 

gattcctggg 



cagttatgcc 
attgggtatt 
atgccattca 
aggtgttaat 
tcatttactt 
gaagctctgc 
gctgggatgg 
acactctggc 
caaggtgaca 
gaaggccccc 
cacctcagaa 
tgatggtggg 
tcagtcctta 
atcaaagctc 
attggggagg 



aggcactgta 
aaccccatta 
aaggagggtg 
gctgcctcct 
ggcatctgtg 
attaggagct 
ccgccgagca 
ctggctgcct 
ggtgacctct 
agcctggagg 
aaccttcctg 
tcctcatatt 
caacacccca 
agctaacgct 
gaggagcttc 



cttagcccct 
ataacaaaag 
aatcttaggt 
ccatgggaac 
ctaagctggt 
tgtgagcatg 
ctaggccgtt 
gcaggaagaa 
tctacaccca 
acggttcgga 
agagtgagtg 
cagtgagcac 
tgatgtaggt 
gggaagttca 
caaaatgggg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
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acaaggtctc 
ttattgcatc 
tgtcttttgc 
ggagggtaca 
agtcaggagc 
gtgtagatca 
cccacaaact 
gtaaagtgca 
gcaacatggc 
cttgagcaag 
atagggttgt 
tgagtgcttc 
ctctgttgcc 
cgggttcaag 
accatgctcg 
ctggtctcga 
acaagtgtga 
ccacagggta 
ccrtgacctc 
aggnnnnnnn 
nnnnnnnnnn 
agacagggtt 
acccacctca 
aggcttattt 
agcctaatac 
gtctgaatgc 
cagtagaaga 
gtttttcagt 
ggactttatt 
attaagggag 
caaaaagcga 
ggaaggaatg 
cctggccggg 
ggatcacgag 
taaaaataca 
gctgaggcac 
ccactgcact 
aaagaaaacc 
aggcagcagc 
gtcatcaaag 
ctaagccaga 
gcctccctgt 
tttcctttct 
aaggtaatat 
gtgatactgt 
caagctacat 
acaatgtcta 
agaaaaagaa 
aaattttatt 
tggctcatgg 
cctacctcag 
tttgcatttt 
aactcaagca 
tgctcctggc 
caggggctgg 
ggttttgctc 
ttctttcttt 
ggtgcgatct 
gcctcccgag 
ttagtagaga 
tccgcccgct 
agtttccctt 
tcagctctac 
ttatttacta 



tgggcctgtc 
aggtgcagcc 
cccctgctgg 
tgaagatccc 
gggagttgag 
cctaaccttg 
gtaaagttag 
aaacggtgtg 
taagggcacg 
tcatttaact 
tgtaagattt 
agagtttttt 
caggctggag 
caattctcct 
gctaattttt 
actcctgacc 
gcccccgtgc 
ttgctgttgc 
ctgggctcaa 
nnnnnnnnnn 
nnnnnnnnnn 
ttcccaatgt 
gcctcccaaa 
tcttgataaa 
atccaggctc 
cgcttccaca 
tgttatcttc 
caagatatat 
tccttgtttc 
ctggtaccgg 
aaagttgacc 
tccctcatca 
cgcggtgggt 
gtcaggagtt 
aaattagctg 
gagaatcact 
ccagcctggg 
cagctctaac 
aagcccagag 
acccaggttc 
ttgttcttgt 
tcatcttcag 
gtctgggatt 
acactgattg 
catgattgtc 
tattctgaac 
tcccatatcc 
agactgtcca 
ttaagacatt 
cagcctcaac 
cctcccaagt 
ttgtagagat 
atatgcctgc 
tcatctgttt 
agatctagca 
ataattcccc 
tttttttttt 
cagctcactg 
tagctgggac 
ctgggtttca 
tcggcctccc 
ttcaatgagg 
tttttcttcc 
ggcttactgt 



gggtagctgg 
aggcccgtga 
tcacaggagg 
aggaccgcca 
gagtttgagt 
ctcttgattt 
ttgtgatgat 
cagtttgagg 
agctggagtt 
tcctgagctg 
tattaaatgg 
tcttttgttt 
tgcagtggca 
gcctcagcct 
gtatctttag 
tcatgatgct 
cctgccaggt 
ctgggctgga 
gtgatcctcc 
nnnnnnnnnn 
nnnnnnnnnn 
tggccaaggc 
gtactgggat 
ttacccagtc 
agcatcagtg 
cacctttggg 
tcctttctct 
ttcagtggct 
tagaagaggg 
gtggtgcata 
tgcgagggtg 
gt'tacaatat 
catgcctata 
caagaccagc 
ggcgtggtgg 
tgaaaccggg 
caacagagcg 
tggcttaaac 
gtaggggact 
tttctgtctc 
cttgattaca 
aagtagaaag 
gcttaggtcc 
tcttaagtca 
ttgggctcat 
agtgggaagt 
ctaggtccag 
tccatttatc 
ctcactctgt 
ctcccaggct 
agctagaact 
ggggtttcgt 
ctttgcctcc 
aataatttat 
gggaccaggc 
ttttcttttc 
gagacagagt 
caacctctgc 
tacaggcgcc 
ccatattggc 
acagtgctgg 
cctccctgac 
ccatagcact 
gtattgtctg 



tttcctcagg 
gcctcctggc 
agctgcagca 
gctccatgat 
cctgctctcc 
ccttacctct 
tgaatgacac 
gtggttaccc 
aggctgacct 
cagtttcctc 
ggtaataaat 
ctttcccccc 
tgatcttggc 
cccaaatagc 
tagagacggg 
cctgcctcag 
tttttttttt 
gtgcggtagt 
tgcctcagcc 
nnnnnnnnnn 
nnnnnnnnnn 
tggtctaaaa 
tacaggcgtg 
tcaggtattt 
gacccaggtg 
acctagggac 
gaccagtagt 
tcactgagcc 
ataacacata 
aggacaggat 
acacttatca 
ttggagagtg 
atcccagcac 
ctgaccaacg 
tgtgtgcctg 
aggtggagtt 
agactccatc 
agtaagaaga 
ccaaggttgg 
ggcacctctg 
agttggctgc 
tggcccttcc 
acccacctga 
gggttcctga 
cagggcagct 
aggaaagaga 
gtcacagtgt 
tacctattca 
cacccagact 
caagtgaccc 
aaaggtgcat 
catgatgccc 
caaaatgctg 
gaaacaacta 
agatctctgc 
tctttcgttt 
ctcactctgt 
ctcctgggtt 
tgccaccatg 
caggctggtc 
gattacaggc 
ctccatactc 
caagacctcc 
tcttcctcta 



gccccttgca 
aggggtcctc 
cctgcctggg 
aagtggaagg 
agttataggc 
taaactagca 
catgggtgtg 
ccagtgccga 
gctgcttcca 
atcagtaaaa 
gtcaagtatg 
cgccccgaga 
tcactgcaac 
tgggactaca 
gtttcaccat 
cccccgaaag 
tttttttttt 
gcaatcatag 
tcctgagtag 
nnnnnnnnnn 
nnnttttgta 
ctcccaacct 
agacaccgtg 
ctctacagcg 
ggagagctta 
tccctctctt 
tggtgatggt 
caagttccct 
ttttctaagg 
agagcaatgg 
aatcagcaca 
caagttatag 
tttgggaggc 
tggtgaaacc 
taatctgagc 
tgcagtgagc 
tcaaacgaaa 
tctattatat 
ttgatccagg 
tctgcagggc 
tgggccagca 
ccagtcatgg 
agccaatgac 
gccagtcttg 
ctgcagatga 
cattttggga 
cttggttgga 
tctggttttt 
ggagtgcagt 
tcccatgctc 
gccaccacgc 
atgctagtct 
ggattgtagg 
ctgggtgctg 
caagtcgttg 
tttttctttt 
tacccaggct 
caagcgattc 
cccggctaat 
ttgaactcct 
gtgagccacg 
tactcctcca 
taacatacta 
ctagaatgta 



acctctgagc 
cacacctggc 
ctgcttctca 
agctccttgg 
tatgtgactt 
ctaaaagcac 
gaagctcttt 
ttctcagagg 
gccctgtgag 
tgtgataagg 
tagcccatag 
tggagcctta 
ctccgcctcc 
ggcgtgcacc 
gttggccagg 
ttttgggatt 
tgtaaaacac 
ttcactgcag 
ctgggaatac 
nnnnnnnnnn 
ttttaagtag 
caggtgatcc 
cccagccagg 
atgcaagaac 
agatgtcaag 
tttctttttt 
ttcagagata 
cgcctctcta 
tggttgtgag 
tgagaccact 
cagtgggagt 
aaaacccagc 
tgaggcaggt 
ccacctctac 
tactcaggag 
cgagatcgca 
aaaaaaaaag 
tatccatctc 
gcttaacgat 
cagcttcatc 
gacgctgcct 
aatgaaagag 
tgtcaccagg 
ggcaaggagt 
gatcaaactc 
gatacaaaac 
catcaaatgt 
gatttttttt 
ggtttgatca 
aagtgatcct 
tcagttaatt 
ggaattcctg 
catgagccac 
agcacggggc 
gtttcttaaa 
ctttctttct 
ggagtgcagt 
tcctgcctca 
ttttgtgttt 
gaccttgtga 
gcgcccagcc 
cctggcccac 
cgtaagttat 
aactccatga 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4 620 
4680 
4740 
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gaatagaaat 
gcatacaaca 
tattgttatg 
tagagtttaa 
ttcagtcatg 
tttatttcct 
agggagctgg 
tgacaaagcg 
tttgggccgg 
cacttgactt 
ctggtcaaca 
tgccttattt 
tgtttctgat 
tgacattttg 
gctgaaacat 
ttatatgttt 
aatagcacat 
tactggttta 
ttctgtcacc 
tgggttcaag 
atgacaccct 
aggctggtct 
ggattacagg 
tatacttata 
ccagaagaca 
cttctagtgt 
taatgtgtgt 
caaaaatttt 
cattcgggga 
caacatggtg 
cggctatagt 
tgaggctgna 
nnrmnnnnnn 
nnnnatattc 
aggctggagt 
gattctcctg 
tatttttagt 
aggtgatcca 
cctggcctcc 
atatatatat 
gatcaatagg 
tgatgagggt 
tttttttttt 
gcatgatctc 
cctcctgagt 
tcagtagaga 
gatccgccca 
cctgcccatc 
gatttgtgtt 
tggtatatct 
tttctattac 
ttgtaaaaat 
gcacaacagt 
ttggtgtcat 
ttccttcttt 
ctccctctct 
ttccttcctt 
tctgtcaccc 
gggttcaagc 
ccatgcccag 
tggtttcgat 
caggtgtgag 
tagttagatc 
tgttgtaaat 



ttttgccttt 
gtcactccac 
agggagtgtg 
gttttctttt 
acgtatttca 
tatttctaga 
tatggggtga 
tcagagacca 
gcacaatgac 
gcagaaggtc 
gagtgagacc 
ggtctattcc 
tggttaagct 
gtcaaggaca 
tcctattgtc 
gtggcaatgc 
acagttatat 
tatatttact 
caggctggag 
tgattctcct 
gctaatatgt 
tgaactactg 
cgtgagccac 
aaaaaaagct 
ttgttatcat 
ggaagtggaa 
gtttgtgtct 
taaatagaaa 
ggccaaggtg 
aaaccccatc 
tccagctaat 
gtgagctgtg 
nnrmnnnnnn 
ttaaaaaaat 
gcaatggcgc 
ccttagcctc 
agagatgggg 
cccccctcgg 
tttacatttt 
ttttaagact 
tgtaaacacc 
tcactttctt 
tttttttttg 
agctcactgc 
agatgggatt 
tggggtttca 
cctcggcctc 
ttttttttgt 
tccctactga 
ttggagaaag 
tgagatgtaa 
tttccttcca 
ttttaatatt 
gtttaaggaa 
ccttccttcc 
ccctccctcc 
ccttccttcc 
aggctggagt 
aattctcctg 
ctaatttttg 
ctcctgacct 
ccaccatgcc 
cttgatccat 
tgcccagaag 



ttatttagtg 
ccacagttgc 
catgttgtta 
ttaagcaatc 
atggcttcac 
acggggataa 
tgcataagga 
gtatttacga 
tcacaactaa 
aaggctgcag 
ctggctcgaa 
atttgaaagt 
taagttttgt 
gaccacatat 
tatggcgtag 
tggtgtaaac 
atagtacata 
atattattta 
tgcagtggcg 
gcctcagtct 
tttgtatttt 
acctcaagtg 
cacgcatggc 
aactgtcaaa 
aggagatgac 
gacagtgata 
ttgcttttaa 
aaagctgccc 
ggtggattgc 
tctacaaaaa 
cgaggggctg 
attgnnnnnn 
nnrmnnnnnn 
ttttttttat 
tatctcagct 
ccaggtacag 
tttcaccatg 
cctcccaaag 
tttaaattta 
agccaagtga 
actgcactgg 
tgtagcctca 
agatggagtc 
aacctccgcc 
acaggcgtcc 
ccatgttggc 
ccaaagtgct 
tatagccatc 
tcaatgatgt 
gtctattcag 
gagttcttta 
ttattgggtt 
gaagtccaat 
ctattgccta 
ttccttcctt 
ctccttccct 
ttccttcctt 
gtagtggcac 
cctcagcctc 
tatttttagt 
cgtgatccac 
cggcctgtgt 
tttgagttga 
tgggtatgct 



tggtgtctgc 
tgaataagtg 
gagaaaaatc 
cacgaattgg 
tgagcccaag 
cacatagttc 
caggatagag 
cggaaatgtg 
taaaactttg 
tgagctgtga 
acgttatttg 
tcctagttct 
tttcctttaa 
acagtggtgg 
tggtcctgat 
acacctactg 
atatctgata 
tctttatttt 
cgatcttggc 
cctgagtagc 
tagtagagat 
atcaccccgc 
ctatttataa 
cagcctcggg 
agctccgtgc 
ttgatgatag 
caagaaagtt 
aggaacaatg 
ttgagctcag 
atacaaaaat 
aggtgggagg 
nnnnnnnnnn 
nnnnnnnnnn 
ttttgagaca 
cagggcaacc 
gcgcccgcca 
ttgtccaggc 
tgctggaatt 
attttaattt 
agcagtggga 
gaccagccta 
acaatatgtg 
tcactctgtt 
tcccaggttc 
accaccacgc 
caggttggtt 
gggattacag 
ctagtggatg 
tgagcatctt 
gtcctttgcc 
tgttctagat 
gtctttcact 
tttctatttt 
atctcaggtc 
ccttctttcc 
tcctccctcc 
ccttcctttg 
gatcttggct 
ctgagtagct 
agagacgggg 
ccgccttggc 
tttcttagag 
ttttgtatat 
tctaaatctg 



agcccctggc 
actaaaggtc 
tgaggcacaa 
ggtagtttca 
ttctttcacc 
ataaggcagt 
cagtggtgag 
gaagcatgat 
ggaggccaag 
ttttgccact 
attggttaca 
ataattttaa 
tacagttaag 
tcccataaga 
gttgtagcgc 
cactgccagt 
atgataatac 
atttttgaga 
tcaccgcaac 
tgggattaca 
ggggtttcac 
ctcggcttcc 
ttattttaga 
caggtccttc 
atattattgt 
gacccagtgt 
taaaaagtta 
gctcacacct 
gagttcaaga 
tagccgggtg 
atcactgggg 
nnnnnnnnnn 
nnnnnnnnnn 
gaatttctct 
tccacctcct 
ccatgctcgg 
tggtcttgaa 
tacaggcgtg 
tttaattttt 
gtggaaaagg 
ttttacattc 
tcgttgccca 
gcctaggctg 
aagtgattct 
ccggctaatt 
tcgaactcct 
gcatgagcca 
taaagttttt 
ttcctgtgct 
cactttaaaa 
ataagtctcc 
ttcttttggt 
tctcttttgc 
acaaagattt 
ctccctccct 
ctccctcctt 
tccttctgac 
cactgcaacc 
gggactacag 
tttcaccaca 
ctcccaaagt 
ttttgtagtt 
agtgtgagat 
gctgttaggg 



ttagtccctg 
cctgccctca 
taaaatttta 
gaggtagttt 
tctctaggac 
tatgagagta 
accgctcaga 
aaagaaatta 
gtgggaggat 
gcactccagc 
gttatacagt 
gtttgttggc 
tgccccataa 
ttataatgga 
aatgcattag 
gatataaaag 
ataactatat 
cagagtctca 
ctccgcttcc 
ggtgtgcacc 
catgttggcc 
caaagtgctg 
gtgtacgcct 
aacagatatt 
ccctgaaaac 
aggcctaggc 
aaataaaata 
gtaatcccac 
ccagcctggg 
tggtggcatg 
gggaggtggt 
nnnnnnnnnn 
nnnnnnnnnn 
cttgttgccc 
gggttcaagc 
ctaatttttg 
atcctgcctc 
agccactgtg 
aatttctcat 
aactggtttt 
ctgttagcag 
tctttttttt 
gaatgcaatg 
tgtgtctcag 
ttttgtattt 
gacctcaagt 
ccgcgcccgg 
ttgtgatttt 
tattggcttt 
ttaggttatc 
tacatatgat 
gtcctttagt 
cacttgtatc 
acacctgtgt 
ctctccctcc • 
ccttccttcc 
ggaatcttgc 
tctgcctcct 
gcacacacca 
ttggccagga 
gctgggattg 
ttagctctta 
atccacctgg 
attactagag 



4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 



28/127 



BNSDOCID: <WO 0229059A3_IA> 



WO 02/029059 



PCT/US01/31488 



gtgaccaaag tgaatttttt 
ccaggctgga gtgcaatggc 
gcagttctcc tgcctcagac 
ggctaatttt tgtattttta 
actcccggcc tcaagtgatc 
gagccaccgt gcccagtcct 
tttgagatag ggtctcactc 
agccttgaac ttctggactc 
acaggcatgt gcttccacac 
cttgcaaagc agggtacaac 
cagccaaaat ccttgaggct 
tcatgctgca tgatcaaaag 
gtcagtcttc ctctgcagtg 
tagtgtgtgg aaagaggcag 
aattttcttt ctatctttct 
gctctgttgc ccaggctgga 
ccaggttcaa gcgattctcc 
caccatgccc ggctaatttg 
gctggtctcg acctcctcac 
attacaggcg tgagccaccg 
atggtaaaat gggactagta 
taagcagata aaatttcaga 
ctgttatttg ctatgtgacc 
caaagtggac aattgtcaac 
catttaacaa aaatatgaag 
ttattatttt agaaataggg 
acagctcaat gcatcttcag 
gtagctggga ccacaggcat 
ttttttgaga cagtcttgct 
actgcaacct ccgcctcctg 
ggattacagg cctgtgccac 
gtttcaccat gttgcccagg 
ggcctcctaa agtgctagga 
ttattttttg tagagacagg 
tcaagtgatc ttcccacctc 
gcctggcctg gaactcatta 
acatcctctg acctacctat 
taggcagaag cagttcaatt 
aaacaattgc taaagttctt 
tacaaatctg tgatacaggt 
agctcagaga tgctaagtaa 
aaagatttag tttcacattg 
ggatgtccct gaccttcagt 
caccagcgag agcccaatcc 
tgtcaccaga ctggagtgca 
ctcaagcaat cctcccacct 
cacctgacta attttaaaat 
gtatcacttt gtcacccaag 
acctcccagg ttcaagtgat 
tgcgccacta tgctcagcta 
ccagggctag tctcaaactc 
ccgggattac aggcataaac 
catgtcagag gcaggaaagg 
acaaagaaaa tgggatcatt 
gagccagcct ctcaggtggt 
ctcagagagt ttctgcagca 
tccaggtgct tcacaaccat 
catttccctt tgccatagct 
taaaaaaaaa tagcttcatt 
ccacccccac acgttataca 
acgtgtatcg ttttttgtta 
ctctgtagcc caggctggag 
tgggttcaag cgattctcct 
actacacccg gctaatgttt 



ctttgtttct 
ttcatcttgg 
tcctgagtag 
gtaaagatgc 
catctgcctc 
tttctcagaa 
tgtcagctag 
acgtgatctt 
ctggctaatt 
atgcctatct 
ctcgggctga 
tgtccgtttt 
tcccaagaac 
acttccaact 
ttcttttctt 
gtgcagtggt 
tgcctcagcc 
tgtaatttta 
ctcaggtgat 
tgcctggcac 
atccatacct 
ctacggatgg 
ttatcttctc 
ctttcttcca 
agctcaacaa 
tcttgttctc 
cctcctggat 
gtaccaccac 
ttgtcgccca 
ggttcaagtg 
catgcccggc 
ctgatctaga 
ttacaggcgt 
ttctcactat 
agcctcccaa 
ttgaagcatt 
aagggtatca 
tatcacaaac 
taggcattta 
attattttta 
cttgtgcaag 
gactccagaa 
ggcccagggc 
taagattttg 
gtggcatgat 
cagcctcctg 
tttatttaat 
ctggactgca 
cctcccacct 
atgtttttat 
ttggactcaa 
caccacaccc 
gaagcatcat 
ttgtgagtgt 
tgagaccaca 
cagctagtgg 
cccaaaccta 
agtgttattt 
ctaaagtgta 
tatacatgta 
ttttttttgt 
tgcagtggtg 
gcctcagcct 
gtatttttag 



tttttttttt 
ctcagtgcaa 
ctggtattac 
agtttcacct 
tacctcccaa 
tttatttgtt 
gcaggagttc 
cccacctcag 
ttttaatttt 
ctacctacct 
ctgtgggtgc 
ctgcttcttg 
acagaattag 
ccgctatgtg 
ttcttttttt 
gcgatctcac 
tcctgagtag 
gtagaaacag 
ccacccgcct 
ttaattttct 
tatagcgctg 
gattgttact 
tggatctcca 
aagagcaatg 
atgaggaact 
ttgcctaggc 
acaagtgatc 
gcacggctaa 
gactggagtg 
attctgctgc 
taattttttt 
acccctggcc 
gagcctctgc 
gttgccaggg 
agtgctgaga 
cactagtatc 
cagctaacgg 
tactctatat 
ctgtttgtta 
ttaaccccat 
gtcacacagc 
cctttctact 
tctcctgggg 
tttgtttgtt 
caatgctcac 
agtagcttgg 
taattactta 
atggtgtggt 
cagcctctgg 
tttttgtata 
gcgatcctcc 
aacccctaag 
ggggttagga 
tcgctgtgtg 
gtccccattt 
aattagaatt 
gtcatttgca 
acttaataat 
atattcttgg 
tgtttctaat 
tgttgttagt 
tgatttcggc 
ctggagtagc 
tagagacagg 



ggagacagag 
cctctgcctt 
aggcgtgtac 
gttggccagg 
agtgctggga 
tttttttgtt 
agtggtgtga 
cctcctgagt 
ctaggactta 
ctcttccctt 
tgttgcctga 
gaactttatt 
accaggaatc 
ctgttgggtg 
ttttttggag 
ctcactgcaa 
ctgggattac 
tgtttcacca 
tggcctccca 
taatacctca 
ttgtgaaaat 
acattctgaa 
ttctttccaa 
atttaaggat 
cattattatt 
tggagtccag 
ctcatgtctc 
ttttttattt 
cagcagcgca 
ctcaacctcc 
gtatttttgg 
tcaagtgatc 
acctggcctc 
ctggtcttga 
ttacagatgt 
aactttgggg 
agcctctgtt 
ccagcataag 
gttagatatt 
tttatagaag 
tagtaaataa 
gggactcatg 
gaatccagcc 
tttgagacaa 
tgcaaccttg 
actacaggtg 
ctattatttt 
ctcagctcat 
agttgcaggg 
gatggggtct 
tgtcttggcc 
gtgtttttgc 
aaggaacact 
tgtatgtgtg 
cccagatgag 
tgaacccggc 
gtttaccttc 
tccttttgaa 
aatatcgggt 
acatatatat 
tttttttaga 
tcactggaac 
tgggattaca 
gtttcaccat 



tctccgtcac 
ctggtttcaa 
caccatgctt 
cttttctgga 
ttacaggtgg 
ttgtttcatt 
tcattgctgc 
agctaggatt 
tttgtccatt 
caagggactc 
tctgcctcag 
cactttgggt 
tgtgttgcca 
attgaagctt 
atggaatctc 
cctccgcctc 
aggtgcatgc 
tattggtcag 
aagtgtcggg 
attaccccat 
gaaatgaggg 
cctggctttg 
gtctataaaa 
caaatgatgt 
attacaatta 
tggtataaac 
atccccctaa 
tttattttta 
atcaccgctc 
caagtagctg 
taaagacggg 
cccctttctt 
ggctaatttt 
actcctgggc 
gagccactgt 
ttacctggcc 
tctcagaatt 
tgcccaaata 
tagtcctcac 
agaaacctga 
agggcagagt 
ggaatagtgt 
atagacaaga 
ggtctcactc 
atctcccagg 
cacaccacca 
ttgagacagg 
tgcgtcctcc 
actgcaggtg 
cactatgttg 
tcccaaagtg 
tgaatgtgac 
gagcagggag 
acaattctca 
ataatggagc 
tcttccagac 
atgattttac 
tcagtctgct 
ttgctgttac 
gtacgtatat 
tggagtctct 
ctctgcctcc 
ggcacccacc 
gttggccagg 
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tgggtcttga 
ttataggtgc 
aataactatg 
acatgctgtg 
aatcactgtc 
atgccaaaga 
ttgggctggc 
aagccttcct 
gaatgaatgt 
gtggaagagc 
tggcccatgt 
cttgagtttg 
ggacccttac 
attaaatggg 
tgggattgtt 
gtctaattca 
ggtaaagcag 
gtatgacctt 
tgcctagatc 
tcacccaggc 
tcacgccatt 
gcccggctaa 
atggtctcga 
acaggcttga 
tggattttta 
ataagggtca 
gtgcagcctc 
acagtgctgt 
ataaacctgt 
nnnnnnnnnn 
nnnnnnnnnn 
gggtagatca 
ctctacaaaa 
cgggaggctg 
atcatgccat 
aaaaaaaaag 
attggttcct 
cctggcttgg 
tccaccacta 
tggccagtcc 
atagtgatca 
catttgtggg 
gggctgcctg 
ctgtactttt 
tgccgtggcg 
acctcagtct 
gtatttttag 
tcaggtgatc 
gcccggtcct 
ggcatagttc 
agctcttggc 
gtgcagcaaa 
atttcaattt 
agtgtcccct 
cagaccctgg 
ggagaggatg 
ttgccagccc 
cctcaactgg 
atcatctgcg 
cattaaactt 
catattaata 
gtagattttt 
ggctgtggca 
agcactgcac 



actcctgatc 
gagctactgc 
aaacctgaaa 
cataccacat 
ccatcctatg 
ccttttctgt 
tctcctttta 
t.atgaatgaa 
atgaattgac 
acatgtcctt 
aacacagagg 
aatctgagcc 
catttataaa 
attctctata 
tgtgagggaa 
gggactgcaa 
ggcaggtgta 
gagaaggcaa 
ttctaacttt 
tggagtgcag 
ctcctgcctc 
tttttttttt 
tctcctgact 
gccaccgcac 
tgtgataact 
acattgccag 
tggtctaaaa 
tgtatagtaa 
agagaggccg 
nnnnnnnnnn 
nnnnnnnnnn 
cctgaggtca 
ataaaaaaat 
aggccggaga 
tgcactccag 
aaaaaagaaa 
gagagtagcc 
gccaatgaat 
gggaggtctc 
tcctacagtc 
aggccgttac 
tcttttggaa 
tccctgctcc 
tttttttttt 
tgatctctgc 
cctgagtagc 
tagagacagg 
cacccgcaca 
accaatctgc 
tgagaagcat 
agagacatct 
ttttgtctag 
ttaagaaatg 
taatttctta 
ggtgacagaa 
atctacagag 
tgaagctatg 
ggccattttt 

ggggtgggg* 

tctacaacgc 
gcacaaagtt 
cacacattaa 
ggaggattac 
tccagcctgg 



tcaagtgatc 
ggctggccaa 
aactgctcca 
attgggaaac 
cttcgcttta 
tgtcattaac 
gttcagaatt 
ctcgttggtc 
actaagtcat 
tctggcactg 
ccctttggtg 
tgcctgtaag 
atggggacca 
gaaaatattg 
attgtcattt 
atggaaatgc 
gaatagcaat 
gccctggcaa 
ttatttttat 
tggcgcgatc 
agcctcccaa 
gtatttttag 
ttgtgattta 
ctggccgatc 
cctgatttta 
cagagcaaaa 
tcatctttgt 
atatccagta 
ggcacagttg 
nnnnnnnnnn 



nnnnnnnnnn 
ggagttcaag 
tatctgggca 
attgcttgaa 
cctaggtgac 
agaaaaagaa 
aaccttattg 
gaggaagaat 
aagcttgaaa 
aagtagataa 
tctgcaatca 
gagcagagaa 
ctgagccatg 
ttttttgaga 
tcacagcagc 
tgggattaca 
gtttcactgt 
tgcctcccaa 
acattttaat 
cccacaagga 
ggcaaattca 
agcttggtca 
ttgcattaaa 
gtgtgtctga 
gactggtggg 
tgctggagag 
ccttgaggtt 
ctccccagag 
gaggggctac 
ccaggacagc 
gaggaacctt 
agatgagaaa 
ttgagcccag 
gtgacagagc 



cacctgcttt 
tgtatgtttt 
tgttacttcc 
actgctttcc 
tttgtctata 
atggacacag 
tggattttta 
aaactcataa 
tagcatttca 
atgtgtgctt 
cagtggaaaa 
aagctggcta 
ttgtactcac 
gcacaaagta 
gattatcaaa 
cagctgaggc 
gtctgggaat 
aaggggatgg 
ttttattttt 
tcggctcact 
gtagctggga 
tagagatggg 
ccctccttgg 
ttctaacttt 
aactggcacc 
catgggtggg 
atttcttctt 
agtgggtgta 
ctcatgtctg' 
nnnnnnnnnn 
nntagtccca 
accagcctgg 
tgattgcagg 
cctgggaggc 
ggagcaagat 
acaatgaatg 
gtcatcccag 
aatggcaaat 
tccagtgacc 
gtggagggtt 
gactcagaaa 
gaggaaggaa 
gagggctggg 
cagagtctcg 
atctgcctcc 
ggtgcccacc 
gttgggcagg 
agtgctggaa 
tgacaagggt 
tgcctctgag 
aggcttgaga 
gttaaaattt 
atgttattta 
gttgagagcc 
agatgggagg 
actctgtatg 
gggcaaggtg 
gacatttgga 
tgacatctgg 
ccctccacaa 
gatctagatc 
gcttgtgcct 
gaattcaggg 
aagattttgt 



ggcttcccaa 
taatacacat 
tgaacccatc 
ctggcttcca 
aatgttgggg 
ctctaagagg 
tccaactact 
aaggctgatc 
cgggaatgga 
gggaaactta 
ctgttgactt 
actgaattgc 
cctttagggt 
ggtgtaaatt 
gacttaggag 
caggcatttg 
taaaagagag 
cctccactca 



tgagacggag 
gcaagctccg 
ctacgggcgc 
gtttcaccgt 
cctcccaaag 
ttaaagagaa 
caattataat 
ggcaactgct 
gctttacgca 
gaatgaataa 
taatctcagc 
nnnnnnnnnn 
gcactttggg 
ccaatatagt 
tgcctctaat 
ggaggttgta 
tctgtctcaa 
agtgtgaggc 
ccacgaagtg 
gggggtctat 
aggtttttag 
tggtccgttg 
tggcctctca 
tttaggtctt 
gtggaatatg 
ctccgtcgcc 
cgggttcaag 
accacgcccg 
ctggtctcaa 
ttacaggcat 
caccctccac 
ttcaccctga 
catgctggcc 
tgatgttttg 
tcttgaatag 
tcccctgcct 
tgaggagggg 
gagcttttca 
gcatatccta 
aacatgtgga 
tgagtagaga 
taaagagtta 
cacagcacag 
gtaatccctg 
ttacagtgaa 
ctcttaaaaa 



agtgctggga 
tcaaataacg 
ttgagtgctc 
agcccagctt 
ttgggggttg 
tcttggcatc 
cagagtgatc 
gataaaacag 
ttctccgtta 
ctgagctaac 
tggagattat 
tttgcttctt 
tattgcatgg 
tgcacgctag 
caggaacagt 
ctaataattg 
gtgaggacgt 
gctacagtca 
tcttgctctg 
cctcccgggt 
ccaccaccat 
gttagccagg 
tgctgggatt 
gcaagacatc 
ttacaacact 
ggtcaccggt 
ttgtcccagc 
accaatgcag 
acnnnnnnnn 
nnnnnnnnnn 
aggccaaggt 
gaaaccccgt 
cccagctact 
gtgagccgag 
aaaaaaaaaa 
tcatggtagt 
aaatggtacc 
gcctccaccc 
gtcctggacc 
ggctacggag 
gttacttctc 
ctcaccctct 
gggaataaat 
caggctggag 
ttattcttcc 
gctaattttt 
atacctgacc 
gagccaccgt 
tcatgtgcca 
caagtccact 
tctctttaaa 
ttttgcatta 
taaatttctt 
gattctagtc 
agtgttggtt 
tgctgcctgt 
gatcagagat 
gacatttttg 
ccagagggac 
tttgacctca 
aagaaaggat 
tgactcagga 
ctatcatcgc 
aaaaaaagat 
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gaggacaggc 
gatcacgagg 
aaaaatacaa 
gggaggtgga 
gcaagactcc 
agagagagag 
cttccacttc 
tgcctctgtg 
gtgttgcaca 
cctcttgttt 
tgggattcca 
ttttaggggg 
gaagcaaacc 
gcagtcccag 
tcctttgtcc 
acccagtata 
gactcttgga 
agaagcagca 
agcagacctg 
cggagactgg 
tagactctcc 
taatcaaaca 
acaaagacag 
caatgcctct 
aagtctctcc 
ccccccgctt 
cagctcatct 
tttgtttgat 
tgttttgaga 
tcaccacaaa 
tgggactaca 
gtttcaccat 
cctcccaaag 
ctttcagaag 
ctcacacgtg 
ttcttgtcca 
ctccagtctt 
ccacggtgct 
aatgatgccc 
agttcaaagg 
gaggaggfcca 
gacccagctg 
cagggaagtc 
gcagaagctc 
cattcagtct 
agaatcacct 
atttaggagg 
atgatattgt 
gcctgtgtta 
cctttcaaag 
agttgtggtg 
gtaataaact 
attgtcctga 
ttttaacctg 
agagggaatt 
nnnnnnnnnn 
gttagcacat 
tggaggcaat 
tcaggaaagg 
gatgaggctt 
agtttggatt 
tgacaatgac 
cagaagatgg 
ggggacgatg 



acagtggctc 
tcaggagttc 
aaaattagcc 
gcttgcagtg 
gtctcaaaaa 
gaaagaaaga 
ccatggaacc 
aagtaggggt 
ctatgcccct 
cttcccccca 
ccagcaggag 
cataaataat 
agggtgtggg 
cagagagcag 
cccactctgt 
aaggctgctt 
aggtgagttc 
cctcataggg 
tccccgcagg 
aagacaaatg 
agctcagggt 
attaaacaaa 
ccccaagaag 
tgctcctgcc 
cttccctcct 
ttatttgcac 
ccggtaagat 
ttttttttgt 
cggagtctcg 
ctccgcctcc 
ggcacatgcc 
atttgccagg 
tgctgggatt 
gaaaaaccac 
ctttctccca 
tttgtgggtg 
gtcttcaggt 
ggaactagga 
cccatctgga 
agcaggagag 
aggcttctgc 
ccttagacaa 
caaggtgcaa 
aggtaagfcag 
tctgttccct 
ggagttgttg 
gctgaggtga 
tggtctgggg 
aattctggtt 
ctttagcacc 
agcattaaat 
ctcaataagc 
cagctaatta 
aaaagggatg 
cnnnnnnnnn 
nnnnnnnnnn 
agccagaaca 
ggcacttgca 
agagcttcag 
ggaggctggg 
ttctcctact 
acgtttgcgt 
aagataaagc 
tgagtgacag 



atgcctgtaa 
aagaccagcc 
agctacttgg 
agccaaaatc 
gaaaaaaaaa 
gagaaggttt 
ttggcaagtt 
ggtacctggt 
aggatgggtg 
ttgtctccca 
gctactcctg 
aattatgcct 
gtacccaaac 
ggcacagggt 
cctggagcaa 
ctggcccctg 
ttttctggct 
caaacacgta 
ggcaggagtc 
aaatttggta 
ctgttcatgg 
tattccctct 
gacggtggag 
cttrtctcctg 
catgcgtctt 
tcatgaggcc 
atcacttgga 
tgttgttgtt 
ctctgttgcc 
cgggttcaag 
accacgcctg 
ctggtctcaa 
acaggcgtga 
ccacctgtta 
ccttcaaaac 
tgacccgaga 
ggaattccta 
agctcttgac 
gagcctagag 
agacttggct 
ctgtcaagac 
ggacttccag 
gatctgccgc 
tagggaggct 
attcatttag 
ttaaaacaca 
agcctgagaa 
agaaccaccg 
ctgcctttta 
ttcatttgta 
gagataatac 
ggtggttata 
agaagcaaaa 
ccgggctctt 
nnnnnnnnnn 
nnnnnnnnnn 
tctagaaggg 
aaggctttgt 
cttcagtgtg 
cagattttgc 
aaggacaaca 
tttggtggct 
actaaagggt 
taaatcattt 



tcccagcatt 
tggccagcat 
gaggctgagg 
ttgccattgc 
aaagatgaga 
tggagtcaaa 
gccttctctc 
ggagatgatg 
agagcttggg 
ccagctccct 
gagttaaggc 
ggacaatgga 
aaggcagtag 
tgagaagact 
tcattctggc 
aagccagagg 
gcccaggcag 
ggaggcctgt 
tgggctaagg 
cctgagttat 
caagaggaga 
atactatgtg 
tagtgtcctc 
ctgggaacag 
tccctttttt 
aggaccacag 
ctcagaactg 
tttgtttttt 
caggctggag 
caattcttct 
gctaatcttt 
actcctaacc 
gccaccgcac 
agatgaaggg 
acacactcgt 
gatctctcct 
ggaaagctcg 
ctgatggcag 
acacaggcag 
ctgacggagg 
aactttgagg 
tgccccaggg 
tacctattgg 
actgcggagg 
gctagtggtt 
gctttctggg 
tttgcattta 
atttaaacaa 
ttagctgtgt 
aaacgaagat 
atgaaaagtg 
attattatga 
gataggatta 
gcctggagac 
nnnnnnnnnn 
nnnnnnnnnn 
gtggtaggag 
tgaagtggcg 
gctggagtgc 
tccaaaagag 
gcaaactatt 
cactcgagct 
gatgaggcag 
gttgggttgc 



ttgggaggcc 
agtgaaaccc 
caggagaagc 
actccagcct 
aagaggaagg 
aagacttaga 
tttctctgaa 
cggagatgag 
agcactgaac 
gggatctcca 
gtgttgttca 
cataacatct 
gccccaggag 
gagcaaactt 
ctcttcccac 
cactgagagt 
gaccagtgta 
ccttaggaac 
gtgatactgg 
ccctcccacc 
aagcaatctt 
ccaggggcta 
gctaaaagac 
tttctgctct 
cctttttcct 
ccttccctct 
taacctggaa 
tttttgtttg 
' tgcagtggcg 
gcctcagcct 
gtatttttag 
ttgtgattcg 
ccggcaaact 
ctggtgactg 
ggtgtcggcc 
tacctaacac 
agcgacgttg 
gttacctctt 
acctaggcca 
aggtgattca 
atgaggaagc 
aagaagacat 
tgcggactcc 
acctggggga 
ctcaaagcct 
cctcacctgc 
caacaaatcc 
aaggctttgg 
tacctgggca 
atatagcacc 
tttggaatag 
gtattatcat 
agggaggcaa 
tcagaaactt 
nnnnnnnnnn 
nnnnnnnnnn 
tggggattag 
taagtgtgga 
tgggtgtgaa 
cttggtgaac 
gaagagtfcta 
gccagccagg 
gaagccagtg 
tattgtgtgc 



gaagtgggtg 
catctctact 
gcttgaaccc 
gggcgacaga 
gagagaaaaa 
aattccagtt 
tctcacattt 
ggtgaggggt 
ctccctttcc 
cttcactctc 
gactggggca 
agggccttct 
gcaggtccct 
cattatcagc 
ttccccaaaa 
ggaggtctca 
ggccctggga 
atcatagcta 
agagcagcaa 
attccttttc 
gtttgctctt 
tactagacac 
agtagatatg 
tcatctgggt 
acactcccct 
ttagctgata 
ctttctcttt 
ttttttgttt 
cgatctcggc 
cctgagtagc 
tagagatggg 
cccgccccgg 
gtaacctgaa 
ccccaggctt 
agaagtcagg 
caaggaaatc 
ctggagctgt 
cttcccagag 
ggatctggat 
ggcagaggga 
catggagtcg 
tgttgaagtg 
taaaactttt 
aaagagagta 
cgcatgcatc 
acgacttctg 
ccaggtgatg 
tgttagaaac 
agttgctttg 
aacttcttag 
tcactgggct 
ttcctgtagg 
gtaggtttat 
gaaataaatg 
nnnnnnnnnn 
ngaattctct 
aggttccagc 

gagaggtgaa 
tgtgataagg 
aatcgttcag 
tagacagtgg 
aggagagaaa 
taagctctgt 
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gctaaattct 
caatcgttcc 
gcagcatgca 
ctgctctagg 
tcaacaatat 
gatccctatg 
aaaagggtat 
cagcactttg 
ggccaacatg 
cacgcctgta 
gcagaggtta 
accctgtctc 
taaactgcta 
aaagaattag 
taatcccagc 
cagcctgacc 
gtggtgcatg 
cgggaggcag 
agcaagaccc 
ctgcattcct 
acctcccatc 
tgtgtgcctt 
aaatgtcttg 
ggtattcatg 
tctaataaac 
gtgtaaagtc 
ctctctctca 
gcaagcatcc 
ggcagcctct 
tccttcccca 
agagaccctg 
tacatgtagt 
atacatatac 
cctacatcaa 
taggtaaaga 
gtgggaggac 
tctcaactaa 
gctcaggagg 
acggtcacaa 
aaaaaaaaaa 
tggaacttct 
gtttaaagct 
nnnnnnnnnn 



nnnnnnnnnn 
tgaactcctg 
tgtgcaccac 
tatatataaa 
ttttagagat 
tcctcctacc 
gagttatttt 
gagcctggag 
agtcacatag 
gtggaaggtc 
caagagcctc 
gcggatcatc 
ccaggagcat 
ctggctgagc 
aaggtggcag 
caacccgctt 
gtgggatgta 
gaaggggtga 
ccagttgggt 
cccctcattg 
ggcactgtcc 



tcacgtgtat 
cagctgaaga 
agttgtccag 
actgggggaa 
tgttggcctg 
agcaagtggg 
aagaaataac 
ggaggccgag 
gtaaaacccc 
atcccagcta 
cagtgaactg 
aaaaaaagaa 
gtcctcaggt 
gatgtcagct 
actttgggag 
aacatggcga 
cctttagttc 
gggttacagt 
tgcctcgaaa 
gctcggcaac 
actgatgtct 
ctgctctggg 
tggtaggcaa 
cctgtgtata 
agaacacagc 
tgtcttgttt 
ctttcttact 
atgttgtgag 
acccagcaag 
cttgagcttt 
agacagaaca 
tttaagctac 
catgtacata 
gaatgtgtca 
ggccaggcat 
tgcttgagcc 
aaaaaaaaaa 
ctgaggtggg 
aactgcactc 
aaaaggaaag 
attctagagg 
ccttcagcat 
nnnnnnnnnn 
nnnnattttt 
ggctcaagca 
catgcctggc 
tatatatata 
agggtcttgc 
tctgcctttc 
taccacacga 
gcaagtgaat 
ttagtgtcac 
tggggctcag 
tgctccccca 
ttctacatgg 
ggtgaggcac 
tcccctcacc 
agacaggtag 
ctcagggggc 
agagtctggg 
tagtgtccac 
tctactcctc 
gctacatcat 
tcggagatgc 



tatttcagct 
agctgaggct 
ccaggattct 
atggtcccta 
ctacatgtga 
atgggggtgg 
aattaggccg 
gagggtggat 
atctctacta 
cttgggaggc 
agattgcacc 
aacaaaacaa 
gcatttttgc 
tcttttgaaa 
gctgaggtgg 
aaccccgtat 
cagctactca 
gaaatgagat 
aaaagaaaaa 
agcctgatgg 
tccctcgctg 
tgcccagatg 
cctcaaagat 
atcctctccc 
aagggtaatg 
gtttccctct 
gtcttgtcct 
ctgacctatg 
gaactgagtc 
cagatgacgg 
ctcagctaag 
tatgttttgg 
actgtttcag 
ggcaccattc 
agggctcaca 
caggagttga 
aattagttgg 
aagatcgctt 
tagcctgagc 
aaagaaaaaa 
gggatggttt 
ctggggaaaa 
nnnnnnnnnn 
tagagatagg 
atcctcctgc 
ttatttcata 
atttctgtat 
tatgttgacc 
aaagtgttgg 
agcataggac 
ctgaactttc 
tccccccaag 
gggcgccccc 
ctttcctgtg 
ctgctgtgaa 
cgctgaggcc 
atacgtttcc 
ggctatgaaa 
gggactcctc 
ttcagaggaa 
tgagtgccag 
cgtcttcggg 
ggactggcgg 
caggtgacct 



aatccatcta 
ctcaaaagct 
aacttagaca 
gtgagatatg 
agagctgtgg 
agtggacagt 
gccaggggtg 
cacttgaggc 
aaaatgcaaa 
tgaggcacga 
actctactcc 
gtaggtactt 
ttatcagttt 
atttgggcca 
gtagatcacc 
ctactaaaaa 
ggatgctgag 
tgcaccactg 
gaaaatttgg 
tggatgggca 
tcatcatctg 
aattggatgc 
cactggggcc 
ttgagtgtgt 
ggatgctact 
cttgatcttc 
tccctttgtt' 
aagaggccca 
actcatcata 
cagccctggc 
ctatacccta 
gataatttgt 
ttgtctgaga 
caggtacttg 
tctataatcc 
gaccagcctg 
gcacagtggc 
gagcccagga 
aacagaaaaa 
ataggtaaag 
ttcacagtag 
ggttnnnnnn 
nnnnnnnnnn 
gtcttgctat 
ctcagcctcc 
tatatatatt 
ataaataaat 
accaggtctt 
gattacaggc 
atacatccaa 
ccgtctttga 
aactgcatga 
tgctgtctcc 
gagcctcctc 
caagatgctg 
cctgggggtt 
ctacccacag 
gcagggccct 
taggcctggg 
ggctattttg 
ttcctgaccc 
gccatgcagc 
atcaaggact 
gcctgtacag 



acaactctgt 
agtaacttgc 
ccagaggcca 
tgtcgagttt 
aaagcgccca 
aggagggctg 
gtggctcacg 
caggagtttg 
aattagctgg 
gaatcacttg 
agcctgggag 
tctgccatag 
taaaaatcag 
gaagcggtgg 
cgaggtcagg 
tacaaaaatt 
gcatgagaat 
cactctagcc 
aagatctgac 
gaggctcagt 
cttgacatgt 
tatatgagaa 
tccaatgatc 
actacacctg 
tctaaggtta 
ctctcattct 
• tactctgatg 
tgtggtggta 
tgggtgagct 
tgatgctttg 
tctcctgaga 
tactcagcaa 
ctatatttag 
gaatacatca 
cagcactttg 
ggtaaaatag 
acatgcctgt 
gtttgaagct 
gaccctgtct 
atccttgatt 
aagtctgtgt 
nnnnnnnnnn 
nnnnnnnnnn 
gttgcccacc 
tgagtagctg 
tttatatata 
aaatatatat 
gaactcctgg 
gtgagccatg 
aaatgttctg 
agaaaccagt 
gctgggacaa 
ccagggtctg 
accatgggca 
gagtaccttg 
gggggcacag 
agacaaatga 
ggctcacgcc 
cccacccagg 
gtgctctctg 
cactgccctt 
tgttgtgcct 
gcgtggacgc 
ggatggtgac 



aaggcaggta 
ctaagttcat 
cttttaacca 
catatttcat 
aagtgagtta 
gaacacacat 
cttttaatcc 
agaccagccc 
gctggtggtg 
aacccaggag 
acagagtgtg 
ggaggattca 
agaatgtctc 
ctcacgcctg 
agttggagac 
agctgggctg 
cacttgaacc 
tgggagacag 
aacagttgac 
tgtctgccaa 
aggcatttgg 
aacattctgt 
cctccttcct 
gatacttgct 
aattacaaga 
ctctctctcc 
aagcaagcta 
aggaactgag 
tggagacaaa 
caggcttgtg 
tagagtataa 
tagataacca 
tcattttaca 
attaacagaa 
ggaggcccag 
tgagacactg 
ggtgccagct 
ccagtgagcc 
caattaaaaa 
cttgccctct 
tgacagcgct 
nnnnnnnnnn 
nnnnnnnnnn 
aggctggtct 
ggaatacagg 
tgtatattta 
atatatatat 
gctcaagtga 
gcacctaact 
agctgagcaa 
ctctctccaa 
tcagagggca 
tccccttacg 
tgacccagct 
tgactggtgg 
gcgggtcacc 
acagcaacaa 
caccccactg 
taaccctttt 
gcctccgctg 
cccatcctgc 
tctcacctgc 
cccaactcag 
agcaagtggt 
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caggcagtgc 
agaaccctag 

agggacctgg 

caccttgcct 
ttgcagttga 
cagctttcag 
gggctagaat 
aggcttggca 
atcagggaga 
gctgtcagcc 
ttcaattacc 
gagcaagagg 
ataccatcac 
ggagactgga 
ctggagacca 
cccaggctgg 
agcgattctc 
gattctgtga 
aggatgggag 
agcctggccc 
ctagcagggc 
ctctctcctg 
tcaggccaag 
ctgaggactt 
cttggttctc 
tgcagatttg 
catgttaaga 
gaaacatgga 
tgttttctgc 
cagggagtcc 
ggttgctacc 
cagtgccttc 
caacttacac 
tccaaaggga 
ctggatctat 
tgggaattga 
ctctcctaac 
cattgttcga 
ttgggctgaa 
gggagggccc 
tcacttcctt 
tagaggaagc 
ggcccccaga 
ttgccttata 
cccctgctgg 
acaataatga 
aggatccttg 
gctctctttc 
tcacactcca 
tctgaccctt 
cctggattat 
tatttttaac 
tggtgtcctg 
ccctgcaagc 
ctctgccaca 
gcatctttgc 
gcccaatctt 
gcctcccttg 
tttttctcag 
ctgctccatg 
tatctccagc 
acagaataca 
ggggctcagg 
gctcagtggc 



ttttcatttt 
gagatcaggg 
cttgcttaat 
tgtttagtcc 
ggaaaccgag 
tggcagcctc 
aggtagagtt 
ttttaaaaca 
gactcagccg 
agggctgcag 
cccatggcct 
gctaagcgac 
ttctgttacg 
caaagcaggg 
ccattgaggg 
agtgcagtgg 
ctgcctcagc 
ctgcttttgc 
aaagagactg 
tggcactgag 
ccttctgtcc 
ggctggcctc 
ctgctgtccc 
ctgatgagag 
aaaccttcaa 
cagatatccc 
aagcacccgc 
gctagagaac 
tgccctgaca 
tgggaacgta 
aaatccatca 
accctgacca 
ctccaggtac 
tggtccatcc 
tttccgggtt 
ggcccagttg 
ctcttcactc 
ggtttcttcc 
atgccttcct 
caggcatccc 
aaaggcagcc 
tgaaatacct 
atggcctccc 
attttcctct 
gcgctgctga 
tgacaaaatg 
accatcactc 
cttcatctgc 
tgttcactcc 
cttgtcacct 
tgtggcagcc 
acaggggctg 
tctcatctgt 
tctggcccca 
gccacagtgc 
atgtactagt 
accttctctc 
gctgtcccat 
cgtttatcac 
ggaatgccac 
acctagagca 
gtggacagtc 
gagtttgagc 
cctgaagttg 



ctctgtgcgt 
gtacctcccc 
aataagaata 
tacaacacct 
gtacagggtt 
cagaacctgt 
ggtatattag 
acaaacactt 
gctgattctg 
tcatctgaag 
ctccataggg 
agagaccccg 
ttctattata 



gaataccagg 
tttttttttt 
cacgatctca 
ctcccgagta 
tctttttgtg 
ggagagaagg 
ccttcctcct 
aggacacatg 
agtgcccttg 
caggagggca 
acggtgtgag 
gtgtgcatga 
cgctgagatt 
tttgatgcgt 
gtgggtttct 
tcgaaggcac 
gaggcagcag 
gaccacgcta 
acctgctgct 
ccaccttcat 
agcacctgcc 
ctccaggaag 
gcctcttcgc 
tctctttttg 
actcagcctg 
gagctttgca 
ctaatgcacc 
tatcctgctg 
ggcttgatga 
ctccatgccc 
tcatgttgga 
gggactctcc 
acaaccatta 
cctgagttca 
tcaagccaga 
atcctcagta 
ccactgctgc 
tttagtctcc 
cagtcacctg 
ctcagagtaa 
gcacttcact 
ttcctagtgc 
ccctgtttct 
cttcttcagg 
gtacaattgc 
taacactctg 
ctcctcgatg 
gtgcctggca 
ctgcgaggtc 
cacttgccaa 
gagcatttgc 



ttacatccag 
attttacaga 
gctaatgcag 
ctttgaggta 
gcacagttag 
gttataaggg 
atatttattg 
ctcatctcat 
agtcacagtc 
ggctgactgg 
ctgctcagga 
tatcttctct 
ggcacagagc 
aggcaggatc 
tttttttatt 
gctcactgca 
gctgggattc 
ttcatccaaa 
aaatccagtg 
ctaccctctg 
ggccgcctgt 
attgtgccgg 
tctgcttatg 
agcttcccac 
gaatcactta 
ctgagggccg 
gtgtcattgg 
atgggtttcc 
atctgaaggg 
ccagtgactt 
ctgcaagatc 
tgtgggtttt 
ccttcccctc 
ttccaggaag 
tgtttctagt 
cctacccctc 
gttttagttt 
tgggagtctc 
accgtgatca 
tctctctgag 
taattgactc 
cactttggtt 
accctgaaga 
cagtccccac 
cctgtgtatg 
tcaagtgctc 
gcctcactgc 
accctggagt 
cagccagcag 
ctctccagtc 
cacatctgcc 
tcaggacata 
aagccaaagg 
cccctctagc 
tccggaagtc 
agaacattct 
tctttccctg 
aacctcccta 
cctatctctt 
gcaggaatct 
catagtaggt 
aatgccatcc 
tatcacacat 
cctcaaggct 



cagcttgttg 
tgaggaaact 
agtgctgact 
gatgcgttaa 
gtcattcacc 
tacacgctaa 
ctgtataaca 
acagtttctg 
tctcatga^g 
ggttggagaa 
cacagcacct 
cacataatct 
aaccctgata 
cttgagggct 
gagacagtct 
acctctgcct 
accatggagt 
cagtccctgt 
gcctccctcc 
ctcctaatgg 
cctcacccca 
agagaggaag 
tctagggcag 
ttcccacctt 
gtgggggata 
agatgaggcc 
gtaggggagc 
catagaaaca 
ggaggggcca 
cccgtactcc 
caaaagctca 
ggcatcacct 
tccctgcctc 
cgcagttctg 
agattgggtt 
cttcctccag 
gtgacctttg 
tatgctgcag 
gagaacccca 
accctctgat 
cccctgttgg 
ctatgtctgc 
aatttcccaa 
ttgcatctct 
tgatggagta 
cgttggtgca 
agcggtctcg 
ttccttgatg 
cagcttctac 
ctagccacca 
ctggccccgc 
agtctcttca 
ctttactatg 
tccccctcct 
tcaagtgtgt 
tctccagata 
actgtcctct 
ctgcccgctt 
gcttattgtc 
gttgacttgc 
tctcagctaa 
ctgttattag 
acaggaggtg 
ggggaccaaa 



ctttctccca 
gaggctagga 
gtgcacttgg 
tatcttcatt 
caagatcaca 
agtcttgtta 
aatcacccca 
acagtcagaa 
acatagtcag 
tctatgtcag 
gctttccctt 
cagacgtagc 
tactgtggaa 
gtcttgttgg 
tgctctgtcg 
cccaggttca 
cttgaaccca 
ttatcctaag 
ctgctagcag 
tgagggtccc 
gcctactgac 
cgctggacag 
ggacaccttc 
ccttcccatc 
tttgtccaaa 
tgtgaatctg 
aacactttga 
tggatttctg 
ggccaagaac 
tcagggacgg 
ccaatgccat 
gtctcatcaa 
ccgaggctcc 
gtcttctgat 
ggcgaggggg 
cctccacaca 
tcctgcacac 
tgtgagtctg 
gggaagggtt 
ggcagggagc 
agtcttccct 
tgtttgaaac 
gggcagccat 
ctcctggttt 
acaggacatt 
ggcagcaggc 
gcagagggca 
tttctctccc 
acaccccaaa 
acatctctag 
tgtctcagtc 
catcactctg 
gcctaaaaag 
ccattgttca 
tccctgcttg 
tctgcaaggt 
tctcagtgag 
ctctgcttgg 
tgaccgccac 
tfcgatcgtgg 
atgtttgttg 
tggaggaagt 
tgagaaccca 
gagcccatgc 
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aaagagcccg 
nnnnnnnnnn 
nnnnnnnnnn 
accacaatta 
cctgcaatcc 
agaccagcct 
ggtgtggtgg 
aaatcatgcc 
aaataaataa 
taaaaacacc 
gggcaagcgg 
agccctgtct 
ccagctcctc 
gagccaggac 
caaacaaaca 
aaagcagagc 
tgaatccaaa 
ctctgcccac 
tgggccagtt 
tgatgaccac 
gtgggggaga 
aaagaagttc 
agaagtttag 
cctagagagc 
atctctgaaa 
cattcatgtt 
tgctctaggt 
gggaaaaggc 
cacgtgcaac 
tcaaccccag 
tcttatggaa 
gataattaat 
gaccttggtg 
atgtgtggag 
gcccaccctc 
caaactcctt 
aacggtatgc 
ggtcctttga 
ggtcccctgg 
cactttgtga 
ccaaaggctg 
tgattctgta 
catctgagat 
cagggagacc 
ggatgatgac 
ctcactcggc 
ccttgattcc 
acgtggagcg 
ggtggggaca 
gagacgcagc 
ctctgccccc 
cctcagagag 
cacctctgaa 
ggctcatgcc 
ggagttcaag 
ttagctaggc 
aatcgcttga 
gcctgggtga 
gaaacagtga 
tgtactcaag 
aacccaaggc 
gttgacgtcc 
tggaaataag 
cagagcaggg 



aacgcttaag 
nnnnnnnnnn 
nnnnnnnnnn 
cttttgcacc 
cagcactttg 
ggccaacatg 
cgcgtgcctg 
actgcactcc 
ataaatatbt 
catgggaggc 
ggaggatccc 
ctacaaaaaa 
gggaggcaga 
caagacacta 
aacaaaagcg 
ttggagggct 
gttaattccg 
tccagccact 
gagggtgcag 
gaaggccaca 
ctgcctggcc 
tcttctcccc 
agtcagactc 
tatgaaatag 
tttccattat 
catgttcgca 
ttaccgggcc 
aaacccagac 
aactgggggg 
ggcaggtaag 
gaagctgatg 
gtctgctctc 
ccttttcctc 
gtaagatctg 
catgcctgcc 
tggagacact 
agtttggtac 
tatttgacag 
cttcgaggta 
ccttggacca 
aataaagcca 
ggtgaagcta 
ctttctggaa 
agctgagaaa 
agagaaagga 
cccacccggt 
tgggaacgta 
caggttccca 
ccgtgaccct 
agaaaagccc 
tgcctacaca 
acattttgtt 
aaacacatgg 
tgtaatccca 
accagcctgg 
atggtggcat 
acctgggagg 
cagagcgaga 
tcttgtccaa 
ttcacagagc 
tgtgaccatg 
cagcccagtt 
ggagggaact 
tcagaaccca 



caccaccctg 
nnnnnnnnnn 
nnnncccact 
cacataaata 
ggaggctgag 
gtgaaacccc 
taatcccagc 
agcctgggcg 
accatgtttt 
tgggcacagt 
ttaaggccag 
taaaaaaatt 
gggagaggtt 
cactccagcc 
acctgtgggt 
caagcgattt 
tctatcatat 
cacgtgtaat 
ttcagaaaga 
ccggcatcgt 
ccagacccca 
ttgtccttgt 
agaagaccca 
tctccacaaa 
cttaaaaaaa 
caaaaatgtc 
taatcgtgtt 
tgctcatgct 
gttggggaga 
ttctcaattg 
caccatctgt 
ccctcaggct 
cctaggggca 
gctcttcaaa 
caccagaaca 
gagcaaaagt 
tgcccaccgt 
aggaaattga 
caccaggttg 
aaccacccat 
gagagtcagg 
aaccactgaa 
gtcacgtgag 
gcttgctgtg 
gagcggagaa 
tcttgtgtca 
atggcagaag 
aacatgttgg 
tgcccagcat 
caaggtaaaa 
gtcacagtgc 
aattttgttt 
aataggtgct 
gcactttggg 
acaacatggt 
gcgcctgtaa 
tggaggttgc 
ctatgtcccc 
cccatttgag 
acattaatgg 
gctgtgtccc 
tctgcctaat 
gagccagggc 
caccaggaga 



cctggccagc 
nnnnnnnnnn 
gcgcctggcc 
gttaccatgg 
ccaggcggat 
gtctccacta 
tattcaggag 
acagaatgag 
gaccacctgt 
ggctcacgcc 
gagttcaaaa 
aactgggcat 
cgcttgagcc 
tgagtgacag 
aggtaggaac 
gccaagcaga 
caccacagcc 
tgacttcctc 
taaggcaggc 
agatcagctg 
ccaaggtaga 
gcccaatttg 
tcattccttt 
gcctgaagtt 
tacatacatt 
catttcatag 
tatcctgccc- 
cagccttttc 
gggaggtgca 
aatgagattc 
cctcttgtcc 
cctgctcaaa 
ggacagagct 
cactacttca 
actgatggcc 
accttcttta 
ggaggttgag 
ggcccccatc 
acctgtccag 
cctctctgga 
gaccttgaac 
ggtttttcag 
tgacagagta 
gcccaggaga 
gtcagacccg 
agacaaaaag 
aggcgtaaga 
gcagctgagg 
tcccagccct 
caaacagaaa 
cctttagctt 
aattccaggt 
taagaaacac 
aagccgaagc 
gaaaccccat 
tcccacctac 
agtgagccga 
accccccaaa 
atgagacaat 
ctttctcccc 
ggtaataggc 
caggacaaat 
agacagtctc 
gaatttaatt 



ggggnnnnnn 
nnnnnnnnnn 
cattactttt 
gctgagcatg 
cacttgaggc 
aaaaatacaa 
gcagaggttg 
actctgtctc 
tatgtgccaa 
tgtaatcgga 
ccagcctagg 
ggtggtgtgt 
cagcagtttt 
agcaagacac 
aggctcatag 
ggtccaagcc 
ctctctgccc 
aggggcagga 
caggccagac 
tgagaggagg 
tcccaggcct 
cagatgagga 
ttctttttca 
gctggccact 
tttgcctatg 
tacgtacaaa 
cttcctggca 
tcacctttcc 
agtgctctgc 
tgtgcaaatg 
tccccatacc 
cctttttctc 
tcaaagggcc 
gttgaaaaga 
cccccaccca 
gtactctttg 
gagcatggca 
ttgcactgag 
gatccagcct 
aggtgtggaa 
gcatgtgaag 
cagtgtgtga 
cagagaaaaa 
gagggggaag 
tgggtcagca 
aaaacccagg 
gccaatcatg 
agggaaagca 
gctccataca 
aatcaatgtg 
caaaaaggct 
ttcccaagtt 
tgatcttggc 
tggtgggaag 
ctccaccaaa 
tccagaggct 
gatcgcacca 
aaaaaagaaa 
tgagacccag 
attgtcgttg 
agtgcctctt 
cacatcctgg 
cagaggaggt 
gatcatgtgt 



nnnnnnnnnn 
nnnnnnnnnn 
aatggcaaaa 
gtggctcagg 
caggagttca 
aaattagctg 
cagttcactg 
aaaaataaat 
ctgtattact 
cactttggaa 
taacacagta 
gcctgtaacc 
aggttgcagt 
tgcctctaaa 
tacagatgag 
gaggtctctc 
cagggagagt 
aaggcttcga 
caggtgaaca 
gggcaggccc 
cagaggcctt 
aaccaagacc 
cttgaggccc 
ggctcaaaat 
actccacaaa 
ggaaacttag 
cattccccag 
caggtcctcc 
ccaagggctc 
tgtcagccct 
atctgaccag 
tgcagtcttg 
acacccccaa 
agggagaact 
tgcgctctct 
taaagtgcaa 
tggctcaaag 
ctaaaacttt 
ggcataaact 
aaatgtggcc 
gggctggact 
gccagttccc 
gaatcagagg 
gcctgcattg 
ctagctgctg 
tggcctcata 
agggccatta 
gcacccatga 
atagctccag 
ggaaactgta 
cccagacacc 
tgttacgtaa 
tgggcgcagt 
cttgaggtca 
aatacaaaaa 
gaggcaggag 
ctgcacttta 
agaaaagaaa 
ggaggaaaag 
tcccagccct 
aaccctctcg 
gaggtgaggg 
ggctctgacg 
tccactcacc 
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tgcctcagcc 
ctcctctggc 
gatccgtaca 
ctcaacagct 
cagcactttg 
gagcaacacg 
cgtgcacctg 
aggcagaggt 
agactccgtc 
ccttccccct 
ttttcctgca 
gcctggagag 
taccgtgggt 
ggcaacagca 
agagcagcgg 
gccagcgccg 
agccgctgga 
agggatttag 
gggagaaagg 
ggcggctgct 
gagataaaca 
taacactcct 
cacacataca 
aatttcatat 
agggacgggg 
gctgctccct 
cgactgacat 
aatgtaacct 
ctagaacaga 
acaccccgac 
cctgcacagt 
cagcccactc 
tgctgccagg 
ccaaagccat 
gcagcagatg 
tccgccttca 
tccacgtccc 
agacacaggg 
acggtccctg 
ggtcgggggc 
aaggaaccaa 
ctgcgctgct 
gctcagcaca 
cctgctttca 
atggggctgc 
cacagcctga 
tgtaatatgc 
catatctaga 
attaaaatgg 
caggcagatc 
tctctaaata 
taaatgatag 
cactgttata 
aatgagaacc 
taacggtaaa 
acaaagggct 
nnnnnnnnnn 
nnnnnnnnnn 
ttccttcctt 
gcacttggtg 
ggctgtcaaa 
ataaaataag 
tctgctgaat 
ttatgaagca 



aagccctcag 
cccaccatcc 
aagcctaaac 
ctggtaaaaa 
ggaggccgag 
gtgaaacccc 
tagtcccagc 
tgcagtgagc 
tcaacaacaa 
tcccctgaag 
ccctgcatcc 
gggcagggcc 
aaggcgaggc 
gctgggcccg 
cccagggggc 
gaccctgcca 
caagccccac 
ggccctgggc 
gatccgcttc 
ggggagaggt 
gtccgagcca 
aagcctttaa 
ccactcacca 
ggttcaacct 
tcacacaccc 
ccccctcaag 
aatttattag 
ccagcaggga 
gctccatgga 
acacagcctc 
ttctcacaca 
atactgccct 
ccaggcctgt 
tcagggacag 
agggcatttg 
tcccacctat 
gggaggccag 
aagggcgagg 
tcctcccgtt 
caaagcggtc 
tggtgaagcc 
catcctgggt 
tgacaccagg 
aatcccatgc 
tccagaaatg 
ggcttcaact 
actcagtgtc 
ggcaaaggca 
cccggcgtgg 
gcttaagcct 
aaaaaaagaa 
tctgataaaa 
agtcagttaa 
aggcactgcc 
ctgccaactt 
gcttcgcttt 
nnnnnnnnnn 
nntccctccc 
ccctccatct 
aaagtttcaa 
ggagacttac 
ggccttttgc 
cagatagcta 
ctccaaagtt 



ggcaggggaa 
tccccaagtc 
acactccaca 
aggcgtttag 
gcgggtggat 
gtctccacta 
tactcgggag 
cgagatcgcg 
aaaaaaaaaa 
tggctggctc 
gtgaggcacc 
ccctcccctc 
cggccggcta 
caagactcag 
ggggccggcc 
ttggctggag 
ccggccgcca 
caagttgcac 
cttcctttag 
gaaattcagc 
gcccggccag 
cgcgtttaca 
catgtaatag 
agtacactca 
actctcccac 
atcatgttac 
tttacttgtt 
ggatgactcg 
cattcatggg 
atccacacac 
ctcacttgat 
caccccactc 
gacactcacc 
cgccaggaat 
gggagctgtg 
tcttttcttg 
ggccatgagg 
ggttggcctg 
ccccatagcc 
catgaggatc 
caggttgagc 
ggtgttggtg 
aacagctggg 
caagtgcctt 
gcagccatta 
agctcaaatg 
caagcttagg 
aaggcagtga 
tagttcatgc 
acaagtttga 
atttagaaag 
taatagctaa 
taaagtatcc 
ctccagtccc 
ggagttgtga 
actcaatgct 
nnnnnnnnnn 
tccctccctc 
tctccacacc 
ttctcctgtg 
ccaatctttc 
atctgattta 

agggggccct 
tgagacacgc 



ggcaaagtca 
actagatccc 
gagggggaaa 
gccgggcgca 
tgcctgagct 
aaatacgaaa 
gctgaggcag 
ccactgcact 
atggtgttta 
aggaaaaacc 
actgccaagg 
caaggggcca 
accccgggct 
cacgggacgt 
agagggagcg 
attacaggag 
gggagggtct 
agcagggaga 
agctgtgaaa 
caaaaccacc 
gaaccttccc 
cactcacata 
gtcaagccat 
caaacacacc 
gacatggcaa 
ccagttttat 
tattgggtta 
gtcagtcctg 
ctctgtacac 
acacagcctc 
ctagtgatct 
actctgccct 
gtgagtgaag 
atcaacggag 
ggaagccaag 
tggggccaca 
gtgcaggacg 
tgagcacccc 
cagccacctc 
cccagtggca 
atctcgtcct 
ctgctctcag 
cacaggagac 

tgggggtacc 

gtacctgacc 
aaatactgga 
gggttgtgga 
gtggtactct 
ctgtaatccc 
gaccagcctg 
aacactaaaa 
tacttattga 
cgttccctag 
ctctagaagt 
caagttaagg 
cannnnnnnn 
nnnnnnnnnn 
cctcccttcc 
tggtatcatc 
tggagaggtg 
cagcccacca 
agtaggaagc 
ggaatctgca 
ctcaaaggtg 



ggatgccctt 
acagctgaga 
ctgagactct 
gtggctcaca 
caggagttcg 
aaattagccg 
gagaattgct 
ccagcctggg 
aacacatata 
tctacccact 
acgccaggga 
caaacgctgt 
ggcggccttg 
cctcgtccaa 
gggagaggct 
gcggggacat 
gaggtcaaga 
aggggctgcg 
tgtccccggt 
cagtcaggca 
ctccaacctc 
aataaacaca 
gtgcacgacg 
taccaactca 
gcgtgcacac 
tttcttccca 
tctgtgcccc 
attgtgctgt 
acaaacacac 
acacctgctc 
gcgtccacag 
caccccactc 
tgaacgttag 
acagagctgg 
ggcgggagct 
agaggacaga 
cagtgaagca 
ccctcccctc 
acctgccaac 
gggtggtggc 
gctggtcaca 
ctgaggaggg 
agcagcccac 
ctagagtcac 
ctgggagagt 
cataaaagta 
cccccaacaa 
aatggctata 
accactgtgg 
ggcaacatgg 
cttagaggaa 
gcacttaact 
gtgatgaagc 
ccacttggag 
agaaaagcta 
nnnnnnnnnn 
nnnnnnnnnn 
ttccttcctt 
atacagaagc 
agcactgagg 
atcccttgcc 
tgattcctga 
ttttagcaag 
gagtggttct 



cgcacacacc 
aggaccttag 
gaagggaggc 
cctgtaatcc 
cgaccagcct 
ggcgtggagg 
tgaacctagg 
cgacactgcg 
actaaattat 
caggcagagg 
aggctgccag 
ctgcgcccag 
cagcgtgcgt 
gtctgggcca 
gaggggcggt 
agcagggagg 
gccggagaga 
cagaggggcg 
tggaattaaa 
gcccttctca 
cctaagcctt 
ctttgagcaa 
aggtgtcgac 
tggctttcac 
gctatctcaa 
gcacctatga 
tcacccccaa 
agtccaggac 
acattaacat 
tttgcagcca 
gcccctcccc 

gg<j<?9 aactc 

gcagatgcca 
aaaggggaaa 
ggggtaaaca 
caactcacct 
ggcactgtgg 
cccctgcagc 
cagccgcacg 
gctgagcacg 
gcctggccac 
ggaagggagg 
agtcaggcgg 
atctcctctg 
cttgtgcaca 
tttactaagt 
gaagtgcccc 
acaagaattc 
gaggctgaga 
taaaacccca 
gctttcccga 
atgctccagg 
tgaggcacag 
gacttgtcct 
gtgataggag 
nnnnnnnnnn 
nnnnnnnnnn 
ccttccttcc 
agagaggact 
aaggggtggg 
cagtgtttct 
gcccctcaga 
cggaggtggg 
gtggggggca 
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gaaaggaaaa 
tggaaagggt 
caagccagac 
cgaagcagac 
cgtggagacc 
ctcatccccg 
gtccatcact 
ttctacccca 
cctttgctat 
ctcgcccctg 
gaccccgcca 
ggattgcgcg 
cgcatatatg 
aggcttcgaa 
cgcaccctct 
ccagctcgct 
cagacgccct 
ggccagaggc 
ggctgcagat 
cgggcagagc 
accctcggag 
cgcacgtggc 
actattcgca 
agccagccga 
aaaaagtcag 
ctttgccaag 
tctctcttat 
aatgctgtct 
actcttggcc 
gctatccttt 
cccttgggtg 
caacaccaac 
atcctaaacc 
tgtcccagag 
ctcttgtcct 
ttatttattt 
aaggaagctc 
actttcggca 
tctaaagcat 
aaatcctggg 
gttgggaaca 
ctagttaatg 
caccaaccgc 
taagacttgc 
gacccccaac 
agggctgtgt 
gcacatggga 
accgctagaa 
aacccctcta 
gaaggttttt 
taaatctacg 
cttttggttg 
gcggtcacct 
tgctggagcc 
aatagcagga 
agtcatctgc 
agccagaata 
ttaatgttca 
agggcctggt 
cactgtagga 
tctgaccttt 
agacacagtt 
gacacgcggg 
gcaaataatg 



tgcaaagggg 
gatgccaggt 
cggggaggag 
acttgcaggg 
ctggagagcc 
ggcaccagcg 
cccccggcca 
cagtccccag 
ttgcgatccc 
ggagggcggc 
ccccgtcccg 
gcagggtctg 
cccccctttc 
attcctcgtt 
cccctcccca 
tccgcccctg 
tctcctagga 
aagcttcgag 
aacgcttgca 
aagccccggg 
ccagccgcag 
gcttaccccc 
aattgaggat 
acccgctggt 
tgttggaagt 
tctccagacc 
taacttctca 
tacagagcca 
ccaggagcag 
ccacagactg 
tgtgccttaa 
ctcttaggct 
aatgttagtt 
gcacaagcag 
cactgatggt 
actttaaagc 
aaaatttgca 
ggccctaggg 
tctgtcctaa 
aagtaggggg 
gggaggtgag 
cccagctgag 
caaagacagg 
aagcagcagc 
ccctaaccca 
ctctctcaca 
cctcaggggt 
ggagataaga 
ggatagggac 
tttcctcctt 
gccatggctc 
ctcaggcctt 
cagagccgct 
gggcacggta 
ggcccagatt 
ccttagcctc 
aatacagcta 
gcaattctgc 
gggtgcccac 
gtctgggctg 
iggagatgtta 
caaggaactg 
ctcfcgactcg 
gcaattctta 



gaaggggtca 
gtggggagta 
ggggccacag 
tgcaccccgc 
cccaaccttg 
gccttcccgg 
gagcccacca 
-tcctagctca 
a^aactcgtt 
cccattaacc 
cggcgggggt 
ccacagggca 
tgggaaaaga 
ccctatcctc 
gccatctgtt 
cccagccccc 
ggccggaatt 
accccccacc 
aggacgggag 
aagaggcagg 
ccacgcagcg 
acccccgggt 
cccggacaca 
gggtgctagc 
cgggagtctg 
cctgaggaca 
agagaaacaa 
acctttggag 
ctgagaaccg 
ccgaggttcc 
cctctctggg 
atagtttgga 
ttcccttcat 
gtgcagggct 
gtcttctctt 
ctcactttaa 
taaagtttta 
atgctatggt 
tcctctgtat 
caggagctgg 
ccagacagcc 
cctgggtgaa 
cgcacaccag 
accagagagg 
ggacacagct 
ttcacacata 
agcctgtttg 
ggcaccctgg 
tgtcttcagt 
tttgcagtct 
tatgtgcatg 
gattgcctgt 
aagcaccttc 
atagaagagg 
cacctttagg 
ccacagggag 
gtacttatta 
aaagtggctg 
accatatggc 
gtcaggctgc 
accaatggga 
ggatgcacag 
gttcacatcc 
ctgagtgcct 



cacttgggga 
acagatagag 
ccaaggtgag 
cctctcttcg 
tttctggggg 
gaggctcaac 
acgctcctcg 
tctgcataaa 
ccccaccccg 
ctcgcgaccc 
ctgggggtga 
gaggccaggg 
cggggagggg 
cggcccccgc 
ccactccgca 
tccccaagcc 
tctgcctcca 
aaccaccacc 
tcggggaggg 
ggttttccct 
ccgcctgccg 
ccgctcctgg 
gagtgcagag 
caattctgat 
ggctcagagc 
ggttttccta 
agacaaaggg* 
gtgggggaga 
gaaagaagct 
aaattgagct 
cttgtttcct 
taaaatgaga 
ttggggactt 
ggataaataa 
gatatagata 
tgttaaaggt 
agataaaata 
gggaagtttg 
ggagaaaagc 
actccctcca 
agaggcgaac 
gaaggatggg 
ccagtctctc 
gaacctgccc 
ggcacctcag 
cacagacaca 
ccgatccccc 
tctcctccaa 
caatggagcg 
ttacaaaaat 
ttacaggtag 
catccaggtc 
agtgggccca 
taggaaggca 
gcaaggagag 
gagaaggcgg 
tgtgtagtca 
agatgagact 
actcactagg 
tcccgaaatg 
tcccgttcag 
cctggtggac 
cactctgcat 
ccttctcagg 



aggtttcaga 
gaggcaaagt 
acaggtcagc 
tggcaatctg 
gtgggtcaga 
acgcagatac 
aggtccgacc 
gctccaatta 
agcccgtttc 
gggccgctcc 
ggggcgcgcc 
ctctccggga 
ggcttctcct 
acccctcctc 
gcgccgcgac 
ccggggagtg 
tctcccaccg 
accgttgcga 
tgtagggcga 
cccgggtcgc 
ggcacaccaa 
ctcgcgctca 
accccggcaa 
tttgtacttt 
agcagggatc 
tctgaaaacg 
agggaaaatg 
tggccaaggc 
tgggacctcc 
ccaccaccta 
acagcgacaa 
tagctgtgta 
gctctaacct 
ggtatgtctt 
attttaaagc 
aaatgtaaat 
ggagactcca 
agtcatacct 
cagcttcctg 
agcactaagg 
gggctggcat 
ggtgtgggga 
acttcccttt 
tcctggccct 
gcccctttcc 
tgcatgtgtg 
caagaggtac 
cccaaggagg 
ttgacttagg 
agaacttctc 
aaaagccata 
ccttggtctg 
tcccattggc 
acaggaatcc 
agaaacagag 
ccatttttct 
ttgttccacc 
tctcaggtat 
taggtatgag 
gggccttctg 
ggtggcgaga 
agaaggcttg 
tactcactgt 
gctgttgtgg 



caataccgag 
gagtggagac 
agccagaaac 
agaccgagga 
gaggaagcct 
ctggataggc 
ttgtccctcc 
acatgttttt 
ccgccgcttc 
tggcggtcct 
ctggggcaga 
aaaaggcagg 
gggagactcc 
ctccccgcca 
aaacacggct 
ggggagtgag 
gggtccggct 
gggccggtga 
gtttaaagga 
cgccccccgc 
ggacctggcg 
gcctccccag 
gcctactgaa 
acaaaaacaa 
tgcgatgtga 
gaggggacag 
gcttagctgg 
ctctgaggtc 
tttctgcaga 
acactgtgtg 
gaaagaatga 
gaacagacag 
ccagggctta 
tctgcaggat 
ttcacgttat 
atagtataac 
aaaaagtgtt 
tagcattctt 
gatgtacccc 
gcagggcatg 
gccaagcgtc 
agacaccccc 
ttatttcctc 
ggaaggggcc 
ttctgaaagg 
cacactcatg 
caggaggcag 
aagaaagctc 
gggcgttttt 
ttggtattta 
tggggcactc 
agaagtctat 
ggcgtactcc 
caggagtgag 
tcaagtaggt 
ccaggtcctg 
agtatctcac 
aacaagtggc 
gaaggcacag 
ggctcacccc 
ggaggctctc 
gaaggcccag 
gtgactttgg 
cgaagatgta 
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agttaaaaaa 
ttatttcatc 
gaaagatcag 
gcagggctgc 
aagggtcccc 
ccaatggtgc 
gtttactaaa 
actctgggag 
aacatggcaa 
gtagccatca 
caggaatcag 
aacaagacac 
atgtaatccc 
ccatcctggc 
ttagctgggc 
aatggcatga 
gcctgggcaa 
aaaataacaa 
tcaggggact 
cttggaaacc 
gcaccacata 
aaccctcgct 
tggctgctga 
aaaaggggtt 
tcagcccagg 
nnnnnnnnnn 
nnnnnnnnnn 
aagaaaaaaa 
tcaatgtttg 
aaaagtttat 
aaaatatctt 
ttttattttt 
tcttggctca 
aagtagctgg 
ttgaaatgga 
tgcaacctcc 
attacaggca 
tcaactaaag 
cccacctcgg 
tacagtgttt 
actactcact 
agtgccctat 
tctatgatta 
agtactatgt 
atactgtcta 
atttctctga 
ctcaaatttt 
ggcgcaatct 
gcctcctgag 
ttttattaag 
ggtgatctgc 
cagccattac 
tgtcatcaag 
gctcaaggga 
atgctcagct 
ctcaggtttg 
ctgggattac 
ccacttatgt 
ttgcttaagg 
tatgtgtgtg 
cacaaacatg 
gagtacttgt 
atgccctgaa 
ttcctgcaca 



aagtatgcat 
acaattcttt 
cgtcagggag 
ctcgtgcagg 
tttcaggggt 
ctgaacacag 
aacagactcc 
gccaaggtga 
gaccctgttt 
cctgtggttg 
aggctgcagt 
cttctctaca 
agcactttgg 
tagcacggtg 
gtggtggtgg 
acccgggagg 
cagaatgaga 
taaattaaaa 
ggggaatctg 
tgcagattag 



caaggcccta 
ctcacccaga 
agcaaggcga 
ggatgggaac 
caagaacttt 
nnnnnnnnnn 
nnnnnnnnnn 
aatcttaaag 
tatttcaagc 
aaagtaaaaa 
tgtataaact 
ttgaaatgga 
cggcaagctc 
gattataggc 
attttgctct 
acctcccagg 
tgcaccacca 
agaaccatgt 
cctcccaaag 
ataaagcctc 
cactcactca 
ataggtgaac 
gctacacaaa 
atgggtttgt 
gtttgtacac 
gtgtattcct 
tttttctttt 
cggcgcactg 
tagttaggac 
agatggggtt 
ccgccttggc 
gtttttttgg 
gctggagtgc 
ccctcctgcc 
aattattttt 
tctcaaactc 
aggtgtgagc 
tttcaagtta 
gatctgcagg 
tttatgtacg 
taattataag 
gccagtctgg 
gtacggtagg 
agctcagctg 



catgcttagc 
gggaggaggg 
gcagatgagt 
agtgggtgcc 
gctggagaca 
gcccaagagt 
taggaggtca 
gaagatcatt 
atctgtacaa 
cagctactca 
gagctgtgat 
aaaaaaaaaa 
gaggctgagg 
aaaccccgtc 
gcgcctgtgg 
cggagcttgc 
ctccgtctca 
acaaaaacag 
tatttttaac 
agacctctga 
cccctgcccc 

agggctctcc 

acacagcact 
ctgtccccaa 
ttcttttcct 
nnnnnnnnnn 
nncactccag 
aataaggata 
caagtgttat 
agctaagtaa 
tagtgtagtc 
gtttcactct 
tgtctcctgg 
acccgccacc 
ttgacccagg 
ttcaagagat 
cactcggcta 
tggccaggct 
tgctgggatt 
cagtagtgta 
cccagagcaa 
cattttttaa 
tgcttaccat 
agcctaggct 
tctatcatgt 
gttgttaagc 
gatacagggt 
cgacatctac 
tacaggcacg 
ttgccatgtt 
ctcccaacat 
ttgtttaatt 
aatggcacaa 
tcagccttcc 
tatcttttat 
ctgggctcaa 
ctgaaaacct 
agcaaggttt 
cctgacctcg 
tgcatgatgg 
cagggataaa 
tgatcaggaa 
agttggttag 
ggcactgagg 



acatagtgag 
tttacgtgtg 
tcaatgtaag 
ctaccactga 
ggatcccaca 
caggactggc 
ggcactgtgg 
tgaggccagg 
aatttttttt 
gaaggctggg 
tttaccaccg 
aacaataggg 
agggcagatc 
tctactaaaa 
tcccagctac 
agtgagccga 
aaaaataaaa 
actcctacgg 
actccgtgag 
ggtgcctctg 
caggagagtg 
tttcaggggt 
ggtgagggac 
aacgggagat 
tcccaacatg 
nnnnnnnnnn 
cttgggtgac 
taaagaaaga 
tacaaaacag 
gctagggtta 
taagtgtaca 
tgttgcccag 
gttcaagcga 
atgcatggct 
ctggagtgca 
tctcctgcct 
atttttgtat 
ggtctagaat 
gcaggcatga 
cagcaatgtc 
ctgccagtcc 
tattttatac 
tgtgttacaa 
ataccatgtt 
ttgcataaag 
aacacatgta 
cttgctttgt 
ctccccggtt 
caccaccacg 
ggctaggctg 
gctgggatta 
tttttttttt 
ccatagctca 
cagtaactga 
tttttgtaga 
tcaattctcc 
tctagtgtgg 
aggtcactta 
gttttcattt 
ggggaaagct 
ttctatgatg 
caatgtcctt 
gggtgaggca 
acccagtgga 



tgcttggtaa 
ggtggcccca 
gaaaagactt 
gggtatctaa 
ctaggtagaa 
cacttcacaa 
ctcacgcctg 
agtttaagac 
taaaaaatta 
gccggaggat 
cactccagac 
ccgggcgcgg 
acgaggtcgg 
atccaaaaaa 
ttgagaggct 
gatcgcacca 
ataaaaataa 
tcaggctgag 
gggttctaaa 
gctgagatga 
gctcctgctc 
cccaccatcc 
tgcaggcctg 
caaagggtgg 
ggnnnnnnnn 
nnnnnnnnnn 
agagtgaaac 
aaatattttt 
tcaaaagttt 
atttttttat 
ttgtttttat 
gctggagtgc 
ttctcctgcc 
agtttctttg 
atggtgcaat 
cagcctcctg 
ttttagtaga 
tcctgacctc 
gccaccatgc 
ctagaccttc 
tgcaagctgc 
tatattttta 
ctgcctacag 
gcctacgtgt 
ataaaatcac 
taaacattta 
cacccaggct 
caatcgattc 
cctggctaat 
gtctcgaact 
caggcatgag 
taagagacag 
ctgcagcctc 
gactacaggt 

gggggggtct 

tgctttggcc 
aagtggaaga 
tgaagcctga 
gttttaacag 
cagaaatcaa 
aagaagtatg 
tgggaagtga 
gtaagaccca 
ctctgctaca 



atagaagcag 
cagggcagat 
actaacagca 
gctaagaggg 
ctggattgga 
agcacctgga 
taaccccagc 
tagcctgtgc 
gccaggtatg 
cgcttgagcc 
tgggcaacag 
tggctaaggc 
gagatcgagg 
aaaaaaaaaa 
gaggcaggag . 
ctgcactcca 
ataaataaat 
atatcctgat 
aggcagacaa 
gtgagggatg 
cccccacacc 
ccatgaaaag 
tcagcgtccc 
tgggggcctt 
nnnnnnnnnn 
nnnnnnnnnn 
cctgtctcaa 
gtgtagctgt 
ttaaaaattt 
cgaacaaaga 
tttatttatt 
aatggcatga 
tcagcctccc 
catttttttt 
ctgggctaaa 
agtagctggg 
gacagggttc 
aggtgatcca 
ccagccagta 
acattcactt 
atgcatgata 
ctgcaccttt 
taatcagtac 
gtagtcgtct 
ctaatgacac 
caagaaatag 
ggagtgcagt 
tccggcctta 
ttttttgtat 
cctgacctca 
ccaccatgcc 
attctcactc 
caactcctgg 
gtgagccacc 
ttctatgttg 
tcccaaaggg 
taggcccagg 
ctagttttgt 
tgtctatgtg 
gtaagccaaa 
ggccacggga 
catttgagcc 
gagctggggc 
gggcagtgag 
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gagcagaaag gctgaggaag gctgggtgtg gtggctcaca cttgtaatcc cagggctttg 43200 

agaggctgat gggggaaaat cggtagagct caggagtttg agaccagcct gagcaacata 43260 

gcaagactcc atccctgtaa aaagctttta aaaattagct gggtgtggtg gtatgcatct 43320 

gcagtctcag ctactcaaga ggctggggta aggattgctt gagcctagga ggtggacgct 43380 

gcagtgcgcc acgattgtgc cactgtactc caacctagga gacaaagcga gatcctgtct 43440 

caaaactgaa tgaataggct gtgtgcggtg gctcactcct gtaatcccag cacttttgga 43500 

ggctgaggtg ggtggatcac ctgtgattgg gagtttgaga ccagcctggc caatatggtg 43560 

aaacccgata caaaaattaa ctgggcatgg tggctcacat ctgtaattcc agctactcgg 43620 

gaggctgagg catgagaatg tcttgaaccc ggggggcaga gggtgcagtg agctgagatc 43680 

gcaccactgc actccagcct gggagacagc gagactccat ctcaaaaaaa aaataataat 43740 

aataacaatt aaaaaaaaat taaaaggcca gggagcactg gcagcctgtc caaggtttca 43800 

ggtcacttta gtaaagggag aacaatggct cctcccagga cctctgggat ctcagcattg 43860 

atacgacagt catggaaatg ctagggccca ggcagaccat ctcagggaaa acaagtggct 43920 

ctgccctgcc ttggccactt cctggccctc tgcatgcccc agggtctcag caccaagctg 43980 

ttctcagtga gtagctctca tttagtgcca gggctctcgg gcttacatcc tacgatgacg 44040 

atggaatgca taaaagatgg ggctgtgata gcccagagct aggggtttga atctcatgag 44100 

atgttcatgg agccctggga gggagctcag tgcaagttca tttctctttt ttggttgaga 44160 

tggggctcag aggaggaagg acttgttcaa agacacacag ggagtgtttc agtgtgggac 44220 

ggaggtttat ggagaaaggg tgaccatcca aggcttggac aaagatcatg acttcgacca 442 80 

gcaagcctca actctgtaga cttggtgggg gccaggccct cccaaacaca cctgacaggt 44340 

gtctgtggtc ttggggacat tgtcgctccc cttcctgctg atgctctgct gtccctctcc 44400 

catgaagcgt atctcttcgc cgtcccccat ccttgctgag agaggatggg ttctcttctg 44460 

accaatactg aagatcttta gtaaagttct cttttttttc attttctgaa agtccctctc 44520 

ttgagaaatc aggacaagtg agtcagggcc aggacaaaaa acagtgtggg acgagtgtgg 44580 

tggctcacgc ctgtaatccc agcactttgg gaggccaagg tggcggatca cttgaggtca 44640 

tgagtttgag actagcctgg ccaacatggt gaaacctcgt ctctacaaaa tacaaaaatt 447 00 

agccaggcgt ggtggtgcat gcctgtaatc ccagctattc- gggaggctga ggcaggagaa 44760 

tcacatgaac ccaggaggcg gaggttgcag cgagctgaaa ttgggccact gcactctggc 44 820 . 

ctcttggcaa cagagccaga ctacctctca aaacaaaaac aaaaacaaac gacaaacagt 44880 

gtagactttg tgtttttctc aaaagcactg tcaagccagt gcccgcagca gtgggcctag 44940 

acacctccag tcttgcctca gggtcagttt ccagcctccc tggacacttc ccccaggtat 45000 

gtgtactttt tgattgtcct aaatccagag tctgtggcct gacctggttt gtcacagctc 45060 

tcagtccctc cccatcccga atcccaggga gccgcaggtg tgtgcagaag aggcacacca 45120 

cactcaatac atcttgcatc ctcgctggac ccaatccatt ggcttggtga tgtacagact 45180 

gagcctcatt atagccgttc gttcctgttg acctttccag atcaatctgc cagcttggct 45240 

tctccgagtt tcgcttgtca gcatttctcc aatcccatca tgtactttgg acctctttgt 45300 

tgggtggctt gctttatctg aaattttcag atttgacttc aggtctctcc tttgtccctt 45360 

aatatggctt aatggtggac cctgtcaggg gtagagaaaa tattgaggag ccctgacttt 45420 

gaggtgcaca agttagaggg ttagacaagt ccagccacaa ccagcccaag ctgcagtgta 45480 

gggaggcctg tccagctgct ccacggtfcga gggtggagca tacaggaagg cttccttctt 45540 

gctgcagccc aggtgttctg gctgccctag ctgcctggct ttggtagaag aaagaaaggc 45600 

tctgtctctg acttgtcaac taatggcact atgagattgc acataattaa cctgggtctg 45660 

ctcttccaaa agccttgggc ctctgactgc aacatggagt ctgggtatca ctccccatcc 45720 

ctgcgccact cacctgctct ggcgctaggc gtgtgcctaa tcacttaatt tctctgtgct 45780 

gcctcttagg tatcacttcc cctgatccca aatacttacc aggtgtggga tgacacctga 45840 

ctagttactc cttggaggta tctgcttctc accggggact ccgaaaccaa acgaaaagca 45900 

aggccaagcc cagcctaaag gacgcttcct acatgacttc aggcttgcgg gggctggagc 45960 

gtgggggtgg caatggagtt ggggggggct cagggagggg atgtggaagt gctttgcttt 46020 

gcaaactcta gagaaccgtg taaataggag tgattattct gtcccttccc tttctttcca 46080 

acaggaatca gcatcccaca gcccatgttc agctatgaag aatggaaact gaggctccgg 46140 

gaggggtata gggaggagcc agcagggtct tgagttcata ttagtgccct ttcctccata 46200 

ggcacatctg tgttttcttt tattttattt tgaatttaat tttttttttt tttggcagag 46260 

tcttgctctg tcgcccaggc tggagtgcag tggcgcggtc tcagttcact gcaatctccg 46320 

cctcctgggt tcaagtgatt ctcctgcctc agcctcccga gtagctggga ttacaggtgt 46380 

acaccaccac acccagctga tttttgcaat tttagtagag acagggtttc acagtgttgg 4 6440 

ccaggcttgt cttgaaatcc tgacctcaag tgatctgcta gcctcggcct cccaaagtgc 46500 

tggtattata ggtgtgagcc actgcgctcg gccacatctg tgttttaaat gagaggaaag 4 6560 

gggataatgt gcattttgtg gaagcttggg ccgtttgtgt ctaggactct tatgatcttc 46620 

ataagttttc ccccagggag gacactgttc cacttaggga gtcaggaccc ccagtcctta 46680 

caagattcag cctctcaaaa tggagacagc agttccaggc ctgggctggg ttctgttcac 46740 

actaggagag ggcaagtgag tggtgtttgg gatgtgggga agtattatga aaacagagat 46800 

gctccaattc ctagtgatag gaaaccatta agctacttgg catcttaaaa ccaagagcgg 46860 

ttcaagttct gagattgtta acacacctta caacaccgcc gccgttatta ggaagaagct 46920 

ctgtttgatg acgtcccaca ctgtgggtac ctttatgaac aggaatttgc tttttcaaat 46980 
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cccagagaag 
gaattcaaga 
gttcagagct 
ggtagagctc 
tggctcatgc 
gagtttgaga 
agccaggcat 
ttgcttgaac 
ctgggcaaca 
acatactgtt 
cagctatata 
cctcctcagc 
tctaactttt 
aggcatgatg 
ttgagcccag 
aaaataaatt 
gagaggagaa 
gcactgcagc 
cgcggtggct 
gtcaggagat 
aaaaaagaaa 
ttgggaggct 
gatcgcacca 
ggaaaaagaa 
tatcacatct 
atgcagtatt 
catttgggct 
cactacttct 
cctgatcata 
cttttttgtg 
ggtggaaggg 
cacctggttg 
tatgtggtta 
gacaacaaaa 
nnnnimnnnn 
nnnnnnnnnn 
ggacatttgg 
actagtatct 
ggcctctgta 
agtaggcgct 
ctggtgtgat 
agaggtggag 
gatccaggtg 
tggctggatg 
aggggtgaag 
ctgagaaagt 
gtgtgtgttg 
ggagaagcat 
ggagatgaag 
atgacataac 
atccccaata 
aatggccaca 
atcactactg 
ggggtagtca 
acaagactgc 
ccacttgctg 
taacctccaa 
atggttggcc 
gtctgttgag 
actttaagtg 
acttctgagt 
ttatttatca 
ataaatgaat 
gaggctcaga 



taagattaaa 
atgactgacc 
ctcccaatag 
aggaatctaa 
ctgtaatccc 
ccagcctggc 
ggtggcagac 
ctgggaggca 
gagggagact 
acaggcacag 
actaccacct 
agaggtaaca 
ctatcaatta 
actcatgcct 
gagttggaga 
agctggtcat 
tctcttgagc 
ctaggtgaca 
caagcctgta 
ggagaccatc 
gaaaaaaaaa 
gaggcaagag 
ttgcactcca 
aaaatatata 
gcgaaattaa 
ccgtcgtgtg 
gttaccaggt 
ggtgagcgta 
aggtaggtgt 
tatttctggt 
aagcaggagt 
gcatggagct 
gctccatgca 
gcfcgatgtgg 
nnnnnnnnnn 
nnnnnnnnnc 
taatgtctgc 
ttttggtaga 
acaaaaaagt 
aaaacataag 
tagctctggg 
gagggtggta 
aaaaacggtc 
agatggtgaa 
gatgaccaga 
gaccacaaaa 
agttcagtct 
cttgggtgac 
acttgggaat 
ccgtggtggg 
tttaaccatc 
aaggctactg 
cagaggtcaa 
gtgggcacct 
tatggtaaga 
gaggagcttg 
gaaacacaaa 
cctagaaatg 
cccaggctaa 
atcctgacaa 
atctcctcaa 
gctaccttaa 
gaatcaatcg 
aaatacctgt 



gttggctgtt 
atacagaatg 
tcacccctga 
aatccatttt 
agcactttgg 
cacatggtga 
acctgtagtc 
gaggttgcag 
ctgtctcaaa 
accttaagtg 
atatcaagac 
gacccacacc 
gttttgccca 
gtaatctcag 
ctactctgga 
ggtggtgcgt 
ctgggaggtt 
gagtgagatt 
atcccagcac 
ctggctaaca 
ttagccaggc 
aatggtgtga 
gcctgggcga 
tacattgtgt 
tctacactgt 
actacgggac 
tctggctgtt 
tgcactcatt 
gttggctttg 
tagagcggaa 
cggtttctgg 
ggcttctcct 
atgaacccgg 
atttaaaggc 
nnnnnnnnnn 
cagcagtggt 
agacattttt 
ggctagagag 
atccagtcaa 
gagactgtgc 
ttttagaaag 
agactggagg 
agcaggtgac 
gaaagcacta 
gtcctgcctt 
agtgaagcag 
gagatgtsgtt 
catatgtgtg 
gatctgcgta 
tgtgcagaat 
tggaagaata 
agaagggaag 
gtaggataag 
gggcaaaagc 
ggaggaagag 
gctttggtgc 
atcatccata 
ccatcccact 
gcgctacctc 
cactgaaaat 
tatattgcct 
acctccctgc 
atgatccaga 
agaatcgaaa 



ctccatcctt 
gggagcaaac 
actgcacccg 
aaaattaaag 
gaggccgagg 
aaccccgtct 
ccagctattc 
taagccaaga 
aaaaaaaaaa 
tacagcccaa 
acattccagg 
tctcctgctc 
ttcttgagct 
cactttggga 
caacatagtg 
gcctgtagtc 
gaggctgaag 
ctgcctcaaa 
tttgggaggc 
cggtgaaacc 
atggtggcgg 
acccgggagg 
cagagtaaga 
actttttggc 
gtgtatgaaa 
aatttgctta 
atgaataaag 
tcgcttatgt 
taatgtgctg' 
catgagggtg 
ctcacacatg 
ttggcgttgc 
gcttctgcaa 
ttcagttcan 
nnnnnnnnnn 
tctcaactga 
tgattgtcac 
gctgctaaac 
aaatgtccac 
ctgagagcaa 
ctcattttgg 
cagggaaagt 
taggaaagtg 
taactaacta 
gcaggtctag 
gtttgtgcgt 
ggactcacaa 
agtctgcagc 
tatatttggt 
taggagagac 
agaggagcct 
cagttcttaa 
aactgaagaa 
agttttggtg 
ggtgttgagg 
aaagcagaga 
tcctggctca 
tctcctctgc 
ctcaagcaag 
gtgtgtctct 
tgttttacta 
aactagagat 
gcctggtaga 
taaatgcatg 



gaaaaatttg 
ttgggaagaa 
gaccatcagt 
tatatcgggg 
tgggaggatc 
ctactaacaa 
ggaaggctga 
ttgtgccact 
aaaaaattaa 
tgaaatttta 
aactcagact 
cggtggtaat 
tcacacagat 
ggccgagacg 
agacccccga 
ttagctattt 
tgagccgtga 
aaagaaaaaa 
caaggcgggc 
ctgtctctac 
gctcttgtag 
cagagcttgc 
ctctgtctca 
atctggttta 
ggttggttct 
tccgtattcc 
ttgctatgga 
aaatatcttg 
acttggttat 
tctcttcagg 
ttgtgactga 
ctactgttgg 
aatacattaa 
nnnnnnnnnn 



nnnnnnnnnn 
gcatagtttt 
agcccagccg 
atctaacaat 
agtgttgaga 
gaaggagtaa 
ctgcttgtag 
aatttgggag 
gcagaggcaa 
atgtgtggat 
ttggaaggtg 
gtgtgtgtgt 
tgtccatggg 
tcagaaacag 
agcttgagcc 
gtgcaccaag 
gccaacagaa 
gaagggggaa 
tgtctgttgg 
gagcaatagg 
aagtggccag 
agccagctca 
aattccagca 
ttatcctatc 
ccttctctgc 
tccattcatg 
atatgctcgt 
tctctttaag 
ggcttgtgtc 
tgtgctctga 



gttttagggt 
agaaggcaca 
tatctctgtg 
ctgggcgcgg 
acgaggtcag 
tacaaaaatt 
gtcagaagaa 
gcactccagc 
agtatgtcat 
cacatctata 
ccatcatacc 
taaccactat 
atacattgtc 
ggagtatcac 
ctctacaaaa 
gagacgctga 
ttgcaccact 
tatggccggg 
ggatcacgag 
taaaaataca 
tcccagttac 
agtgagccga 
aaaaaaaaaa 
ttttgctcaa 
ttttgttgtg 
tatcggtggg 
tattcttgta 
ggtggaatta 
gctgaattcc 
gaatctggag 
actgctggta 
ggcaggtgtg 
caacgacaga 
nnnnnnnnnn 
nnnnnnnnnn 
gcctcagagg 
agaaggtact 
gcacaggaca 
ggtttaggta 
ttggaaagtg 
acagtgcatc 
ccactgaaat 
tggggatggg 
gatgggcagg 
atggtttctc 
gtgtgtgtgt 
acatccaagt 
gcctggggct 
acaagagtag 
aagccaggtg 
attgggaggg 
gtgaagaggt 
gtttggcaat 
gataacagaa 
cgagtctaca 
ctcattgact 
ctaccaggag 
ctatctgtca 
ctgccgtcac 
ttagttctac 
tctgtttgcc 
tatttgttga 
catggtggat 
tctaaactca 
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gctaaacttt ctccaggggg taaagttcaa gttgattagt caattgatta attaattcat 508 80 

tatgtaatgg aaaaactcct tctatgacct gggcagagtt ataggcagtg aacaagacag 50940 

acaaggtcct tgttgtcatg aagtttgctt tctgaaggag agagataata aacaagaaac 51000 

cagtaagaaa gcaagattat atcattttgg taaatgttct tgtggaaata aatgtgatga 51060 

tgtgtaacaa aagtaccaaa taggagagtg gggtgggtgg gcttctttta gaaagagttc 51120 

tcggagaagg cttatctgag gaggtggcct tttaaccagt acaaatgctt tagcttggcc 5118 0 

agtggagctg ggaccaggat gacaagggtc acttgtcatg ccagtgagtt tgagcttgta 51240 

gacaagagcc tgatcatgaa agactttgca gatggtggta atgggtttgg gttaattgct 51300 

actatgtggg aagactttga atgggaagca tggggacaat ggcctgtgat acatgttatc 51360 

aaatatggtc gcaggggcta gtgaggtggc agcagagata gggagaagta gacggactgg 51420 

ggaaggtaga agatggggca ggggaggcaa ttactgcaaa gacatattcc ttctaagctc 51480 

actgagtgtt catggtctct gggagcagag gttcctggag gggaaagagg ataatgtcac 51540 

ttcctgagga agcgggaaga acccatctga gacgtgggga ctgtgctggt tcgtttctaa 51600 

ggggccttcc agatctcaca tgccaatcgt cttggtctat gtcaattgtt ggggcatcca 51660 

aatggggaac tgttgtccag gccgatttca cagaacaacc gcccagtcca tatctcccga 51720 

gccattcacc cttgcagtgg cgttagctct ttcaccagct tttatctgcc ccgtggggat 51780 

gttggccaag cccagttaac aagcagttga tcagccccag agatcaggtc cctggagtct 51840 

gtcacttttc tgagggtggg gagagaatcc tggagcagaa catgtaacta gaagggccac 51900 

ctggcttcct atggtctgag ggagagaatg gtgggatctc tggcctgaat caaacctccc 51960 

tttctcagtg tccatcttac ctctctgctg taccttcgtt attttccagc agctcctcag 52020 

cccgttcctg tgggaccctt ctctgccaat ccctacaccc actgtaaatt tcaccgtggg 52080 

agggagatgg gccttgaggg ctgtattagt cttctattct gcataacaaa ttgcctcaaa 52140 

tttagcagct tcaaacaact catgtttatt agctcatcgt gagttcatca gcagtgtggg 52200 

cccagcatgg ctaggttttc tgctcagggt ctcacaaggc taaaatcaag atgttgtctg 52260 

ggctgtgtgc tcatctggag tttagggttc tcttccaggc tcacgtggtt gtggcagaat 52320 

tctgttccct ggagttgcag ggctgaggtc ctgttttctt gctgactgtc agatgagggc 52380 

tgctctcagg tcctcgaggc tgcccacatt gcttgccacg- tgcgtggtct tttccatcct 52440 

tgaagccagt gatggagaat ttcccttgga ttgaatcacc cacatggttg gactctctga 52500 

cttcaggaag agagccctgt ctcttttatg ggatcacctg' attagatcat acccatagag 52560 

ggcagttcct tttccttaaa gtcaactgtg gcatgtaaca tcacacaacc acaggagtaa 52620 

aatccatcat atttacagtc ccagggatta tgcacagtgc accaggggac aactgaattc 52680 

tgcctgtcaa aagggccaag caggacttta ttggtgaaga acagtggaat gtcattcttg 52740 

gttcttccag aaaaaaatca ctcagtaaag ttagaggttc tcttgccttt tgggaagtca 52800 

tcaaagaatc tcatggaggg tttggacctt caccctagaa acatcacacc atgttttcta 52860 

taattgcagg gttcatggtc ccttgaagcc tattcatagt ttccaggttg aaaagctctg 52920 

ctgcagggtg tggggaggga tgcaggtgga ggtgagggct gaatagtgtg agctgcatat 52980 

ctggagctgt ggtggttttt ttagtcttta agctgtcatg tgttgggggt tgggcatggg 53040 

aggggcatcc caagagctcc ttggtattga caccatctcc aaggtgatct ctgctctgcc 53100 

tggtgcacac atgtttttct cctgttgcaa cagcccactc ttgtagaaga gcagacccct 53160 

cagtaccagg tctgaccctg gacagcttgt accaggagct acagcacact cccccacaag 53220 

cctaaagttg ggatgagccc cccgagaatt agatcagaaa agattaaatg cagaggtgat 53280 

ctgtcaggtc ccctttggaa gtgctggtat ggagaggatt gactgagtct gtttaggaac 53340 

ctccaagctc tgtagtaact ttagggctag aaaggaggat gcctaagatt caggatcctg 53400 

cagtgatgag tcaacatttc ttggggaagg aggcagggct gaggattaaa cggagatgat 53460 

gggtatcgtt ctcttgctca aaggcactgg accccaaggc ctccagctct tcgctcccat 53520 

ttgaaattca agtcctgagc acaccacagt tgtgatgcag ggaaagaatg tgcttatcag 53580 

agagcctggg caagtgggcc ccttgtgagt accgttcaac ctcatttatg tcattggcac 53640 

caaaagtaga catcagtctc ttgaaagttt gattaatgct ggtcacactc aaagaccctg 53700 

ggtagcattc atttactaag caattactaa ataccagttt ctgtgctaaa tgctgcatca 537 60 

gtcagggctc ttaatggcag gcagcagaaa ctctccttgg ctgatctaag tagaaaaatc 53820 

caggactgaa aggaaacgga gtagctcatg aaattgcagg aagggccgga aaaccagaca 53880 

tggagccaaa gtcaggctgc agaacaggtc tagggaggat cccactgctg ctgagaccta 53940 

gaccttgtgt ctggcaccca ggatgttgta gggctcagac cctggatcaa tgtatcctgc 54000 

agtgcctctg tgggtactgc aactccagga actcaatctt gtcaacgcca ccgccagaga 54060 

gaggccttct tggcctccat ctttttggtc actagctcca gattcaaaat cttgaataga 54120 

tgcttcttct ctttgataga gcccagtcat atgcgttagc tgcaaaggaa gctgaaaatc 54180 

tattaggaac ttttgtcttc aaaaatgaga ggcctgtcct ccaccaagat ccataggaaa 54240 

tggaatccaa gaaaccacag gaaggggtga ggtgactggg cagctcacag catgcatgct 54300 

acatgtgaat tatctcattc atttctcaca ctacccagtg aggtaggtat tgtcatccct 54360 

acttcataaa tgatgatatg aggtacagaa agtttaagga acttgcccag gacacgacac 54420 

gcagctatta agtgctagac ccagtcaatt tgagtctgac ttggactgtc tgactccaga 54480 

agccaccctc tcagacactg ctgtatactt ccagtgaatg ttgatgaaat tttcagggtt 54540 

gctaagctgt ggatttcaga tcctggattg tatgacctaa aagagagact tccctaggag 54600 

tgagggtccc tgaacagtca actggtttcc aagaatgggc tccctctcat caccttatga 54660 
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cagtaatcct 
cgcagagccc 
ttctgtcttt 
aaatgcatcg 
atgaaggaaa 
tcaagctgtt 
ccaggtagga 
gacttccttc 
tttgacctca 
tgcttcactc 
atgggagggg 
ttacgtgagt 
atgtgccggg 
caccatcacg 
aaatgactaa 
atccaataag 
ttggaaaatg 
taatacttgc 
tctcgcccaa 
gttcaagtga 
atgcctgact 
gtctcaacct 
caggcatgag 
atgttaccat 
aagttagaca 
cccatcttcc 
cgctagacct 
ctttaataag 
aatgcaaagt 
cccctgcttc 
catcaatggt 
ttgtaaaact 
tcacagaaac 
taacattctg 
attgttttca 
tataggcgca 
ttttttattt 
accagtaaaa 
gcccaggata 
gtatcgggag 
ggtccttatc 
gttcctccaa 
tttgggaaga 
gtggagctgg 
agggctggca 
tcatcctctg 
ggagatggtg 
taccttctaa 
ttatgtcata 
tttacttccg 
gtttagaaag 
gcttttctgg 
gtcccctaat 
ctgtgttctc 
tttggggcag 
ctgttttttc 
tcgtcttcta 
tgaaaggatt 
aattttgtcc 
ctctcttagc 
ggtggatgat 
accaccttga 
ccttcccagg 
ctggggagga 



ctgtccaaca 
agctcaaagc 
ttaaatcaat 
ttcctcgttt 
ttacagtgtc 
ccttctactt 
atcagctcct 
tgaattagca 
atttactcat 
tgattttatc 
agagaagttt 
acgttactta 
cactgtacta 
aaggaagaca 
ctgataagga 
ggcagtgttg 
gggtatcaac 
catgaacagt 
gctggagtgc 
ttctcctgcc 
aatttttgta 
cctgacctca 
ccactgcatc 
aatgaaggct 
tacgtgaaca 
atcacagtca 
attagccaag 
gcagcctcaa 
tataaaaaga 
tggaggcaac 
cacataagga 
attcacatac 
atctcttcag 
tagaatagag 
tttctacaaa 
tttatgccta 
taataggtat 
caggggagat 
agaaccattg 
agaaatcttc 
agccttgcta 
ggcctctata 
tctaaagacc 
tgacagttta 
ggcgtctacc 
ccttgcacat 
gagacatgga 
gtcctatgtg 
gactctgagg 
ggcctgaagg 
aggtccttgc 
cccatcctga 
ggtgtgattg 
cgttgcagga 
tttctgctct 
cctttctatg 
tttctgaatt 
tctcaaggcg 
tttctccatt 
atcgccatga 
ggaaccagta 
ccttgtcagg 
cagagttgag 
acaaggggag 



gccaaagagg 
tcctgactag 
ggagacaggg 
ttctcattcc 
ctgtgcatat 
gaaatgcctg 
tggttcatgg 
gaggatgttt 
cttacaaatt 
tccacctgga 
ggtgccagaa 
accattaatt 
agcaccaagg 
gatgttaatc 
caagcccctg 
cggggagatc 
cacctacctc 
attttgttga 
agtggtgcaa 
tcagactccc 
tttttagtag 
aaagatccac 
cagccacttg 
ggtaggttgg 
tttcctgtac 
gctgggaagc 
cagaagtttc 
aattacagat 
agataaacaa 
aaggtcaact 
tgctcacaga 
taaatacttt 
gtttatagat 
aggacacatt 
tagtcaagga 
aaggatagtc 
ggacagattt 
atgtattttg 
cccagagctg 
actgaatgaa 
tctccctctg 
taagaagtct 
caggaaggtc 
ttctcccagg 
tgtggctttc 
tccttctgcg 
ggagggtgtg 
ttcctaggaa 
gacccattaa 
aaagccagcc 
gtcctggatc 
tttctattca 
tgatgtgtgt 
taaagccaag 
tcatctaagt: 
gtagatgggg 
tctgtgtgct 
tctgttatgt 
atgcttactt 
agtaggtggt 
cccaagccca 
cagcttgggt 
cactctgatg 
atgtgaagga 



tcctgtgggg 
gctcttgttg 
gagggttatc 
ttcccaatgt 
accaaggctg 
tttccttccc 
agccttttct 
cctagcttgg 
aggttgtaag 
tagcatcatt 
agaactggat 
acttcaatgt 
atacaatggt 
cattaaccaa 
gagctacaag 
aggagccaca 
actaggtttt 
ttgatgattg 
tctcagctca 
aagtagctgg 
agacaaggct 
ccacctcagc 
ccatgcatgg 
ccaactgagt 
ttgaagatcc 
ttttaaaaaa 
tgggcagggc- 
tcagcacgca 
caatctgtct 
atttggtgtg 
taagcaccta 
cctcagtatc 
acaggtccag 
ttactcagtg 
ataacataac 
tcaaaagagt 
gttctcaaaa 
gaaaagcacc 
ttccagatgg 
tgagtgggct 
acagaggcaa 
ttgtgagagg 
tctgggtggg 
aggtccctgg 
aggttatgag 
ggatgtgaaa 
ggtggcttct 
agccaataat 
gtacaaacaa 
tcagccaccc 
ctggagcatc 
ggccttcttt 
gcatgtgtgt 
atgaaactcc 
aagtgttttt 
tcagagttac 
taaagggatg 
ctgtggtctt 
tgagcttact 
attgtatacc 
tgactgcccg 
tcccctcata 
cccagggcaa 
aggacactta 



agggcttgca 
agtattcctt 
tccatcctcg 
gtgtactgtt 
tccaacctcc 
ttctaattgc 
gctctgtttt 
tcttaaccct 
ctaattgaat 
tttgacacac 
ttgaattcta 
atatttatta 
gagtaaagag 
gtaatctcac 
agggtgtata 
cagagcctgg 
taaaatcagg 
attgaaacgg 
ctgcaacctc 
gattacaggc 
ttgcaatgtt 
ctcccaaagt 
catttaaaaa 
ggtctgattc 
tcaggacagt 
atgcagacat 
atctgcatat 
tttaccataa 
cctgctttct 
attcctttta 
tgcgggggtt 
ttgccttttt 
ctcttctttt 
tccgattgat 
tctgtaaaag 
gaaactgatc 
tgtttgtggc 
caaggcgatt 
cccctgggtt 
ccccagggaa 
actctctctc 
aagcaaagaa 
tgagtgcttt 
ctgtggctga 
gatgtcagca 
gtgctccttg 
tgaactctga 
cattgcttct 
ataagcgtaa 
ctcagggttt 
aggagctggg 
ttctccacct 
ctgtgtgtgt 
ccttacttct 
tgccttcagt 
acacccaccc 
gggactctat 
ggttctactg 
gagtgccttc 
catttcacag 
actctaagtc 
gagactgggt 
ggtgtgagct 
gctatcctcc 



gatgggagtg 
tgattcctgc 
gctcaagatg 
aactttagtt 
acacctttgc 
atctttccat 
actatgcatg 
tctccttttg 
acaggatcta 
aagcaggcat 
accctgttgt 
agtacctact 
atgcagcctt 
aagaaaagta 
cagggcatcg 
gttgtctcac 
ttaaatgagg 
agtctcactc 
tacttcctgg 
agccacccct 
gaccaggctg 
gctgggatca 
tgttcagtaa 
agaaggaaag 
gactcctaga 
ctgaccttca 
ttttaaaaat 
ccactgaaga 
tccctctcct 
gcattccctc 
ttttttttcc 
tcacttcatg 
catagccata 
ggatatcaat 
ttttattact 
aaatgtgcat 
agttcaaaac 
ctgaagtgta 
cctgaagtgg 
gtgatgaaat 
cctgggggaa 
ggacctgggc 
ctctgctgtg 
cagtttctgg 

ggggcagcct 

gctggggaaa 
ggaggggaca 
cccgcctttt 
tagtcccttc 
gctgcgttct 
cttggcatga 
cactcccacg 
caatgacaaa 
ggctcttcta 
ctttctttct 
ccttctttga 
ggccaggagt 
tgacattccc 
tctcctttaa 
aaatacagct 
catgctctta 
tccaggttcc 
gtctgtggtt 
ctgccagggt 
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ctgagacttc 

caccagagca 

gctctggaag 

atatggtgga 

tccctgggtg 

cttgggtgag 

cccccagcat 

acctgagcct 

attttggtct 

ctcaagaccc 

ctgtagttta 

attaattatc 

ggcaggatca 

tcagccagca 

aaccagggat 

agagctaagg 

tatgtaactt 

gctgaggcag 

ccattgcact 

taaataaaat 

gagttggcaa 

tgcatagaat 

cccatcatgg 

atacggggcc 

tttctccagg 

tactgggctg 

ggtgaggtgg 

aggagatgct 

tctgactcca 

ggacaagagg 

ccactgcctc 

cagagctgcc 

ctccctgcaa 

cactctgctt 

catagaagtg 

atggagcctt 

tactgggcac 

cccagtttgc 

tggttggttt 

aatgaatgta 

gggcaccatt 

ctgccttccc 

aacacttgta 

gcttgccatg 

ttgggatata 

ttgagcaccc 

atcaataatt 

agacttgccc 

agcatgttac 

gggatgaagc 

caggacacca 

tctcaacccc 

atttttttct 

aggggtgaaa 

ctaaggtcac 

ttgttccaca 

agtgctgtga 

tctggtgtgg 

ttcccattta 

gtcttgttcc 

cagtccacag 

gggctcctag 

gaagtttccc 

tgaaaaattt 



cacctttgag 
ggagatggag 
tgaagatgcc 
caaaaacctt 
ccagacctgc 
tggcctatgg 
tgctatcact 
cagagaggtg 
ccaaccttca 
atgtgaattc 
cttgccggag 
gaatccagtg 
caggctcggt 
ctcagcttgc 
agatggacac 
atttagaatc 
aagttagctg 
gagagtcact 
ccagcctggg 
aaaaaaaaaa 
cccttgttct 
ctgatggtga 
tccagcaggg 
aggtctgctt 
gtcgctgcag 
ctcaccagcc 
ggctggggat 
acaaagagcc 
cagcagtgtg 
ctgaaagtcc 
agaagacttc 
ctctcctggg 
taaaatgggt 
agccctcatt 
tcagaagcta 
ttggagatgg 
ctgcatagtg 
atctctgtaa 
tatatgtaag 
taagtaccgc 
tcccagtggc 
ttcctcaact 
cgcaactccc 
cgtcagactc 
tggttatttg 
acaacatacc 
atgagcccta 
aagactgaac 
tttaaccctt 
acggggggag 
gcccagctgg 
ttctgtacta 
ccagcattcc 
caagaactct 
atagtaagtt 
agaatctgtt 
ggatgcagct 
aaaggcattg 
ttaagcattt 
tcataaatct 
agatcttctt 
tcccatcctg 
tgagcagtcc 
ttcttggccg 



acccctttgg 
gagacccctt 
tccaagaaag 
acgtgtcctg 
cgctacctcc 
ctgaggctga 
ggctcttttt 
ttggggtaag 
aatggttgtg 
atgtaggaag 
gtgctacagg 
ttctgtcagc 
aagagaagtg 
atcagatttg 
ccacagacaa 
aggaggtttg 
ggcatggtgg 
ggaacctggg 
caacaagagc 
ttaaaacaag 
accttttgtt 
taattgccgt 
aagggcccat 
ctctgccttc 
acgctttcag 
ctggtcccgc 
gaacgatgga 
tgaccctttg 
tgggtgggaa 
ctttctgfctc 
ctttcatctg 
cagctgcctc 
tttactgaaa 
gaaacttctt 
ctactccttg 
aggaatgggc 
ctgccaggac 
ccaaaggcct 
gaagttaaca 
aatccccggc 
ctgctgtggt 
gcattctctg 
tctctcagga 
tgaatgaggc 
taacataggc 
ccctgctgtg 
ttttacagaa 
ctctaaatgc 
gtgcttttac 
gaaagccaga 
aggtgaggga 
gggaggtcag 
tccaattcat 
atttttttgg 
gagaaagaga 
gatttcaaag 
gaggtggaac 
cttcccttat 
actaggtgtc 
ctgccgctgt 
tgggggagta 
tatctcgacc 
actctatagt 
ggcacggtgg 



gtgctaagac 
gcagggagct 
atggggctgt 
aggaagagga 
tggtgagaag 
ggtgggagca 
cccattgagg 

aggggagggc 

gcaggggagg 
gatgctccat 
ggcaacctgg 
gcgctcaacc 
tgaacactaa 
tctgtttttc 
cactgagggg 
ggtccaagtt 
tgcatgtctg 
agacagaggt 
gaaactccgc 
accatgagtt 
agggctggaa 
ggatcagcat 
tgcccggtgg 
cctctgcccc 
tgggttgacg 
ggtggtcact 
aaggtctggg 
tgggagaggc 
aagaggccct 
tgccttcaca 
ttcctactga 
ccctcctctg 
tggatttatt 
ccttatcatc 
aggaggagga 
tcagctagtt 
ctttcaaggt 
tttctctctc 
tgtccctggg 
ccctcaagtg 
agttggcctt 
ctccttccta 
tctccttctg 
ctgggaatac 
acaaaaacat 
gcaggcacct 
ccaggctcag 
aactcatatt 
agctggctac 
ctgtctcact 
gctttaatca 
cagaagaaaa 
agatcttcat 
agtgtggtat 
gagaaatcaa 
atggtggact 
tttcactcct 
ttcaatatta 
aggtactgtg 
gggtgtgtac 
atatcaagaa 
taggcatgtt 
gtgctttata 
ctcatgtctg 



gctgcctgag 

ggaggaagag 

tgagtctatc 

cacagtaaaa 

tcttcagacg 

tggaacgggt 

gccctggggg 

cacctacaaa 

gagggaatga 

tctttgtctt 

tttccatcca 

agggtcaagt 

atggggtgca 

tcctgtataa 

gctgcctggg 

cctttccatc 

taatcctagc 

tgcggtgagc 

ctcaaaaata 

tgtttcctca 

ggacaagcct 

ttcagatgac 

gcagcagaaa 

atcccttctt 

gcagccgctg 

gcgtggccct 

agatgggaag 

ttcctgggtc 

cctgtgggtt 

ggaggccact 

gctggtccca 

cttgccatcc 

ttctcctctg 

tctccccaca 

tggagggtgg 

ctcttcatag 

tgtaggtaga 

tctctccaac 

aacagtccac 

gaataaatct 

attccatgca 

ctctctgcaa 

gggaaacctg 

aagacatagt 

ctactagttg 

tgcctagatg 

agaagttagg 

gaaattcaac 

tctcccctta 

cttgggttca 

gaggggaggg 

taattcaatg 

atgggattag 

agagaaggga 

actcaggttc 

atgggttcat 

tgccctcttg 

acaacaaagg 

ctaaatgtta 

tttgacagaa 

ggggcacgaa 

tacattggtg 

ggagcacatt 

taatcccagc 



gatgaggaga 

gaggagtggg 

tcagtgccag 

gtggtgggca 

tttagtcaag 

gtgggatatg 

tgtcagtaga 

cagaagi:tgc 

attgtgggga 

ttatcctgcc 

caacttcaat 

ctggattgga 

cctgctgatc 

tctccagaag 

cattcaggga 

tctcactatc 

tacttgggag 

cgagatggag 

aataaataaa 

tctctaggat 

gtcactggga 

taggacagtt 

gagctggcag 

cccctcttgc 

gaactttgca 

gtgtacccga 

tgccccaagg 

ttttatatac 

gagttgggat 

ggcgtcgagc 

gccagcagtt 

ctccctccac 

atcgcggatc 
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agttgggtct 

aacacctgat 

ctcccaatgg 

cccagaactg 

aacattcagg 

aacatgtatt 

tttttatggg 

ctcccaaata 

atataagaca 

cctctggcac 

fctatcgctta 

acctcatgtg 

atctgtcaaa 

tctgctccaa 

fcggtcacacg 

tcttgggaca 

aggaaggcat 

ttctaaagcc 

gggctcagag 

tgctacttct 

atttcaacta 

ccctgtggtg 

gactttatat 

gtaataatat 

ggtgaacttt 

gtttgacttc 

ggaagctgca 

cattcactgt 
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actttgggag 
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gccgagacag 
aaccccgtct 
tcccagctac 
agtgagccga 
aaaagaaaga 
tgggctaata 
ctaagctctg 
cactgtgcag 
caagggccac 
gatnnnnnnn 
nnnnnnnnnn 
agagttatgt 
gttttcttcc 
ttttcccaaa 
gtttcttgta 
attgtgtagg 
gagactggga 
cagaatcatg 
gaagaatcaa 
tcacaagaat 
cccacccaca 
cacagccaaa 
acactattct 
gcagggacct 
gttgccacag 
cagtagtgtt 
cctgtcagga 
gagtcagtgg 
agcattattg 
gtttctgttg 
cttcgctgac 
ggttcctcca 
ctgctgtgcc 
accttcatcc 
acccttgcgt 
aagctggagg 
actgggtttg 
ttcatgcact 
ctcgtattca 
tgtcttccct 
tgtctggcat 
ggtcctgtct 
gtctgagcag 
caggggagct 
gggtccctga 
tttggcaagc 
cccatcttca 
aggatgggga 
gtgatggcct 
ttggggcaaa 
ctcacgaaac 
caggcccacc 
caggacctct 
ctctaggagg 
ttacgtgttc 
tactgcacag 
gtttaaatcc 
cacgatcttg 
ctcgatctcc 
ggcatgtgcc 
gctatgttgc 
tccaaaatgc 
atctgaggcg 
gctggtgcct 



gcggatcacc 
ctactaaaaa 
tcaggaggtt 
gatcgtgcca 
aagagatttt 
ttccaaaata 
tgaaactgtt 
agcacagagt 
cctcactcaa 
nnnnnnnnnn 
nnnnnnnnnn 
tcattgattt 
tattattatt 
ggtaaaactg 
ccccttttcc 
tgtatataca 
agaaaaagag 
gcaggaggtg 
aagtggaaac 
agcgtgggaa 
atacgtggga 
ccatatcaat 
gcagagtttt 
ctcatgcctt 
cattcctgta 
acaaagagga 
gggtattgca 
ggttggagag 
atgagtttga 
ggacagtgga 
ttctgttggg 
ctttggcggt 
tgggcttcgc 
tgcaagtgat 
aagtggcctt 
taaagaaatt 
ggggcaggtg 
tgagccttta 
tcctctgagg 
tctttgtgcc 
gccatccatt 
gtttttccag 
agctgaagag 
ggggacaagc 
agctcactgg 
tctttgggct 
ccctcatcaa 
tccctggcag 
gggtatgtag 
agtatttgtg 
ttcaagcaaa 
ctggcctctc 
gtgctttcac 
tggtcgtgag 
agaggcacca 
ctctatttgc 
atcacttgga 
ctttgtcacc 
tggactcaag 
accatgccaa 
ctgggctggt 
tgtgattaca 
aatgacttga 
ctaccagctt 



tgaggtcggg 
tacaaaaaaa 
gaggctggag 
ttgcactcca 
ttctttttct 
tgggtttgtg 
tatctatggt 
aggggctcaa 
ctcatgtgcc 
nnnnnnnnnn 
nnnnnnnnnn 
cattttattt 
ttattttatt 
tgaaacaaga 
tcttctttag 
tgtattagtc 
gtttaattgg 
aaaggcactt 
ccctgataaa 
agactggccc 
attctgggag 
acatttccct 
tttttttttt 
ttaaccactg 
ctgatgacta 
tctggctaca 
gtaatccatg 
aagtgggcac 
ggtggggcat 
tggatagtgg 
ttggcctaca 
ggccctctgc 
cctctgtgcc 
cagccgctcc 
ggggcgggct 
gggaggcaag 
tggagggggt 
catctgcggt 
cccacttctg 
cttagtactc 
tacatgcctg 
ggccccgcat 
ggaaaggcca 
ggccgacagc 
aacccctctg 
ggtgatggcc 
aggctccctt 
gaggcctggg 
catggtgcag 
tgtgtgggga 
atcctgagac 
ccctaggccc 
ctctgtgtcc 
tctcctgctc 
gctaaggagc 
ctcatttttt 
gaaaagaaag 
caggctggag 
cgatcctctc 
gcttatttta 
tttgaattcc 
ggcgtgaacc 
ttgfcgacatc 
cccatgtgac 



agtttgagac 
ttagccgggt 
aatcgcttga 
gcctgggcaa 
taaaaagtaa 
tgtgaatttt 
tctttatcat 
tcaacagtgc 
aggcatagtt 
nnnnnnnnnn 
nnnnnnnnnn 
tgtttctgat 
tgtaaaacat 
tatattcaaa 
gtaaccattt 
tgttttcatg 
acttacagtt 
cttacacggt 
cccatcagat 
ccatgattca 
atataattca 
ctcttttaga 
gatgtaactc 
cctgggtctc 
tttggattgt 
tgttcagggt 
actgagttaa- 
atttgaatga 
gtggggaaag 
ctcctcccct 
gagagcttct 
tcgacggtgc 
tcagtcccca 
ttcctctatg 
ctgtggagac 
gcggggcctg 
gagaccagag 
gccctctccc 
tttcagttct 
tagtcttact 
tcttttcttt 
gtgccttctt 
gacaggtgtg 
ccccagaggc 
gtttctctcc 
ttgtcggctg 
cagaatgacc 
ccttaggcct 
cttcccaaag 
aacagacatt 
ctcaaaggtg 
acacagggtt 
ttacacctgg 
tttgcagaaa 
aaaagtcaac 
atttttaaag 
actaaatgtt 
tgcagtggca 
atctcagcct 
ttttattttt 
tggcctcaag 
actgtgcatg 
aggtgaccta 
cttgaacatg 



ctgcctgacc 
gtggtggcac 
acctgggagg 
caagagcgaa 
aaatcatgaa 
cctctccagt 
tgaatccttg 
actcattgct 
ctgagagctt 
nnnnnnnnnn 
nnnggtcact 
ttttaaagat 
ttacatatca 
gaagtttact 
ttattttttt 
ctgctgataa 
ccacatggct 
ggtggcaaga 
ctcgtgagat 
gttaccctcc 
acgtgagatt 
taaaaggtag 
tatcctgagg 
cattacatgg 
ttccagtctt 
ggggaggggc 
tggtagttta 
tatgtaggag 
gattcgagga 
ttttccaatc 
ttttcctctc 
cttcgctggc 
tcctccctct 
ggagcaacgc 
ggacacactg 
gaggcagtca 
gtggtgggaa 
tctgttttct 
ccagggaaga 
tcctttgcta 
cctggtgcag 
ctgggttctg 
gttggagggc 
caggcttctg 
ccagtttccc 
tggtgtctct 
cattttacgt 
tggctgcccc 
ggtctgtgtt 
ctggagtagg 
tttcctgctt 
tccacagttg 
agggatgctc 
ctgaggctca 
tttgaattct 
cagcaaatct 
ttttgttttt 
caatctcggc 
cctgagtagc 
ttgatagaca 
cgatccaccc 
gccaaaagag 
gtaatcagct 
tcattgaatg 



aacatggaga 
atgcctgtaa 
cgaaggttgc 
actccgtctc 
ataaggggac 
aagatactaa 
gagttcctta 
ttttcataga 
tgcttaagct 
nnnnnnnnnn 
aatttggtat 
tgttttactt 
gacatttaca 
ttccctctct 
aaatataaac 
agacctatct 
ggcaaggcct 
gaaaaatgag 
ttattcacta 
cccactgggt 
tgggtgggga 
tatactgtat 
gtgctctgta 
ctgcagcata 
ttgctattac 
agatgtgtag 
aagctaggat 
gtgaatgatc 
tgactcccag 
ttccttggcc 
tgttcgccca 
cctgacatcc 
ccagtacctc 
ggccttcctc 
gggcaaagag 
ggtgcgggag 
ggatagaaca 
acctggtgaa 
aatggaaaag 
gtgcgtgcat 
cctgcatgag 
tgggtcaaat 
aggcctagga 
cttggaggga 
ttcagagcac 
gctccagttc 
gagtactggg 
aaatctggct 
attcaagtat 
gtggggaatt 
gtggtgagtg 
gccccaggga 
tgaggtcctg 
aagaggttac 
gtgttttgac 
tagaatagga 
gttttggaga 
tcactgcagc 
tgacactaca 
ctggggtttc 
gtctctgcct 
taaacttgaa 
gtgtattcta 
ctcgctaggc 
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ctctgtttct 
agaatgaaaa 
gaatataaac 
ttcattgagc 
ctgcaggtga 
gtatatcggg 
cctcactttt 
gtcccaaaag 
taaagacaca 
tatcacacag 
ccttgcactt 
tgtcaactat 
cacgggttaa 
tctccctcaa 
acagggttcc 
cctggtctgg 
aaagttcttc 
ccccaagggc 
ttggggatgt 
ctctcgggaa 
atctctctag 
gtatgtttct 
agtgtggcag 
cctgggatgt 
cacatcctta 
gctggatgga 
agccccttaa 
atctctgatg 
cacatgaggg 
gagaagttga 
cacctgctgt 
gggggatggg 
aatgtttatt 
tgcagaaggg 
aggggccagg 
cgtttaaatc 
cccactgtcc 
tgatctcaac 
ctgtggacca 
gctcaccctg 
atcttctcca 
acacccttcc 
gcagcatact 
atccacatgg 
ttttggcctc 
gtgtggtgca 
ctctgtgact 
tctgggctgt 
gatcacagct 
agttggtgaa 
agtactgaca 
ccagacacaa 
tggctcatgc 
aggagttcgt 
agtgtagtgg 
tgagcctggc 
tgacacagtg 
ttccagaata 
gtggaggcca 
gggtggactt 
atgcctggat 
gggaagatgg 
tctgccctga 
cactctgtta 



ttatctgtga 
gtaactgaaa 
tcggtagact 
aacagttctg 
atgtgatgtt 
aatgccgtac 
cagccccgag 
actttgccta 
caaggacgtc 
actcaaagca 
agccttggca 
ttctttaggc 
attttcaggt 
gagtgcagcc 
ctgggcccag 
tctcagaatc 
ctggagcaga 
catgggagcc 
tttcaagaag 
taattttata 
gctttgggcc 
gaggaggccc 
ctggacccag 
gggcccagag 
actaacttta 
atcagaaatt 
ccactctgag 
gcgattgtga 
tttgctcctg 
gggtgcagga 
ctcattccca 
ggtggaaggc 
gagcacctgc 
tctgaaaccc 
agcctgggtc 
taatggatgg 
acaggacaaa 
ctctcccctt 
agattcccag 
cctcagcttg 
aaaatcatta 
tggtgactgg 
ccaggaggca 
taccagctaa 
agaaccaatt 
gtggaaaaaa 
ttggatggac 
tgtaaggatt 
agcattacag 
cggacaaagg 
cctgctacaa 
gaaactgcta 
ctgtaatccc 
gaccagcctg 
catgcacctg 
aggtggaggt 
aggttccgtc 
ggccaatctg 
tgggtagtga 
atggtatgtt 
tccccaggaa 
gcgagaggag 
gagccttcct 
ccctggctgg 



aatgggcttg 
gtccttcctc 
tctggtccct 
ttgttcagaa 
catgcttgcc 
ttggaaagaa 
gatggttttg 
tcccagcaaa 
tgcgcagcaa 
aaggcatgtg 
gacccttgac 
ctgtttggct 
gaagagtgag 
ttggctagag 
tctcagccca 
acttttccca 
tgtcctagaa 
agctgacctg 
ggatataagt 
gaaaaataaa 
ggggctgtgt 
taccctggca 
aggtggggca 
acttcctccc 
gcagaaccct 
cctagtttga 
tcactagttc 
gaatgtaaag 
tgtttactca 
ggagagatgg 
ccttgctagg 
agggggtggc 
cacgtgtcag 
agctcttcct 
tgagggcgga 
atttggtctt 
aatccagatg 
ccctctttac 
tctctgggtc 
tgaaatccat 
ctcaggcttt 
catttgcctc 
aggtctgttc 
ttcttgttga 
agaaggagcc 
acccttctgg 
cacttctttg 
acaaattagg 
atgatatatt 
aaatgtacca 
cacggatgaa 
atgattccat 
agcactttgg 
gccaacatgg 
taatcccagc 
tgcagtgagc 
tcaaaaaaaa 
tagaggcaga 
tggctaatgg 
aattatacct 
gtgtacagta 
agtaagtggg 
cctgcaactt 
agtgcagtgg 



atattcctcc 
cagggcacca 
tcattggtca 
ttcctggatt 
attcttctga 
agtccctctg 
ttcatcttcc 
acacacacac 
gaaaagaatc 
gaacttcttt 
tccaggggag 
ccgaagccta 
gttgtcatgg 
aactcacagc 
gccactgatg 
tctgtaaaat 
ggttttagga 
cttgaccgaa 
tatttacttt 
gcttctgtgt 
gtgtgtgtgt 
tgagagggta 
ggaaccctga 
ccaggaaccc 
actcctcact 
gtttcaattc 
cccacctgca- 
tcattgcaac 
tgacccttgg 
caggtgggtg 
agagggacta 
aggcaggttc 
gccctgtcct 
tcttcctgtg 
caccaaagtt 
gttaccctgc 
ctagcctggc 
ctcctaccaa 
tgtgtgtgca 
ctggtctgct 
ccagcatgtc 
cacatcatga 
tcgcctggct 
atgaatgatc 
agaaaaacac 
aaatctcagc 
tcagtatggt 
tcgagtgctt 
aaagccaaaa 
tatttgggat 
tcttgaaaac 
ttaaatatga 
gatgccaagg 
cgaaaccccg 
tacttggttg 
caagatcgtg 
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aagtagattc 
ggaaggggtt 
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gacttctgtg 
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tatttattta 
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gggaatctgg 
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ctctgcttcc 
acacaccccc 
tgcccctcag 
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tgcctagcac 
ggaggacggg 
ggatgggaga 
ggaaagaaca 
atccatccat 
gggtgctggg 
gatgtcgggg 
ctagtggtgt 
tcaaaagctt 
attcaaggct 
cageggggea 
ccagttcctc 
gggatcctgc 
tgagtccctg 
ccctcccacc 
ctaattaatc 
gttgaatgag 
atgggggtgg 
tctgtcactt 
gggagaaata 
ggcatgtggt 
aaagatgect 
attatttggc 
attagactaa 
aatateggge 
tgggcagatc 
tctctactaa 
gctgaggcac 
ccactgcact 
ggaaaaagaa 
atgattgggt 
tcttttgggg 
gttatttaaa 
agaatcagaa 
acgtgcgcaa 
tttattttga 
gctcactgca 
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tctgctctcc 
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aggacctcgt 
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ggaggaagtg 
gatcctctgc 
gcttcctggc 
agaggagcag 
ttggcctgac 
tttggtgttg 
agtcaggeca 
cagactatct 
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tactgtttcc 
gtgtgtatgt 
ctacacatct 
ccccgctggt 
tcttcctctc 
agctagaagc 
cagctgggca 
aatcatttct 
atggtaggag 
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atctggggca 
gtgggaggca 
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gctataaaga 
tgtaatttcc 
ctattagcag 
tcagcagctc 
gtcactagtg 
gagcccaccc 
tgcgtgggtg 
tcaaaatgtc 
gcacttggtc 
ccttgcctgg 
tgtgcttacc 
tggattcttg 
gggaggtgca 
actttgtcag 
gacatgcctc 
gggttgaaca 
aatgtccacc 
aatcaaaaaa 
gtgaaagaag 
cagggtgcag 
acttgaggee 
aaattagccg 
aagaattggt 
ccagcctgga 
aaaaagaaat 
aggcctgggt 
tgatgaaaat 
ggaagaaaag 
atgatttctg 
ctctcatcgt 
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gccaggctca 
caccacatct 
cactctgttg 
tcccaaagtg 
cctcaagtgg 
tcagcaacgg 
tggaggtgaa 
ggagaacaca 
ccttttaagc 
aaagtcaggt 
tcccaaggtc 
tcttcttctt 
tcttcttctt 
tcttcttcct 
cttcttcttc 
tcttgctcta 
ctggccttaa 
caccatgccc 
aaataaagag 
tggttaacat 
tgatctcata 
ctgctcacaa 
tccctttccc 
ttggtcagca 
gtgggcttat 
tcactatcca 
aggtgctgct 
gcgagaggga 
gagccttcct 
ctcactctgt 
ctcccaggct 
tgtctaggag 
ggtgtgacat 
catccaggct 
aatgtctgag 
gattttgagg 
tctctgtcgc 
cggttcaagc 
ccacacccgg 
tggtcttgat 
caggcatgag 
tgctcacagg 
gctggctaga 
tgtgacctta 
cagtttgttg 
ctggagatgg 
gattcctggt 
gagttgtaag 
agtgagtgat 
gttattagta 
ctggaatgcc 
tgtctccacc 
gcaaaaaaaa 
atcaatcttc 
tccaagcaca 
gggctgggga 
tgtccttgta 
tcagagagta 
taggtccttc 
tcaagtacgc 
cgctctgtca 
tcaagcaatt 
gccagctaat 
ggtctcaaac 



agcaatcctc 
ggctaatctt 
ctcaggctgg 
ttgggattat 
aaccaatgag 
gcttccaatg 
ttggccccca 
tgccgtgtaa 
ctattcagcc 
tcacccatta 
acattcattt 



cttcttcttc 
cttcttcttc 
cttcttttct 
ttcttcctct 
ttgcccaggt 
gcaatccccc 
agcctgcatt 
ggacatgaaa 
gttggtatgt 
agcttttcct 
ccccatctcg 
cttccctgcc 
gattggagag 
cacaaaagag 
taatctggtg 
aagaacactt 
gtaagcagga 
cctgccacgt 
cacccaggct 
caagcagtcc 
ccctttttcc 
gggcacagca 
tccaggcata 
tgtatcttaa 
tgttttgttt 
caggctggag 
gattctcctg 
ctaatttttg 
ctcttgacct 
ccaccgtggc 
cccctcagca 
gcagtagggt 
accaccctga 
caaggactag 
actgtccaag 
acctccgctg 
gattgagtgg 
caatataaac 
acatctgtgt 
actattatag 
gaacaaaaca 
aaaaagactt 
atattcaatt 
cagaaatgtc 
agagtgcagg 
ggagtcatga 
ataaggataa 
attaatggca 
tacatgacag 
cccaggctgg 
ctcctgtctc 
tttttttggt 
tcctgacctt 



ctgtttgagc 
ttatttattt 
tgtcaaacgc 
gggtgtgagc 
gaagcaagca 
caacccttct 
tcacaccccc 
gggacagcca 
cagggaagtg 
gcacattgtg 
agacattacc 
ttcttcttct 
ttcttcttct 
tcttcctctt 
tcttctttct 
tgaatgcagc 
tgcctcggcc 
tattctcttg 
aagacgcatc 
tccctcccgt 
tatgtaatca 
tggttctgcc 
tgtgtggggg 
tgcaaataat 
caccttggga 
ctaactgtac 
ctattctgtg 
gaaacaaccc 
ggttttgttt 
ggagtgtagt 
tccccaaatt 
tgccagcccc 
gtgagtcatt 
agatccagtc 
acccaggacc 
gtttgtttgt 
tgcagtggca 
tctcagcctc 
tatttttaat 
cgtgatcctc 
ccgcccaatt 
tattctttcc 
gtgtgggaaa 
acctcagttt 
agagtaacct 
accaacttcc 
cttcatctgt 
ctggatgaac 
cacctggctt 
gcccttcaga 
ttcaggatag 
acacaagttc 
tcgagttacc 
tttgcgatac 
aattaccagg 
cagacagatc 
agcatcttag 
gtaaggctct 
gctactaatt 
tgaaacagtt 
agtgcagtgg 
agcctcccga 
atttttagta 
gtgatctgcc 



tcctgagtag 
atttatttat 
ctggactcaa 
cactgtacct 
actcagagct 
ctcctgtaac 
acagtgccaa 
acagcataaa 
catatgatca 
gggctggtat 
tttactttac 
tcttcttctt 
tcctcttctt 
cttcctcttc 
tcttcttctt 
atcatcatac 
tccaaaagtg 
taagaaagat 
accattaatt 
catttgactg 
aatagtagcc 
acgccttgtt 
tcctagatga 
aaaaacactc 
tattccagag 
tttagctgaa 
agaatcagag 
acaggcacag 
gtttgtttgt 
ggcaagatca 
caaagcttgg 
tcaggggatt 
cctctgacat 
ctctggtgat 
atggcttgtg 
ttgtttgttt 
cgatctcagc 
ctgagtagct 
agagactgtg 
ccgcctcggc 
ttgagttttt 
tgggtccagt 
gctctgggct 
cctcacctgt 
tgggaataaa 
agttcctccc 
aaaatggagt 
gtcaagcact 
gtagcaggtg 
gcgtgcacca 
aaaacctccc 
tttattactt 
ccaaatctta 
ttttgtctcc 
ggcagggcta 
tgggtcctgt 
tgcctttgtt 
acgtgaaaag 
tttattacat 
tttttgtttg 
cacgatcttg 
gtagctggga 
gagacggggt 
caactcagac 



ctgggactac 
ttatagagat 
gtgatcctcc 
ggcacctcct 
ttcacaagtt 
cagcctcagt 
gctgggccct 
ataggaattg 
gccccatttc 
ttaaaccagg 
atttcttctt 
cttcttcttc 
cctcttcttc 
ttcctcttct 
cttttttttt 
ctaaatgcag 
ccaagatttc 
atcatttaaa 
ggaccactca 
gatgtatgtg 
aaaaacatga 
tatccccatc 
cggtgagcca 
agaaggcgag 
aatgacctca 
ggtgctggca 
atgatttcta 
ctctcatctt 
ttgtttgttt 
tggctcactg 
agtgatggtc 
gatgactctc 
tctttgggaa 
aaggagttca 
ttcacaccag 
gttttttgag 
tcactgcaac 
gggactacag 
tttcaccatg 
ctcccaaaat 
atgttctaat 
gtcacctccc 
ttgcaggcac 
aatggaaata 
aggtagcagc 
cacacaagct 
aacaatagga 
tcaaagggga 
tgctgtgtgt 
cacttcacac 
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ggaatgggaa 
agccaaagtc 
ccagcagtca 
tgaattcctt 
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tgctacctat 
tgctcggccc 
acgcaaaatc 
tttgttttga 
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cctggcatgt 
ggctgcaggt 
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cgctttagct 
aacttcaaag 
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atggagagaa 
tcagagccct 
ttagattggg 
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gacagagtct 
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tggcatgagc 
ctacccaacc 
tttctgtact 
ttgtcacagt 
ccattaggct 
ctgtatcctg 
catgagagag 
aatcccagct 
aatgagcact 
gagttctttt 
ttccaagcct 
gttgggggca 
cctcctcaag 
caggagactg 
agaatggaac 
ttttaacatc 
ttacagggta 
cacaggggaa 
acttacctgg 
gtttaatggc 
atacttagca 
aatttctgcc 
aaattgcttt 
ttccagaact 
ggggaaacag 
cctgattagt 
tgatccttgg 
ggcattctgg 
nnnnnnnnnn 
nnnnnnnnnn 
tgtctcaaaa 
caggctggag 
tgatcctcct 
cagaccatta 
atggataaat 
agtcctgata 
atcacaaaag 
ctaactttta 
ggggcagagg 
ataaagaaac 
actgttaaat 
aagctgtttt 
agaagcagga 
tgtcagacct 
caccagcacc 
gctcatgtct 
gfcgttcgaga 
tagccagacg 
aatgcactcc 
cataggggac 
tcatagggga 
actcttatgt 
cctggctgcc 
tgggagagtc 
tgcggcagat 
tccacccctc 
ttgaaacgag 
ctcacagcac 
gctcaatgtt 
tcaaatccaa 
gataaaaaga 
gcacagctct 
gcagtggtaa 
ccaaagagtg 



caccgcacct 
tatagctaac 
gttacctagt 
tgcagtttaa 
ttcatttggg 
agaactttgt 
agagagagag 
ctgagcttct 
tatgcagaag 
ccaagctcca 
tgctttttct 
ccatgtttca 
ccctgcaaca 
gggggatgag 
cagggtcttt 
cgaggaccac 
agttacatta 
gtgagtcacg 
aaaatgactt 
acttgtgtcc 
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tcttcattta 
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tgttctgaaa 
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aggtggagaa 
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ccagcctggc 
tgctcgcgtg 
agcctgggtg 
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gtatcatttc 
aaccccacct 
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cctggggtgg 
aaggaatcgg 
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gctaaacttc 
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ctccttccca 
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ttagaattcc 
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caaggcagct 
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cacacccagc 
caactgacaa 
aataaaaaga 
aaagcagcca 
ataggtaaat 
ctagtgcttg 
acttttgggg 
acatactaaa 
tatcttaata 
ccctctaaag 
catataaatt 
aaagaaacac 
aagagtggtg 
ttgaggtcag 
atacaaaaat 
agatcatgcc 
aaaaaagtta 
caggataaac 
tcttgcagcc 
gccatcccag 
ccaggaggca 
catggctgga 
cctccccatc 
caggaaggag 
actgcagact 
ggcaactgaa 
aagctgggat 
ggggaaaggg 
gagcttgtca 
ctagacacca 
gttactgctc 
gtgattgaag 
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tggaaaccag 
caaatgagta 
cctttatatt 
aaattcctct 
aggactctgc 
ccactggatt 
gaagaggtcg 
ctaacctttt 
attttcaggc 
cagggagccc 
ggccatccac 
cctgcacaga 
catcatggtg 
catagagccc 
ccagaggtgc 
gtctgcgcgc 
agggccgggc 
gcagagcgcg 
ctagaagctg 
gcccctccaa 
gtatcaggtg 
ggagacccag 
gcagggctcg 
gagggggctc 
gcggatgggc 
cgccgcggcc 
agcgacaact 
agaggggagg 
ggggctttgg 
aagcagagga 
gaaaagaaaa 
gcaagtaacg 
tctcagccga 
ctccggtcac 
tcgcggcgcg 
tctgcgtgca 
cgctgtctcc 
agctgggctt 
ctgcgccccg 
acgcgcacgt 
tacatttcca 
gccctctgta 
cagggccgcc 
agagtggcac 
tgactaccct 
aattcggtta 
aaatgctaag 
tttcagaaga 
ctctcagcct 
taatgagaag 
atgctgagct 
tttacctacg 
aaaatttttt 
cccagaactt 
ctggacaaca 
ggtgtgcacc 
gagatggagg 
aagactctgt 
taaaagggaa 
agacagggca 
atttaaacct 
tagaggtagg 
ccagaaggag 
tggccaaagc 



agactctcca 
atttgaaaac 
catactccac 
cccagcacgt 
ctattagtcc 
agataaatga 
agaagtgagg 
ctggcttcca 
atacctgggc 
cagcccagga 
cagcgcctcc 
aacagagcgc 
gcacccgaga 
tgggcttccc 
acgcggcacg 
tgcagcttcc 
tgctggggag 
ctctgaagca 
gcaggaagaa 
gacctctcct 
acccacgacc 
gacgggctct 
gggcaccagg 
ggacttcggc 
tggcgggcgg 
gctgcagcct 
gggatgagac 
aagtaccagt 
gttcaagcgc 
agtgcagcga 
gggaaggacg 
gtctttcctt 
cgcatagccg 
cggcaaatga 
tccggagtcc 
ggacccggcc 
tcggaagaag 
ggatgaagtg 
aggtgggggc 
gactcgcccg 
ccctggaccc 
tccttgttca 
ttgggtgggg 
gggtttcttt 
gggfctaccta 
aaatggaacc 
catagcattt 
cctggtgatc 
ccctgagcct 
taatctcatt 
taatacctag 
taacaaacct 
taaaaaaaga 
tgggaggcca 
tggtgaaacc 
tgcagtctca 
ctgcagtgag 
ctcaaaaaca 
cccctgaaca 
ctttccaaaa 
catagctttt 
atgcagagtc 
aatggcaaga 
acatgtcagg 



caatgccctc 
ctgggggctc 
tttgtgcagc 
ctccttcaaa 
cagggtctgg 
aacagtgggc 
ctgtccaact 
cctgcccctt 
acgtgctgga 
gtacagcaga 
ggtacgcctg 
tgggtgaagg 
cctctcgggc 
agccggtgcc 
gggctcccgc 
tagcaggctg 
aaagtccgca 
ccgcgggccc 
gggcaagttc 
tccctctggg 
ctacagtctc 
cctcggfctcc 
ctttgcacct 
aggaagtctg 
gtgtgtttac 
cggcgggagg 
cgaggaaagc 
cacttcttcc 
tccaggaggt 
gcggggaccc 
cgcttggcga 
gggaagcctg 
gagaccctac 
gcagccagca 
cgtgggccca 
gccacggagc 
cgggggaact 
gatctgcgga 
gggggctccc 
ctggtcccgg 
ccatcttctc 
aaacaaagtg 
ttttgaactg 
ggagttagtc 
gggcaagtta 
tacctaactg 
ggcatagtaa 
ggatgcttcc 
cagtatcctc 
acatgtgagt 
gtgatgggtt 
gcacattctg 
gtgatactgg 
aggcaggagg 
ccgtctctac 
gctacccagc 
ccaagatcat 
aaacaagaat 
gatgagccac 
taaaaatact 
cacagaggga 
aggtcggcct 
gcaaaggctt 
ctctgatggt 



ctagagtaaa 
ccagctcctg 
tctcccttgt 
gagctctaga 
atgaccagca 
agagtgccac 
gctgaccctg 
tgccatccct 
atagaagccc 
gaagaagagg 
ttgcagcgtg 
gccccccagt 
cagcccgcga 
aaggagctgg 
tgagccacta 
ccgggttctc 
tctgcccagg 
atgtccggac 
aaggctaccc 
gccgggcgac 
tcgggccaag 
ctcctccccc 
cggaacccgc 
gcggctgctg 
caaagggagg 
agagggaacg 
ggagaggaga 
agggggactc 
ccgctggaac 
gggaggaaga 
gatgggacac 
gcagtgtcgg 
gcgcgccccc 
gctgcggacg 
gccctgagcc 
ttcagcctga 
gggaacccgc 
gttgatagtt 
ttcttttccc 
acactctccc 
ccccaagccc 
cgattcggct 
tgcagctgga 
agatcgaggt 
cctctctgag 
cccaaaggaa 
gcataatgtt 
agatcaacaa 
atctgtaaaa 
ttaattgtgt 
gataggtgca 
cacatgtacc 
tggccaggtg 
atcgcttgag 
aaaaaagaaa 
aggctgaagt 
gccactacac 
gactacagaa 
caagccaaga 
gttattaatt 
gaaactgcag 
ggcatttaag 
agctgtggga 
ttaacttctt 



tgaggctatg 
aaaagggaag 
cttatgatag 
cctgaggctg 
ggacacctgg 
ccaatctccc 
ctttctgtcc 
ccccccagcc 
tcgttcttca 
ttctccagca 
ggggccatgc 
ggccccaggg 
ggagcccctc 
ctccgcgcgc 
tcggaaacaa 
tcacccaggc 
tccccagagg 
tctcgcgcca 
tacgacccca 
agcaagccct 
ccaacagctg 
gcgcgcccct 
ttgcccccct 
actttataag 
gaaagagccc 
cgggcagcgc 
agggcaagaa 
ggtattctca 
tctggcaaac 
gaagagtcgg 
tgtgccgcgg 
cgggagccgg 
tccccgccca 
cctccgggag 
gcgccggcgc 
cagcccggtg 
cgggcgccag 
gtatttacac 
ctccccttag 
tctggcacag 
ccagactaac 
gaagccgact 
agcagtgttt 
ctgagtcttg 
cctcagcttc 
tcgcgattgt 
aattgttgct 
ttgattgact 
tagactacta 
gttaagagtg 
ataaaccacc 
ccagaactta 
tggtggttca 
ctcaggagtt 
aaaaaatagc 
gggaggatca 
tccagcctgg 
agctccaaga 
gaggaactaa 
cctcaagaca 
gcttgaagct 
tacggctcct 
atggcacaag 
aaaatgcaat 



taactttgtc 
gatgtggggc 
ccctattaag 
tcagaggctt 
cattcagtga 
cctgaagttt 
acctggccac 
cacccagccc 
gaatgatcaa 
cagccgtgca 
tggccccgag 
aagggtcctg 
atggaggccc 
actagcagtg 
ggaaggtcct 
cagggcgctc 
acagcaaggg 
ggaaagaccc 
tcttccagtt 
ccccctttcc 
ccacgtggag 
cactcactcc 
ccagccccgg 
ggcagcggtg 
cagctccccc 
gggggcgggg 
agacccagag 
tctgtgaaac 
gcgcagctct 
aggggtcaga 
gaccgcgggc 
cctcggtgtc 
cgctgctcac 
cgcaacgctt 
tggggtcttc 
gcctcgcctc 
aggtctgcga 
gcgtccggag 
gtcgagtttc 
ccccagcacc 
atcaggcagc 
gaccgcgatt 
tccgagaggc 
actttttaac 
ctcctcttta 
gatgcaggta 
gctgtcatta 
ccaggtaaat 
tggtgtggag 
ctgctaatgc 
atggcataca 
aaataaaaat 
tgcctgtaat 
cgagaccaac 
caggcatggt 
ctgagctgga 
gtgacagagt 
aggcctcaga 
tggctaccat 
tcatggtccc 
ggagcaaggt 
tccattcctc 
gagttctcgg 
actgcctccc 
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agaacttcca 
tctaagctag 
gtggctcatg 
caggagttca 
ctgagcatgg 
cctgagcctg 
ctgggtgaca 
tactcttgac 
caaggttttc 
acacttgctc 
ggacacaggc 
acctcaccta 
ctcccagtct 
tttttttttt 
ctcagctcac 
agtagctggg 
tcttttttcc 
tcggctcact 
gtagctgaga 
ggcggggttt 
cgttggcctc 
tattttttta 
ctcacgtgat 
cgctcagcct 
cttctctctg 
tactctgctt 
gcccccatcc 
tgatttgtgt 
ttgttgaata 
gtttggaatt 
ctcctggatt 
aggtttttgt 
aggagaccac 
tggctcactc 
aggagtttga 
ttatccaggc 
aatagcttga 
ggtacatgga 
aaacaccgca 
ggggaacatc 
cttagaggac 
aacctgcaca 
agaagaaaca 
ggtgactcag 
ccaggagttc 
aacaaacaaa 
ggctgaggtg 
gccactgcac 
taataaaaaa 
gatttatcaa 
acactactca 
cfcagaggctg 
ggctctgcag 
tcttctagtc 
ttatttagcc 
gtaatcccag 
cagcctgacc 
ggtggcatgt 
caggagacag 
agcaagactc 
gtgcagccat 
ctatagccca 
taacacaaaa 
caaacatccc 



gatcaaggtc 
gagagtcatg 
cctgtaatcc 
agacctgctt 
tggcatgtgc 
ggaggtccag 
gagaccctgt 
aaaaatgctt 
atggaccctt 
ttcctcatct 
ctcatttggt 
aggggcactc 
ttgcacctgc 
ttttgacaga 
tgcaacctct 
attacaggtg 
tgagacagag 
gcaagctcca 
ctacaggcgc 
caccatattg 
ccaaggtgct 
gtagagatgg 
ccacccacct 
ccttcttcat 
gaagctctac 
acctctatgg 
cgccccatga 
gcctgctgcc 
aatgaaaagg 
acctctggga 
ttgttttctt 
gtttaattaa 
cataatagtt 
ctgtaatctt 
gaccagcctc 
gtggtggtac 
atctgggagg 
tgaagctgga 
tgttctcact 
acacactagg 
gggacaatag 
ttctgcacat 
aacaaacaaa 
gtctataatc 
aaggccagcc 
caaaatagcc 
ggagaatcac 
tccacctgga 
cagagaagag 
cgttctcctc 
taggttaagt 
cttcacagct 
ctgattattt 
tgttaagtgc 
taacaaatag 
cactttggga 
aacatggtga 
gcctgtaatc 
aggctgcagt 
tgtctcaaaa 
ttcggaaaac 
gcaattccac 
atgtgtatac 
agcagtccat 



aaactcctca 
gatccctttg 
caacattttg 
ggtcaacata 
ctgtagtctc 
gaagctggag 
ctcaaaaaaa 
gtgtgaacat 
tccatgaggc 
caacccacat 
gtgttcattc 
agcacatgtg 
tattccttct 
ttccagctct 
gcctcccagg 
cacaccacca 
tctcactctg 
cctcccgggt 
gcaccaccat 
gacaggctgg 
gagattacag 
ggtttcacca 
tggccaccca 
ttctaatgta 
tctaggctgg 
aagtccccac 
gatgtacatc 
tatcacaggg 
gaatggtggg 
ggccaaggac 
ttggttttgt 
tggagcatat 
cacattttgc 
atcacttcgg 
accaacatgg 
atgcctgtaa 
cagaggttgc 
agccatcatc 
cataagtagg 
gcccgttggg 
gtgcagcaaa 
ggatcctgtt 
aataattccc 
ccaacacttt 
tgggcaacat 
aggagtggtg 
ttaagcctgg 
gcaacagagc 
gaaagacacc 
tacccccaaa 
ataacagcat 
taatggacaa 
gtgtgatttt 
tgaaaataaa 
ctattataaa 
ggccaaggtg 
aaccccgtct 
ccagctactc 
gagccaagat 
aaaaaaaaaa 
agtttggcag 
tcctaaatat 
aagtatttat 
cagctgatga 



gctctacaca 
agaatacaaa 
ggaggctgag 
gtgagacccc 
agttactggg 
tgagccgaca 
aaaaaaaaaa 
ggacacacac 
tctagtggtc 
ttccatggag 
actgctgtgt 
cagtgaagag 
tgtgggaatg 
gttgcccgag 
ttcaagcaat 
cctgtgagct 
ttgcccaggc 
tcaagtgatt 
gcctggctaa 
tctcgaactc 
gtgtgagcca 
tgttggccag 
aagtgttggg 
ctcatccttc 
attcagggcc 
actgatctag 
ttgtgggggc 
cctggcgcct 
aaaggaagct 
tttcagtctt 
ttgtttttaa 
attggaaagg 
agccataaaa 
gaggttgagg 
agaaacccca 
tcccagctac 
ggtgagccga 
ctcagcaaac 
agctgaacat 
gagtgggggt 
ccaccatgac 
ttgttttaag 
atttaaaaca 
gggaggccaa 
ggcaaaaccc 
gtgcatgcct 
gaggcggagg 
aagattttgt 
tgagatatat 
actaattcct 
tatcacattg 
gagcactgag 
gtccaaatca 
agtatccaat 
tatttaggct 
ggcagatcac 
ctactaaaaa 
aggaggctga 
cgtgccactg 
aatctctgca 
gtcctcaaat 
actacaccca 
agctgtatta 
acggagaaac 



gggggaccta 
agacagtggg 
gcaggaggat 
tatttctaca 
gaggctgaag 
tctcgccact 
gaagaaatat 
actcatcaac 
catggacccc 
ttggactgtic 
atcccagcac 
tcagtcagct 
acagatttcc 
ttggagtaca 
tctcatgcct 
gatatttttt 
tggagtgcag 
ctcctgcctc 
tttttgtatt 
ctgacctcgt 
ctgcactcgg 
gctggtcttg 
attacaggca 
acaactcagc 
ttgtccacat 
aataatcagc 
aggaaccacc 
aataagcttg 
gaaaaggtag 
gcagggtagg 
tgaagggtag 
acagagacct 
aggaatgagg 
caggcggatc 
tctctactaa 
tcaggaggct 
gatcgtgcca 
taacacagga 
tgaaaacaca 
tggggggtaa 
acacgtatac 
aagaaataaa 
ataaaaaata 
cgcgggcaga 
tgtctctaca 
gtcatcccag 
tagcagtgag 
ctctaaataa 
ttccatatct 
tcctaaactc 
gctgtcatgt 
acagggtggg 
ctccatctca 
tcaattcatt 
gggcacagtg 
ctgagtcagg 
tacaaaaatt 
ggcaggagaa 
cactctagcc 
tgaagaatgt 
agttaaacat 
agagaattga 
ttcattacag 
aaaatgtggt 



gagtcaactt 
cgcggtggca 
cacttgagcc 
aaaaattcag 
taggatgatc 
gcactccagc 
gttattgatc 
attcacattt 
catggctgga 
tgctgcatga 
ccagaacagc 
ggtttcacac 
ttcatttctt 
gtggcacgat 
cagcctccca 
tcttttcttt 
tggcgtgatc 
agcctcccaa 
ttttagtaga 
gatccgccca 
ccattttttg 
aactcttggc 
tgaaccactg 
tcaagtttca 
acccaccaaa 
ttagttttct 
acgtggtagg 
cggccaacat 
gctaaaatca 
taacaggaaa 
cgttatcgtc 
taaagcagtt 
ccaggcatgg 
acctgaggtc 
aaatacaaaa 
gaggcaggag 
ttgcattgca 
acagaaaacc 
tggacacaga 
ggggagggaa 
atatgtgaca 
gaaaaaacca 
ggccaggcat 
tctcttgagc 
aaaaatataa 
ctactcaggt 
ctgagatcgt 
ataaataaaa 
gaatcaatag 
tgttctcctg 
gggctcctgg 
tctaaatcct 
tgagcctcac 
atttaatgaa 
gctcacgcct 
agtttgagac 
agctgggtgt 
cgcttgaacc 
tgagcaacag 
acataaaatg 
agagttacca 
gaatatttgt 
ctaaaaagtg 
atacccatac 
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aatgtcatat 
cttgataatg 
gcatttatat 
tgccagagac 
tttcttcttg 
tgtgaatata 
gtctcgctct 
.ctgcctccca 
atgtgctacc 
gcccaggctg 
gctaggatta 
attgtatggt 
cctttgcgtg 
gagaacacag 
gaagcgccat 
atctagactg 
tgtcagaatc 
acaacaggag 
ataggatgca 
ggatcttctg 
cctccctaac 
ccagtggct't 
catccagcct 
ctaccatccc 
accgggtctt 
atcgagatgt 
aatgcacaaa 
ctctgtgtct 
cctcaagttc 
atcttttgtt 
ggctggagng 
agcaattctt 
cggctaattt 
gaactcctgg 
tgagccactg 
cctcttgtag 
catctgagtt 
caagggcctt 
acccaccatg 
accttctggg 
cctgctgaac 
ggctgacctc 
gattacaggc 
nnnnnnnnnn 
nnnnnnnnnn 
aaaaaaaaaa 
tagtccaagc 
tgcagtgagc 
cctaaaaata 
aaggatgaga 
ttagcaattg 
aatactatat 
ggaagagaaa 
tattaagtat 
aacccaaaag 
ctaccatttc 
gggcaagaaa 
aactctctag 
tcaatggcca 
acatgtcaaa 
aaggtcacta 
ttatttctgg 
aaaagtagaa 
agaactttgg 



tatttggcca 
ttattctaag 
gaagtgccca 
tggagggagg 
gggtaatgaa 
caacatgctg 
gtcacccagg 
ggctcaagca 
atgcctggac 
attttgaact 
caggtgtaag 
gtgcgaatta 
tgtcaggcac 
agcccagcag 
ggaagaaaga 
gggctcaggg 
agtagaattt 
gatgcggaga 
gagcctgtgg 
agacaataaa 
ttgcttgctt 
aagacacgaa 
atcagcgagc 
ccaagcccag 
cccaatccta 
agatttgatc 
ctcccaatga 
t-gccatggt 
ttcctgggct 
tgtttgtttg 
caatggtgca 
ctgccttagc 
ttgtattttc 
cctcaaatga 
tgcccaggca 
taactagcct 
tccctttgat 
tgcccacctt 
cctcctgcct 
aggcagtctt 
atctcttttt 
aaactcctgg 
atgagccact 
nnnnnnnnnn 
nnnnnnnnnn 
aaaaaaaaaa 
tacttgggag 
cgtgatcgta 
aagaagaaaa 
aaggaataag 
tattgttatc 
atgtgtatgt 
gggctacaaa 
ttatgatatt 
cagtgacaac 
caattaaaga 
cacacctagt 
aaataatgtg 
aaggtgagac 
tatattttaa 
gggcaccaac 
gtaacaaaat 
gatagaaata 
gaggccaagg 



taaaaaggaa 
tgaaagaaac 
gaataggcaa 
agataatggg 
aatgttgtac 
aattttatac 
ctggagtgca 
atcctcctgc 
aatttttgca 
cctggactca 
ccaccactcc 
tatctcaatt 
tagggattcg 
gagagagtca 
catggtgcca 
aggaatcttt 
atcaatcctc 
cattccgaga 
tgtgaggttg 
gagtatgcaa 
ccactgttgc 
actcaagtca 
ttgtgggtct 
atcaccatca 
cccccgcaac 
aggccacatc 
tggcccctga 
aacctccttc 
ggaacattct 
tttgtttgtt 
atctatctcg 
gtcctgagta 
agtggaggtg 
ccaccctcct 
acacttcaga 
gactccagca 
aaaactcatc 
cctcaccact 
gacaaattcc 
ccctgagcac 
tgaggggcgg 
cctcaagcga 
gtacctgacc 
nnnnnnnnnn 
nnncctgggc 
aaattatttt 
gctgaaggga 
ccactgtact 
tatggcaatt 
gaaggacaga 
aagtaacatc 
acatttacat 
tgtaaaatca 
tttcttataa 
ccaggagcaa 
aacccagggc 
gaaatggacc 
atttcatcta 
aatttgagca 
aatgaaatat 
tcatatttca 
agtaaggaat 
gaaaatcatc 
caggtggatc 



gtactgatac 
cagacacaaa 
atccacagag 
aaatgtggaa 
aattagataa 
tttattatat 
gtggcacgat 
ctcagcctcc 
tttttagtag 
agtgctccgc 
cggcctaaat 
taaaaaaaac 
atgctgaata 
cagatgaatc 
tgagaacata 
cagggtgatg 
cagaggagga 
gctgatcaag 
cagctggaaa 
taatctcaaa 
ccgtatcata 
tcccaggctt 
taccacgtaa 
gctcaggcct 
atgaccccaa 
ccttgaaagg 
gtcctgtgcc 
acacccatta 
ctgcactagc 
tgtttttgag 
gctcactgca 
gctgagacta 
ggttttcacc 
cggcctccta 
tcttaatgat 
atgaatcctt 
atatatttgt 
atcctctcac 
taaccatcct 
caagacaatg 
gtagagatga 
tcctcctgcc 
gcaactgggt 
nnnnnnnnnn 
aacaaagacc 
aaattagcca 
gaggatcact 
ccagcttggg 
tgactgtaca 
aaaaaggaaa 
aatattggta 
atgcatatgt 
aggaaataaa 
aaaaattata 
taaaaaacct 
tagttgggaa 
tgaacattta 
aaagacacag 
tcaagaagaa 
tataaaagaa 
aactggtaaa 
gtttttcttt 
cttttggcca 
actggagatc 



atgctacaac 
agaccacata 
acagaaagta 
tgactgctaa 
tggtgatcat 
tttatttttt 
ctcagctcac 
tgagtacctg 
agacagggtt 
ccacctcagc 
tgtattcttt 
aaaacaaaac 
agacacagac 
aagtgttaca 
cgcttagaga 
cttgtgctca 
aacagcaaat 
ggctggtgtg 
ggtaaaacac 
cgaccgaaac 
aaagcaccac 
tctccccacc 
agacttctca 
ggactcctgc 
tagcccatca 
cttcctgtga 
atctgggtct 
atactccatg 
ctagccaact 
acagtcttgc 
acctctgcct 
taggcaccta 
atgttggcca 
aagtgctggg 
catttccttt 
ttgcaatgta 
tcctctgaca 
cacttaacac 
tcaaatctca 
ggacacattc 
gtgtctcact 
ttggcctccc 
tagnnnnnnn 
nnnnnnnnnn 
cttcctctac 
gacatggtag 
tgagcccagg 
caacagagtg 
tctctaatgg 
caaagaagta 
aaaccagtaa 
taggaaccaa 
aatttgaata 
tatatgttaa 
cagcatccag 
aaatgacaat 
attgtgttag 
attctgggct 
tcatgacaga 
acaattagta 
taaatgtgta 
ctagaagaat 
ggggcagtgg 
aggagtttga 



atggatgaac 
ttgtatgact 
gattagtggt 
tggtatgggg 
tgtaaaactt 
ttgagacaag 
tgcaatctct 
acactacagc 
tcgctatgtt 
ctcccaaagt 
aaaagactga 
aaaaaaaaaa 
cctaccctca 
tcatctatag 
agggaatttc 
gagttttcca 
gaaaaatctt 
aacaaagcac 
taattacatt 
tgaccttcct 
cctcttctac 
tcactctcca 
tctccagcta 
aacctttcta 
gaatggacta 
ccctcgggga 
gccctctgcc 
ctctctccta 
aaccctttag 
tctgttgcca 
gccgggttca 
ccatcacgcc 
ggctggtctc 
attacaagca 
aagtgcctga 
acctatataa 
gttcagaggg 
agaactcacc 
ctcacctatt 
ctttatacac 
atgctgccca 
aaaatgctgg 
nnnnnnnnnn 
nnnnnnnnnn 
aaaaaaaaaa 
tgcatgcctg 
aggttgaggc 
agacctcatc 
gatatatcct 
gcaacagtat 
ttatatttaa 
gtttatcaga 
aaaatatcag 
ctctatccaa 
actgtagtct 
ttcatgtcta 
aaagtaagga 
ggtaaagttt 
acagattaaa 
gccatccctg 
agccaagcat 
tctagtgatt 
ctgtaatccc 
gaccagcctg 



85440 
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gccaacatgg 
ctgtaatgcc 
ggtcgcagta 
gtctcaaaaa 
tgggcctagg 
actttatagt 
taacaagaca 
ccactgagtt 
ttctaatcct 
aaaacaacgc 
aaaaggtagg 
tacactgcat 
ttgaaatgat 
gttatttttt 
gttagagata 
gatttgcttt 
tagatgaaat 
aaatacctca 
gatgtggtaa 
ccccagtgct 
tctctcattc 
ttcatctcct 
aagccttccc 
ccccccatgg 
ctccaccctc 
gaatccctgg 
aatgaatgaa 
cttcacgcct 
gagttcaaga 
gggggatggt 
cttgaaccca 
ggcaacagag 
agatgtcatg 
cccgcatctc 
gcaggccatt 
aggtgctgac 
ccaggcaggg 
accctcctct 
gatgggctac 
tcagcaagga 
acaccaggtg 
gcttctgccc 
ggaaagtgga 
cccaccttct 
cacacatccc 
gttcttctct 
taataccttc 
gtcagaaggg 
cctttctttt 
catggtccag 
gatggttcct 
aaataggtcc 
accctattct 
atggtaaaaa 
aggtcatagg 
catcatctca 
aagtcccaaa 
aggcctgttc 
tctcccaccc 
agacgcagaa 
cggacccagg 
agaagagggc 
aaggggtggg 
gccctggacc 



tgaaacccca 
agctactcgg 
ggctgagatt 
aaaaaaaaga 
cattgatcat 
ggatggatca 
acagacacca 
attcacggca 
gttgtttaca 
caaatccaaa 
ggggaggact 
ggaccaggca 
tggaggaatt 
aggtgtgatc 
catgtacggg 
gaaaactcta 
aagtatgcaa 
gcatacctct 
acagcctgac 
ctgtgcccta 
accattttga 
ccctccttac 
tgacaccaat 
ccccctgtgg 
tagactgaga 
tgtctggcac 
tgagtgcaac 
gttatcccag 
ccagcctgac 
ggcacacgcc 
ggaggtgcag 
taagactcta 
gggtaaatga 
acctgtcttt 
ccagggggca 
tgcagaagga 
actctcccac 
ggtgtgtaca 
ctcccagccc 
gttgagagtg 
ccaggcaaag 
tgcccctggg 
caggtaggat 
acccttggct 
cacctccctg 
attgaaatcc 
ctgatctttc 
agaggagaat 
gataccagta 
atgggaccat 
cctaacaagg 
tacaccctgg 
tctttcttta 
taaaaacaaa 
atacatatag 
catttctacc 
cacaatgcag 
atcctgctca 
cgttcttatt 
cagagaggaa 
agacctggag 
agaaccgagg 
gaagtgatag 
ctccattaag 



tctctactaa 
gatgctgagg 
acgccactgc 
aagaaaagaa 
taatggctcc 
gactcgcaat 
ggggctgctg 
aaaaaaaaaa 
ggaaataccc 
atatggggaa 
gttctagatt 
tggatcctaa 
tgaacagtga 
atgattttat 
tatagagaaa 
gcaggtgtgg 
aatgttagtc 
actcctccaa 
cccttcttac 
gccagcccga 
ccccactgtg 
tcatccttca 
cccatcctca 
gtttgcttca 
acaccttgag 
catggctgac 
tagacagggt 
cattttggga 
atggtgaaac 
tgtaatccca 
gctgcaatga 
tcacaaaaaa 
agacctccct 
cttgcttcct 
cacagcactc 
agagagaggt 
cttacccttg 
gccctgctgg 
taccacatac 
ccaatgaaga 
cgccgagaga 
gtctctaatg 
ggggagtgtg 
cccagaactc 
ctttgtcatg 
gatctgtctc 
tcagcgagaa 
gagtgtggat 
tcctgagttt 
gaattcagtg 
cagaccggcc 
caagttcaat 
aaataaataa 
aaaatcaatt 
ctactaagat 
tgttttgtga 
tttgatgacc 
tctgctcctt 
tctactgtag 
ggaaaggaag 
aaatgggggc 
agtgggggga 
gatgcgaaat 
tgagagggtt 



aaatacaaaa 
caggagaatc 
actccagcct 
aatcatcctt 
taaaatcact 
gtctaaacca 
acaggagaac 
aaaaaaaatt 
aaggaaagga 
atgacccagt 
ttaaaagcta 
ttggaacaaa 
ctgcacagta 
ggtgatgttt 
tatttacgga 
gagaagcagg 
tacttttgtc 
ccaggtccaa 
ctctctctct 
ctcatctgcc 
ctcctgggcc 
ggtcttggct 
cctagaacag 
cggattataa 
aaaaagaaca 
acataactat 
taagaacaaa 
ggctgaggcg 
cccgtctcta 
gctacttggg 
tctgagatca 
aaaaaaaaaa 
tcctggttcc 
tctggtactt 
cgaactgagt 
tggttgatga 
tctgcatgtc 
gaggctctgt 
ctcgtgccat 
ggtagtgcca 
aagcgtagct 
gggagaggag 
gaggcttcaa 
acccggcgtg 
ctggccctac 
tcacagcctg 
aggtgaaagg 
actctgttct 
ctttgggaaa 
ttctgfctcct 
atgagtttag 
gtcctccctt 
ataaatacat 
ttattctgta 
ggttactaca 
caagagcagt 
tacagtcttc 
tttgtccttc 
ctagctgcgg 
cagtgataga 
actgtaccag 
agggaaggca 
agagaaaaga 
gggaagatgc 



attagccagg 
gcttgaacct 
aggcaacaca 
ttgcgatcct 
aagtatatgg 
gttgatcaat 
agaggaaacc 
aaactgcgtt 
atacttttaa 
ttcttcaaca 
tagaagacac 
ccaactgtaa 
gatgatatga 
aagtaaaaga 
tgaaatgata 
tgcatgggtg 
cctcctgacc 
ggatcaggcc 
ccagccactt 
cagattcctc 
actctccaag 
taggtgccat 
attatgtgcg 
ctgcctgact 
catctatctt 
cactcagtga 
agagaagacc 
ggcagatcac 
ctaaaaagac 
aggctgaggc 
caccactgca 
aaaaaaaaaa 
ctgaccagcc 
ctgtttaagc 
gaaggcaaag 
gaagtttcca 
cctcctcccc 
gttcccagct 
gtccccaccg 
caactgtatc 
ccagaaggag 
gatctgggcg 
tggaacattt 
ctctgcacct 
caccttggat 
gtcaatgaca 
aacgacaagg 
aacctgctcc 
tcttcctcta 
ctctcaaggt 
attaggatgg 
gattttggaa 
gttttggatc 
tatttaagat 
gttaagcaaa 
taaaatctac 
gtgctgtgca 
gacctgcatc 
tttgtgatgt 
gttgggacaa 
acttagtgca 
acccatggca 
gggacagaaa 
ctaaggccct 



tagtgggtgc 
gggaggcgga 
gtgagactct 
aatgaaataa 
ttgatgggaa 
cttaacatcg 
catagctcta 
tcctccaagc 
atgacacatt 
aataaacaag 
agcaaccaaa 
aaggatgtat 
agaaattatt 
ggccttattt 
cgatgtctga 
ggggaaggga 
cagcaggtta 
aaaactccct 
ccctaagatt 
aafcgtttcac 
gccccgcctc 
tccctccagg 
cttctttgtg 
acctgccttt 
gtctgtcatt 
ctagtgtttt 
aggcatggtg 
ccgaggtcag 
aaaaattagt 
aggagaatct 
ctccagcctg 
gagcgagaga 
cctgccctcc 
cggtccatga 
gcatttgtgt 
aaactccctt 
acaccatcag 
gggacatgca 
gccatgttgg 
acagacagcc 
cggagttcct 
tgaattacga 
cagatccggg 
gcctcggact 
gaccctctgt 
cttcttgcag 
agaggagaaa 
tcagcacctc 
ccctaatctt 
taaccaatga 
acttaatcta 
gcttcccaga 
caattgtcag 
atacaatatg 
ttaacatatc 
ttgtttagga 
ttagatctct 
tcccatctcc 
gtgtaaccaa 
taagagaggg 
atggcatcac 
ggcgggcttc 
agagacgaaa 
ttttctgtcc 
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tgcctttcct 
aaggagaggc 
tgaaaggaac 
ctttgtggtg 
aggactgtga 
ccaccgcact 
ttaaatcaga 
ctagtgattc 
ttcaaatccc 
gagcctcatt 
gatgaaatca 
ttcctctact 
atttccccca 
ccttgtgaac 
aaatctcaag 
atgagcccac 
ccaaacctat 
atcagatgta 
agccagctac 
tttgagatag 
tcctgtgata 
cccacaatca 
gtgcccccgg 
gaagatgaag 
gaaaaagtca 
aagcaaacta 
atggaatatt 
tcccagcact 
catagtgaga 
catacacctg 
gtttgaggct 
gaccttgcct 
catggatgaa 
tagtgcttta 
gggaaacagc 
gtgcacgcct 
tacatgcctg 
agttccagac 
agtggggcat 
tcgcttgaac 
ctgggcaaca 
tgttatgtat 
gttttctaag 
cctggtacac 
acgacccatc 
cagtgccttt 
aaattcatct 
cttggtgact 
accaccgacc 
gtgcagtgaa 
gaattcctgt 
catggtggct 
aggtcaagag 
caaaaataag 
ggagaattgc 
tccagcctgg 
aggaaagaaa 
gtgactctta 
ccctccaaac 
agagatcctc 
ctaagcccag 
cactcagcca 
catggttgcc 
ttgtttttac 



gattctgggt 
ttgggtctgg 
tcctttgact 
ccattcccag 
gccgaacctg 
ttggaatcac 
atgtctggag 
taataagcag 
agctctgcct 
ttttgcattt 
gagagcacat 
ccttgtggca 
atttgaatga 
tgcttcagcc 
aacgcttgtg 
gctggcttgc 
gtgagtgagg 
tgaatgagcc 
ccacagactt 
tttgttatgc 
ggtttgcttt 
ccagatagtc 
ggcatcagga 
gaggccctga 
gtgtcacacc 
agggtccatc 
attcagcctt 
ttgggacgca 
ccctcacccc 
tagtgccagc 
gcagtgagtc 
ccaaataaaa 
ctttaagagc 
gaagaccaag 
aagacctcat 
gtaattccag 
taatcccagt 
cagcctggcc 
ggtggcacgc 
ccgggaggcg 
gagcgagact 
attttattgc 
gagtcctagt 
acatgccccc 
cccacagccg 
tttcccccaa 
tacaaaacac 
ggtatgacat 
tagcccaacc 
ttgctcaata 
tcctgctccc 
catgcctgta 
ttcaagacca 
ctgactgtgg 
ttaaacccac 
gagacaaagt 
gatgaggcag 
tgccaggagt 
ctccaaagct 
ttctctgccc 
agagaagtcc 
tctgtggtgt 
acaactccag 
tctgctttca 



ccctggggga 
aggagagatg 
gtagctccct 
ggcacaggct 
agactcagac 
cttttagcca 
gtgagagcca 
caaagtttag 
cttcctagca 
agaaaatgga 
ggggtctgac 
aatgactgca 
gactgcagct 
aaaagaatgc 
catttctgca 
tggaggatga 
ccatcctagc 
ctgtcaaaat 
gtgaaaaata 
agcaatagct 
accctgtcca 
cttacccata 
ggaaagtgcg 
ggctgatgcc 
cacgttcata 
agcagatgaa 
aaaaaaggaa 
cgaggatcac 
ttctctagaa 
tactcaggag 
atgactgggc 
aaaaaaaagg 
actatgcagg 
acaggaggat 
ctctactaaa 
ctacttggga 
actttgggag 
aacatggtga 
acctgtaatc 
gaggttgcag 
ctgtctcaaa 
atacaaaaac 
acaagccatc 
acccacccca 
gtggagcctc 
cgacagcaaa 
accaatgggg 
caaagagaat 
tcctcccatt 
aattgacaga 
tttccccttc 
atttcagcac 
gcctggccaa 
tggcacgcct 
taggcagaga 
gagactccaa 
ccatctgggt 
agaaaggctt 
tatgggacta 
ccataaccrtg 
caggttggat 
gctggcttta 
ctatcacatc 
ccttgcgtca 



gctctggagg 
ggttagccag 
gttttcatgc 
atggaaacaa 
tggagggaat 
ctctgatgcc 
ggcttccttt 
gaagcatgaa 
acagaaagat 
gataaggata 
aattagtaag 
aaaatgatca 
ctatttcccc 
agcagaagtg 
ctctttcaga 
cagccacgtg 
atagccagcc 
catctacatc 
ataaatgctt 
aacagatgca 
tcccacccta 
gctgtagttg 
tgctacatgc 
tttttcataa 
gcagcactat 
cagctaaaca 
agaggcaacc 
ctgagcccag 
aatttttatt 
gctgagtggg 
cactgcaccc 
aaaggaaaga 
gccaagctca 
tgcttgagtc 
aataaataaa 
ggctgaggtg 
gctgaggcaa 
aaccccgtct 
ccagctactc 
tgagccaaga 
aaaaaaaaaa 
atcagcagaa 
acctactatc 
agacacaccc 
ctcgtgtggc 
ggccttttgt 
agcttgccag 
ccctaagttc 
ttacagagag 
gtaagcagca 
caactctaga 
tttgggaggt 
catggcgaaa 
atagtcccag 
ttgcagtgag 
ctcaataata 
tctccagggg 
gagtgcctca 
agctagactc 
caggcagtgc 
ggcttgagat 
gtcctccagc 
ctcacaccac 
gggaagaaaa 



tgaggggcca 
cagggctcac 
tcagctgttt 
aagccccacc 
agcatggtga 
caggttgcag 
tctaagatct 
aagagtaggg 
ggctcagact 
tctcagagga 
tgagcagcaa 
catttcttca 
atgccctgaa 
gctgtgccaa 
accctgaaat 
acccaggcat 
cccatgtaat 
tggccctgat 
aacattttag 
ctgctccagt 
gggccaatga 
gggggcagtg 
caggtactgc 
agaagctgca 
tcacaatagc 
taatgtgatc 
• atgctggctc 
ttcaagacca 
' taattagctg 
aggatttctt 
cagcctggac 
aattctgaca 
gtggttcctg 
caggagcttg 
taaatcagct 
agaggatatg 
gcagatcacc 
ctactgaaaa 
gggtggttaa 
tcctgccacc 
ggttaagata 
gaggcagggg 
ctgtaagctg 
ggcagtagag 
tccccagaga 
tcaaagaaaa 
tcatctccct 
ctcaacagct 
tgacgttgag 
gcaaagtcag 
gagacaggag 
caaggaaggc 
ccccatctct 
ctactcagga 
ccaagatccc 
aaaaaaaaaa 
aaggagggag 
ggggctcagt 
atgtctgggt 
caactgcctg 
ccacactctt 
ttgctgcctc 
aacattcaat 
gtccccttga 



ggaaaggcac 
cttccttcgc 
ccttctcttc 
agcaaggcca 
atcccacatt 
accagaccag 
ctatgtgaat 
caggcccagg 
taacccttct 
ttattgtgag 
aggaatgccc 
cctcctctgt 
tctgggccag 
ttccaagctt 
cacggtgtga 
ccctgtcact 
cctccaaatg 
cagcggaact 
gctgctgagt 
cctcctcctc 
ggggctctgg 
ggtatgggat 
agacagagat 
gaaggagaag 
caaaggatgg 
tatacacaca 
acacctacaa 
gccttgacaa 
ggtgtggtgg 
gagcccagaa 
aatgaaacat 
catgctgcaa 
cctgtaattc 
agaccagcct 
gggcgtgatg 
attacatgat 
tgaggccagg 
tacaaaaatt 
ggcaggagaa 
gcactccagc 
gtaaatttta 
ctggaaccct 
attagggaca 
gagtcctcat 
tcttctagcc 
ttttacacaa 
ctttattctc 
cagtttgaaa 
gtccagagag 
attaaactaa 
agaggctggg 
ggattacttg 
actaaaaata 
ggctgaggca 
accaatgcac 
aaaaagagag 
aacccagaaa 
ctctgcataa 
ggtgactgcc 
tgacctaaca 
cccttccttt 
ataattgaag 
gaggaagact 
atcttccact 
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atatacactc 
ctagcaacgg 
gctaagcggg 
attgacctct 
gacagctgga 
ttgtccaaac 
cccaacactg 
aggggcttag 
tatcatggct 
ttcatctgta 
tgcaggttca 
tatatttacc 
agtgatgatg 
gcctggatca 
ccctccttat 
cccatctgag 
ctgccgagtg 
ttcccccaat 
caaagtcaga 
cccacctgct 
cagcgtcctt. 
gtcttctgag 
gtctacagac 
aagctgcctc 
ctggctcaac 
tgtttaatgg 
gaaaaggacc 
attgtgcttg 
acatgcaaac 
agagctgggc 
cattattctg 
cacttttatc 
aacactggca 
aaacaatgtt 
cctgctcttc 
aatcagaagg 
tttgttagtg 
ataccttaag 
tttctgtctt 
atagagtcct 
agaggaggat 
ctccacattc 
ataaggagag 
tgaacaaaga 
tgtcagcttt 
attgagccac 
ccaaggcaat 
ccacttgctt 
tttctcctct 
cttaacagac 
taccctgcta 
agaaagagaa 
tttttagggt 
caatggtgtg 
ctcagcctcc 
tttttttagt 
cagatgatcc 
cctggccaga 
ccttgctttt 
ttccaggatt 
atagccctct 
cccacctgac 
aaaaatctga 
gaaaaaacat 



cctttatctc 
gtaaatggat 
tcactgccca 
aatagccttg 
caagcctctg 
tagaagctca 
gcccaagcac 
cggttacaag 
actagctgca 
aaatgggggt 
caagacaagt 
ttaataataa 
gtcgaacggt 
cttatttatt 
acccaaactc 
gcagccaatt 
cctgggatta 
ggtgagcatt 
ggtagggtgg 
ccaaattccc 
caagggtttt 
ccaccttctt 
agaggcagag 
tccctttcat 
cctctagccc 
gaacaacagt 
atggtctttt 
gcactggaaa 
aaatggttat 
taggggaacc 
gccacaacct 
caaaaagtat 
ttacttgtca 
attcaattgt 
ccatcagtgt 
actgaaagag 
tagttcacgt 
aagcactgcc 
tataaaatgt 
tatgggctca 
tgttgctcag 
agttgcttaa 
tggtttgaaa 
cactcagcct 
agcatagctt 
gggtctaaca 
gacagaaaga 
tctctgtggc 
gtgtaatggg 
tggtgaaatt 
agtaaggata 
cattttaaca 
tttttgtttg 
atcttggctc 
tgagtggctg 
agagaaggga 
acccgcctca 
tgtaaagttt 
cacattttct 
ctttcaagct 
gctcacacct 
agcactcacc 
aacacataac 
gattctgctc 



attaaaaagg 
tgccatggtt 
gataaaactg 
gcaacagttc 
tgtctgcccc 
cagcagcaaa 
attatgacca 
ctcaggctct 
gaaccttcaa 
aacacagtac 
gtctggcata 
ggaagactgc 
gttggccaaa 
catgaaacag 
accctccacc 
atagaaaagg 
gggaaggctc 
ggcatggcca 
tgtcatagtg 
aacaatcctg 
taggataccg 
cccaaccaaa 
ctctaagtga 
ccttgacact 
attccctaat 
tcctgctctt 
ctttgttcac 
tgcaagatga 
aacaggtctg 
cttgacaaat 
atgcctgtcc 
ctatttttaa 
tgaatagaaa 
agctggatgc 
taaaatcaca 
aaataaaaag 
agtatgcgtg 
aagagctaat 
acatctctcc 
atgctttttt 
acgatttgca 
aaatcattta 
taattggttt 
ctgtgtttgc 
acacacacaa 
ctaggaagga 
agagtgaggg 
catggacatg 
tggaagggtg 
ttaaaacaaa 
tttgcaaaat 
cataggagaa 
ttttttggta 
actgcaacct 
ggattacagg 
tttcaccatg 
gcctcccaaa 
tgaataaatt 
caataaactg 
gcagatctct 
gccccagcca 
tgcagaggtg 
aggaaaagca 
agccagccca 



actggatcat 
ggctttaatc 
ttcgcaatga 
atcccctgat 
tgctgtaccc 
cgccccactc 
ctgccacaaa 
agaaccgaca 
caagcttttt 
ctacctcacc 
tacaagtgcc 
cgaggaagag 
taggttccca 
atacttcctg 
gctagagctc 
gtctctcctt 
ccacctgcag 
ggaagagcag 
caacccaaac 
gtatccaggc 
gccaaggctt 
gttggtcctc 
agccacctct 
cctggaaaag 
tcatggtatt 
cctgtctcag 
taaactctga 
atcagatagt 
gtaagtgtga 
tcttcctccc 
tattatttgc 
aggacacttt 
gtaactgtca 
atttgacctt 
ctggccccac 
ggaatagctg 
tgcccctaac 
tttagagtat 
tactactaaa 
caaaactgaa 
ggcacgagtc 
caggctttta 
gaggtgctgg 
ccagcatgag 
gagtaatgta 
atattgggag 
atcaggaggc 
tctttgcaag 
gtggaaaata 
gattttgagt 
ggtcattcat 
ggatgtaaag 
gagtctcact 
ctgccfccctg 
cgcgcaccac 
ttggccaggc 
gtgctgggat 
ctactctctg 
ttttcacaga 
gaataaccat 
ccaggtgcct 
aaggctatga 
gaatattgtc 
caagtgtagg 



atgctgacct 
aatcaggatt 
ataagacaga 
accccaacat 
actagccact 
taaaggtccc 
agcttgggca 
agcctgggtt 
aacctcagag 
gagttgatgg 
cagtgaatgt 
tcaaatgctc 
atctggggat 
agcacccagc 
ccacctcagc 
ccctccacct 
gttggtgatg 
cacggctgag 
atggagcccc 
cccaattcta 
ccccagatat 
agtctgtggc 
ctcttccctc 
aagaccctgg 
ggccttgagc 
gtgctatgag 
gcacttcttt 
ccttgccctt 
gaccacagca 
caaaccagac 
taaaatgttt 
atatcactac 
aaataaatac 
agaatgttca 
atgaagacat 
ttctaccagg 
atcctcttaa 
tcacacagtt 
ggttggagac 
aagccctata 
aaacattacc 
gagtagatga 
gccatctcat 
tgcagacaat 
ctttctttcc 
gcatctcgtg 
ctgcacatca 
gggtcctgct 
aaccagattg 
acaatagggt 
ataatcattt 
gttttttgtt 
ttgtcaccca 
ggttcaggca 
catccagcta 
tggtcttgaa 
tacaggagtg 
aagtaatccc 
ccagcaatag 
gtggtctgta 
ctgagcttgc 
tgagtgtggc 
aaggagggag 
acttgaccgc 



ccacctatca 
catcccctgg 
atggttgttg 
cagccactgg 
tgccaccttc 
ccagcctct'a 
agtctgaaga 
caagttccag 
actcaaatgc 
agacaaataa 
aggctgttgc 
cattgtacag 
gataggacta 
atgtggcaga 
ttgggccaac 
tcccgccacc 
agaaacagga 
cctggaccat . 
aaactctgct 
gccagcgttc 
ctctgtggaa 
aggccaggaa 
agtaaaccac 
actcaggtcc 
ttcaatcatc 
aactgagtga 
ggtgccaggc 
aaatagactg 
aaaaagctca 
ttctgcccac 
taagttgact 
tgtagatgaa 
aatgaaagga 
aagcctaaga 
tctttcatga 
tgatttgatg 
ctaccgtgct 
taccattcaa 
tcctttcaca 
ttatggagga 
cagccaccac 
tgctggtttg 
gagatctgtg 
ctcatgatgc 
taaaccaaaa 
gccaccataa 
ggcccacctc 
gtggcttcag 
gagttccaga 
tgtcaacttt 
cattaaaaag 
gttgttttgt 
ggctggagtg 
attctcctgc 
atttttgtat 
ctcctgacct 
agccactgca 
tctccatcat 
ctcaagatcc 
tatcttgcct 
atccctccca 
ggtggtgtag 
aaacctggga 
accctcagcc 
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tgggatgcaa 
gcgatgagat 
ccaaggctag 
acttagccag 
aaccccttgg 
ctcctcactc 
tataaaaaga 
ccaaggttat 
aaagcaatat 
gtagcaggga 
aaartgtaag 
cccgtctctc 
ctgtatcagc 
aggatgcagc 
ggaaaccccc 
ccaaagaaac 
gggggcggtg 
gatgaggaca 
ttgtaaccct 
agcacgtgat 
ggggctgtgg 
agctgctggg 
ggatttcact 
ctggcctttg 
atgtagccag 
gagaacctct 
agccttcctg 
tcgcaaactt 
ctggtgcctc 
tctcatcata 
tgggagccag 
gcattgccaa 
aagacaaaca 
agcccagtca 
ggagtggatc 
gcgtttgagc 
tggttgtttc 
aagatagagg 
tgtgaggtgc 
tactccaata 
ggtcttgatg 
gagcaggtgg 
gacagcaact 
ctgccaagtt 



aggaacttgc 
aaataccccc 
tcctcgcatc 
cccgctccag 
ggcgccccat 
tctccgcgcc 
tcccctgctc 
tctccatcac 
ccctcttgaa 
cctcgctgcc 
aatgctgtta 
cacagacagc 
cgctgggttg 
gtttcacact 
aggactgagg 
aaacccctta 
gccagactgc 
attgctcatc 
tccagcccag 
aagctcagtt 



cgggcactga 
tgagatgtgg 
catggaaagt 
gacagaatgg 
agaaaggggt 
tgtgaccttg 
aatgatcgga 
caggctccct 
cctaaggctg 
gccaaggacc 
ggactggctt 
tctctgacag 
ccccagcagg 
aacacactgt 
ccagtctccc 
ttcctattag 
aggaggatga 
aggatccttg 
ggctccaagc 
agggaagcca 
tgaagtgggc 
agcgagctgc 
ccaagctagg 
ggacttacat 
tggggaatgt 
catcctgggc 
ccaaatgagc 
gtcccttaag 
cctcccaggc 
gaaagagggg 
acttaacttc 
tcggcccagc 
ctagtgaagg 
gcagtgtggc 
ttcaaatccc 
acttggaaaa 
aggcctgggc 
ctgctggaag 
attaagaggc 
catcatctta 
gccacctgga 
agaagcggag 
caaccaggtg 
ggtgttatct 
ccaaggtcac 
aactcccatg 
aacttctacc 
cctgcggcct. 
ggcgcccgaa 
ctttccgcac 
gcctccgccc 
ttccttcttc 
acttctccct 
tttcagcctc 
ctcgctcagg 
acatgagggc 
gcagggaaca 
cccagagctg 
gcttttcaga 
gatgactccc 
attgattaac 
ggagcttgtg 
agcacgtgag 
tggggagagc 



tgcctctgag 
tcctgacctc 
cttggacaat 
gcaaagcgaa 
tagtgcctgg 
ggcaaggcgc 
ctgatctagc 
tagcatttgg 
gagaagaggg 
aagaaggact 
tactctggct 
ccaatctcac 
ccgctgatta 
gtgtgagatc 
ctccagtcag 
ctaagcccct 
ggatgaagcc 
gggtgaagga 
cagcccttac 
catttatgca 
aagtctagag 
aggtgtggtg 
ctgctgcctg 
ggctatgagg 
catgaagttg 
tttgcagtct 
acacaggctg 
ctgagttgaa 
acttcccaga 
tgggcagggg 
cttagaaaag 
atctggtcca 
ccagccaaag 
cacgtgcagg 
actttgtcct 
ttcctctggc 
aaaaaccatc 
aggaggcctg 
aggatgatca 
ggcactgcat 
gtcacaggtg 
ctcctgcgct 
cagaggcagg 
tcagtttact 
agggctagtt 
accctcgcct 
aggaaagcct 
gccctccccg 
aggggaccct 
gggccaggtt 
cctcctgccc 
ttttcccttg 
ctagaacccc 
ccagccccct 
cccacctagg 
atagggacac 
tatttttcct 
ccaaaaaaat 
aaccatcccc 
tgcctcaccc 
cattcattgc 
atcagcgtga 
ctgacgtctt 
ttcttgcgta 



ccccaggctc 
atggacagtg 
gagctcagct 
ggtcctccca 
tctgcaaatc 
tgtccttctc 
tcagagtgct 
gggtcttgta 
aggggacaaa 
gaggtacagt 
gtttccggaa 
atgtgcctcc 
ccactgagcc 
acgtcccgaa 
aagggacctg 
agggagtgat 
tgggcaacct 
agagaagagc 
aggaagtcac 
gagcagggaa 
agagtcccgc 
cctggcaggg 
aaggattcct 
cgtgccacgg 
ttcatgaagg 
ggagtagaaa 
ggctcccctc 
atgtggctgc 
tcaagtgggg 
gcagagtcct 
tcatccctgc 
cacagatcct 
aggacgccag 
ggcaggccct 
cctggacgga 
aagccaagcc 
agcgggtgat 
cgggaaaggg 
tggccgctgg 
ttgagtaacc 
agataaacga 
cgaattctga 
gagatttcat 
gacaatgaaa 
tagtcacgac 
aggactcgca 
ccgggggccg 
cagaggagcc 
tcgccctacc 
cgcattcgcg 
gcccggaagg 
ctcacagcct 
ctagaacccc 
tgcctctgcc 
gaaaatgtca 
acacactcta 
atttgctcac 
cccaaccaca 
tggaggactg 
ccgcccccca 
cccatttttt 
ggccttacta 
ctttggcctg 
ttagatgcag 



aaaaccaggc 
cattttgctc 
gacgatgtga 
cctggaagcc 
aaggccttga 
tgggcctcag 
atgatttcag 
aggcatggag 
ggaaggggag 
cattctgcat 
aggcaggccc 
ctgggagcac 
tggccacaga 
ccttctgact 
aaattccacc 
tggctgttgg 
ggatgtgagg 
aattttaggt 
cctggcctcc 
cgaggtaagg 
tgcctggggc 
tggccgggct 
cttacccacc 
tggtcttgaa 
accccagggt 
aaaggtctcc 
cacctcagac 
ccctaattac 
tgagagctgc 
tcctgctcct 
ccttaccagc 
taaagtaatc 
caaagcccag 
ggcccgccat. 
tcacaggcgc 
cttcctttcc 
tctctggatc 
aaaggtagac 
cagcaaatgt 
acgtggcaag 
cttacccaat 
ataccgtccc 
acagaagaca 
caaaagctcc 
gcaggccatt 
aacctggtcc 
ctccccgcca 
cgaggggcca 
cgcctgctcc 
cctctcgcag 
ggctggggca 
cccgcgccct 
agcggtgtct 
tcccctaacc 
cacccagcac 
tttgtgcatt 
cagcttaacc 
gaatcaggaa 
ccccatattt 
ggttctgaaa 
attaatcaaa 
agcagctgcc 
tgtggccgtt 
tctgcagact 



gcaagaagcc 
attctgaggc 
ttggcttggg 
ccaacagccc 
gttctaactc 
gagccttttc 
gactacagtc 
taaaaaaaaa 
gaaaggggag 
ccaaaggctt 
agccagccct 
ctgctctgag 
gcacgagatt 
catctgcaca 
agtggcaata 
ggcggggagg 
ctgtgcaggg 
tttgctaatt 
ggctcaattc 
aaatggaagt 
tgttcctaac 
gtctgactct 
tttgcctggg 
ccggtcaaag 
gaagatgagt 
catgcatccc 
agcttgtcgg 
ccctcaggag 
tgacccttcc 
tgccaccacg 
ctgccctgtg 
ttcattcttg 
gcattccagc 
gagcagaagt 
cgtaagcctg 
cgtagctctc 
ctgtagaata 
tagagttatt 
ggggaataaa 
tagagaggca 
agctccccgg 
ctaaaataat 
caaactcccg 
catggatttc 
ctactgccag 
ccgccgccct 
gcctccgcac 
ggccgcgctc 
gcgccggggc 
cccctcccag 
gacctcccac 
ttttacctct 
ttccctccct 
aagttagttg 
ccagaggaca 
ttgccttgac 
gtctctccca 
gccaagaacc 
tcactcccaa 
gagccttccc 
gacatatata 
ttactatcct 
tccttgccaa 
cccaacccca 
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gctacctgga 
tttaagagga 
caccaccaaa 
gaagggtggc 
tctttggctg 
aaaagaagta 
gtagggtagt 
ggaggaattt 
tctgaaacgt 
gtcttttctt 
acaaacaggg 
ttaatgcatg 
gtgtggaagt 
gttaatctgc 
tgaagcacgg 
cttgactctt 
cactaacaca 
cccatgttct 
tcctgcgtgg 
ggtctgacgg 
acaccttttt 
tgtgccaagt 
ctcttgtggg 
aggcagatgc 
ggaaccagct 
tctaagtgag 
cccttaatat 



cacagaggca 
gttcccactg 
tggtaaggac 
agggagcata 
tttccttccc 
gacagagtgg 
tataagtgag 
ctcaagagca 
tcctctgaac 
tcctaccctc 
ataaagggct 
tcacaatggg 
ttctacagcc 
cagagaaaat 
agccaattac 
tatggattaa 
tactgttgtt 
tagggagaat 
aagtaatggg 
tagatgacac 
gcttggattg 
gatggagtct 
gagggtgggg 
tttctcttta 
cactctggcc 
cagcctggat 
actggaagca 
cttatgaggc 
caactgttta 
cactgacaat 

ggaggcacgt 

tgagcactga 
cgataggtgt 
aactccccca 
ctgaaactaa 
tcaccaaggc 
tcaagtgatt 



tcccctgagg 
agaaatagag 
gcacaaaaga 
tggggtcctg 
ccccacctgt 
taagatgggt 
agattttcta 
caaaggaact 
atgctctcat 
aaaataacat 
cttaagatgc 
ccctatcttc 
agcaagggag 
aggagcctgc 
gacacggggc 
agagcctcca 
agtcttactg 
ttggttcgaa 
ctccaatgag 
tgcacccgat 
ttttttaatt 
gccgggattc 
agagacagac 
tagcacagca 
agagcacagg 
ggcattaaga 
ctctcctatc 
tcatctgctt 
gatggcttca 
cagtccaggg 
gaactggatc 
atcatcagca 
gaagatggat 
ctgcactcca 
ccaccttcca 
aacctcagcg 
tgacatcact 
ttgatttcgg 
ctcatgtaaa 
tgttttggaa 
tctaatcctc 
tggtatctct 
agataatgta 
aacaaatatt 
ttcaaagtgt 
tgagctcagt 
ccaaatgcac 
tcctgtaagc 
tagagctggc 
aactactaca 
acagaggtga 
cctcagagca 
cagcctccct 
ctgggccagg 
tactgctacc 
ctggtaactg 
atctaaacct 
cctacccctg 
ctatgtatct 
tgttattagc 
acatcaccag 
tgcttttttt 
tggagtgaaa 
ctcctgcctc 



gcccaggaac 
gttacgatag 
taaaagtctg 
taccccaagc 
aagacgagga 
gtatgaggtc 
gtctgcagtt 
tgcccacggt 
caccccgtct 
tatttatggg 
aagattttct 
tgctctggca 
aaacagaaaa 
cagaaagatc 
aggtcgtcca 
ctccacatct 
aactgatggg 
ttccagcagc 
gattaatgag 
gtcttcatcc 
gactgattgg 
gtagagaaga 
aaagaaaccc 
cattgaaagg 
gtggtgaaga 
attcagccca 
tatccgtggc 
ccaagccatc 
cttaactgct 
tctggggatg 
tcctgcatcc 
agtctggata 
atgtttaggt 
tcattcccct 
atgcagtccc 
agggcaattg 
ttctaaattt 
atacgccaaa 
ttaaaatgtt 
gtcagaaagc 
ttgaggcagt 
gagcttccgt 
tgaaaactgc 
attgacttac 
tcctaaccta 
ttaccttgct 
tcagtacaac 
tgtgggaagt 
tcttcagggt 
tagccaagta 
atctgaaagg 
gccagctcag 
ttcaccatgc 
cctaagagaa 
caccctccca 
cacttcccga 
gcaactctaa 
agaagaaaaa 
gattcgaagc 
atctattatc 
gtagcagtgt 
tttttttttt 
tggtgtgatc 
agcctcccga 



tccagctatt 
gggacagcca 
caaccaccct 
tactcactag 
caatagtacc 
cctgcatggc 
atgtagacag 
cattctacaa 
taacaaacat 
agaaaatcca 
attttactgc 
tacattttag 
aagtcaaagt 
tagctcatgg 
gggttctgtc 
cccatcaatg 
acaggaaatt 
tagaaaaggc 
taacatcaga 
ttttctcttc 
ttcaacaaat 
gattcagtgc 
aagatttctg 
cacggaacct 
aagtggaaag 
catcaatcaa 
cactgctcta- 
gccattctcc 
caaaaccctt 
ccagccatga 
tactgcccaa 
tccaggatcc 
tagggaaaga 
ggcacaaaca 
tgcctgtcca 
ccactctctg 
gtatatgtag 
cacataaaca 
tggttttcca 
aaaaggtaaa 
gccaaaaata 
ctcctcatct 
ccaattagta 
tggaaaacaa 
aaaaagaaat 
taagaagtcc 
cccccaagac 
ctgttcatga 
atctcctagt 
aatatgaggc 
atacccaaag 
cttcagtgga 
agaaaacaga 
agcagagccc 
agccccagct 
tagactttgc 
ggactaggct 
taaggaatat 
gctttttgct 
tgggccaact 
caggactggg 
tttttttttt 
tcagttcact 
gtagctggga 



ccaagcccac 
gaactgagga 
agtgacttga 
ttatacaacc 
ttaattatag 
gcaggtgcta 
agccagagaa 
agctgcagta 
ttggacatta 
caaaaatata 
aagacacaaa 
tctcctgggg 
aaagagacag 
gctatctgta 
caccttatct 
tctgcagaag 
agaatatcct 
agatgctatt 
gagagaagtg 
gcctccttcc 
acatgtggta 
ctgctctcaa 
gagtgtggga 
caacaaaaca 
atttgaggcc 
tcatgtccta 
tgcagacact 
tgcaagagtt 
ctgaggtcca 
gacattgctt 
gtaccaatgc 
accctatgga 
gggtttgcca 
atggctagta 
cagacctctc 
ggcagagtcc 
ataaactctg 
aacaaacata 
cctacggtct 
tgcaaacatc 
atacaagcac 
ataaaattgg 
cactattaat 
agaggaaata 
aaattagggg 
caccctagag 
tgtctgggct 
gccacagtag 
gtgttcaaag 
ctccttgctc 
aggcactgga 
tgctagaggc 
gctcccccac 
caagccccac 
ccacttctat 
tagaaaggaa 
ggggaacact 
caataatacc 
taatctgtca 
gaggcctaga 
atagaaacct 
tgagacagca 
gcagcctcca 
ttacaggcgt 



tcctcttttt 
ttttccagct 
ctgaatggag 
tgaggcaagc 
gaattgtcat 
taggcagatt 
gcagctctgg 
ccttcccaac 
gagaaaacaa 
gcatcccagg 
gactctgaaa 
ggatcagtaa 
attttagaat 
catccaggac 
tgttacctct 
acgtggcctc 
ctgaaccatt 
ctgatcactc 
attataataa 
tcatcatctc 
cctcaggctc 
ggggctcatt 
atggtcttcc 
ataacattta 
agcgtcgcca 
ttgatttcac 
catcatctct 
tatttccatg 
gtcaactggc 
tgaggggaag 
tggaggtggt 
tgtttttatg 
aagagggcag 
tcctctagtc 
tcctcaaact 
agatattctc 
agccattcac 
agctttcctt 
tgaaaggggt 
atttcacctg 
actgctatcg 
aatcgagctg 
aaatagcagc 
aagtcacatt 
gaaaaccact 
aactgatctc 
taaggcaggg 
acaggaaggg 
cagttctcag 
tggggagacc 
gggtgggggc 
agcagaggat 
cagaccagat 
cacaccaggg 
gttcatcaag 
tgcctcagtg 
gtgtcaacat 
tgctgggcac 
ctgaatctca 
gggacgaagc 
gatgctctga 
tctcactctg 
cttcccaggt 
gcaccaccat 
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104700 
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gcccagctaa 
gtctcaaact 
caggcgtgag 
cctgggaggc 
accagttatt 
aacaagagag 
accacaccca 
actcctgagt 
tgaaaagagc 
caaagggaag 
tagggattag 
gagagcacat 
caagtctgtg 
ggaagtggga 
gcacctttct 
tagccctgac 
gtcgtggggt 
atggtgagtc 
aaaccacgga 
tgggctgcaa 
aaagcattct 
acaaaattgc 
ttctcaatga 
tgagtgctgc 
tgatgcccac 
tgaagaaact 
ggaagagata 
cttttcctgt 
acagaagtcc 
gacatctctg 
ttgttgtgat 
gataaacctg 
gaatcctaag 
gaaatcaata 
ttgacacata 
atatccaagg 
gatcctatga 
ttccttgttt 
tgagacccac 
tcacttctgt 
aaaacatgga 
ctctgcaacc 
tagcacccac 
gaacagtgtc 
nnnnnnnnnn 
nnnnnnnnnc 
ggtctcaaac 
acaggcatga 
cgggaagtgc 
ggattctcca 
aggacagtct 
accagagtcc 
ttccagagat 
cccacaagct 
taacttggta 
gctcacatct 
gagttcgaaa 
ttagccaggt 
aattgcttga 
gggtaggcaa 
ctttattctt 
tgataatatt 
ctatccatct 
aatgagaatt 



tttttttgca 
cctgggctca 
ccaccatgcc 
ttcccaagag 
actaatagga 
tagtgctgcc 
cccacacgca 
gaaaacaacg 
aggaaactgc 
ccggaagaga 
ctggagggag 
gtaggatggg 
gtgaaattag 
gacccagggt 
ctcttggggc 
ttgctgaaat 
gctgcaatgt 
cgccctggga 
agttcccggg 
aaccgcaaga 
gccctccagg 
ctcaggggca 
aagcaccctt 
ttttgggaca 
agccacccca 
gaggcctcaa 
gacttcccag 
gccaatgtga 
tctctgtgtc 
ttatactact 
taagtatagg 
aaatgcaatt 
gctgactggt 
ccagaaggtt 
ggtgaagctt 
cagggagcca 
agatcttacc 
gcttagatta 
tgaagaggaa 
gttgtcactg 
cttggaagcc 
tcgggcagat 
ctcaacatct 
tgccatctgg 
nnnnnnnnnn 
taatttttgt 
tcccgacctc 
gccaccacgc 
ctgtggggct 
tgacctctgt 
gatggctggg 
ttccgataca 
atccatgcat 
tttgacatca 
tctctggtac 
gtaatcccag 
ccagcctggc 
gtggtggcag 
acctgggagg 
caaagcgaga 
tttgatggct 
taagttatgt 
gtaaatgtct 
tctgggtcaa 



ttttttgtag 
agtgttctgc 
cagccagctg 
ctacacagag 
atgaagatgg 
tctctgtgat 
tgatcatgaa 
gggtttttca 
tggggccgat 
ctggaacagt 
aaggagaagc 
aaccccagat 
ggaaggctcc 
ccagtcctgg 
ctccattcct 
gctgggactc 
tcctccagat 
gcctctgctt 
gctgatgggt 
tgggctgtga 
tcttaaagca 
tcagggaggc 
ttgggtatta 
ggcatztgggc 
taaggaggcg 
gagatgaagt 
tatccagagc 
tctgcctcca 
ctcttcagcc 
ccaatggtgt 
ggaaatgacc 
tattaattaa 
cagcacaatc 
ctttgttgag 
gacatacctc 
tctgggtgtc 
tttctggctg 
atttctgcgt 
taatcaatac 
tafccctcatg 
agaccgtctg 
tacctaagtc 
gtcaagagga 
taagcagtcn 
nnnnnnnnnn 
atttttggta 
aggtgatcca 
caggccccac 
ggtgtggaca 
tactcatatt 
caaaccctgc 
tgafctctggg 
gcatcggcat 
atagtagcat 
tgctcaatat 
catfcttggga 
catcatggtg 
gcgcctgtaa 
cagatgttgc 
ctctgtctca 
gtaggtggat 
cttcagcatc 
ttttgaacat 
agaggatatg 



agatggggtt 
ctgccttggc 
atgctcttaa 
catgagttct 
gtaattcttt 
tctgtgtctc 
aagaggaaat 
cattgagagc 
tgaacactgg 
ttccatggtg 
tggggtggga 
gatactcaag 
actgcggact 
ctctggagcc 
acctctgtga 
tgtacagagg 
gctgcacggg 
ggaagctgaa 
tcttatagat 
ccactctcaa 
aacacagact 
agcaggcctc 
ataatgacaa 
taagagctat 
gaggtactat 
aacttggcca 
ccatgttttc 
ggaatcctgt 
tgatagtata 
tccctcccct 
cacactaaac 
cactgagaaa 
tctttcagga 
tacaaagtca 
tataaagcct 
ctctcctttg 
catttatcat 
atttaataga 
atactagttg 
cttccttata 
ggtttgaatc 
agtttcccct 
ttcaatgagg 
nnnnnnnnnn 
nnnnnnnnnn 
gagacagggt 
cccgcctcgg 
acacctttta 
tgtgggtgaa 
cccacactcc 
ggcaaaccat 
cagctgttgt 
atgtgtataa 
attattaaat 
aagaacatat 
ggctgagatg 
aaacccccat 
tcccagctac 
agtaagccga 
ataaaaaaaa 
gttctaaaat 
atatgaaact 
ttctagaata 
cattttacat 



tcgccatgtt 
cccacaaagt 
tctgtgccct 
ggaatcgggt 
cagacagcac 
cctgccctgc 
ggatccagga 
tttgcccaac 
acttttgttg 
ctggaggatg 
ggggaacctc 
gcatggcatt 
gtagacagag 
ccctgggtgg 
agcgagtgct 
ctgacattaa 
agagggcaga 
gtggcctgag 
tgtacatgca 
ggaaagagcc 
caatccttat 
aaatgtgtgc 
cagtaatgac 
atgtaatata 
cattatgcca 
agtcactcag 
accattatgc 
cttgatgttc 
tcttttcata 
acccctccct 
aaactcataa 
tgaaaccacc 
aggacaggct 
gagggaaggg 
ccatcctgcc 
gacagtgctg 
gattgtggaa 
actgaaaggc 
tgttgccctt 
atggagggac 
ctggctctgt 
tctctgaatt 
gaatacacat 
nnnnnnnnnn 
nnnnnnnnnn 
tttgccatgt 
cctcccaaag 
aacaaccaga 

ggtgg cac tg 

tcaaattagc 
tccccagccc 
tacccgtgtc 
ttattatatc 
tgttctgtac 
agacctggcc 
ggtggatcac 
ttctactaaa 
ttgggaggct 
gatcacgcca 
aaaaggaatg 
ttgtgtaacc 
tacaaacaag 
attgcaggat 
ttaatagata 



ggccagactg 
gctaggatta 
acccagcctt 
tgatgggggt 
ccttgattaa 
tcacacagac 
gaaggagacg 
accccaaaga 
tggaaaaagg 
gggaagtggg 
cacttgccag 
agaccagaag 
cactggacaa 
gctgccccgg 
gaacctctct 
gcagggatct 
aaaggcctat 
agtgactcag 
gctctcctcg 
ctatctgcaa 
tccttttaag 
ctttctagaa 
agtcatttac 
tattattatt 
actttaaaga 
ccagtaaatg 
tgaagtacct 
ccttccccat 
ccattctttg 
gggagcttag 
gagactgatt 
cagcagatgg 
tttgggaaag 
agttgatgga 
aaggatcaga 
gatttttctg 
ggctttttgt 
aatttcccat 
tgcagagaat 
agagatggta 
tacttataag 
ggggatataa 
aaagtgctca 
nnnnnnnnnn 
nnnnnnnnnn 
tggccaagct 
tgctgggatt 
tttcattcat 
ggagaagtta 
ctgagtctcg 
tgccctctca 
ctccatgttc 
tatatttcat 
attattttat 
aggcacagtg 
ttgaggtcag 
aatacaaaaa 
gaggcaggag 
ctgcactcca 
tatagacctt 
aatctcctat 
gttgcattga 
aaactcctaa 
tttgtcaaat 



108480 
108540 
108600 
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109140 
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110040 
110100 
110160 
110220 
110280 
110340 
110400 
110460 
110520 
110580 
110640 
110700 
110760 
110820 
110880 
110940 
111000 
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111120 
111180 
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tgtcttccaa 
tttccccaca 
gaagctgttg 
agagaggtta 
cgatagcaga 
gaatggagga 
ccaccctgaa 
agtatttgga 
gaacattata 
atataatgta 
tttggcaggc 
catagcgaga 
atgcatgctt 
gagtttgagg 
aggaagacca 
aaaagcagag 
ctgttactta 
aacatataaa 
gagataatgt 
ggaaattcat 
ctttgatctg 
aggtataggg 
accaggcagg 
cccaaagggg 
tttatcgcgc 
gcttttacaa 
tttaaaatta 
tagcgataca 
tcagctcaaa 
gtaaaattag 
gataatgttt 
gaggatgatc 
caggctggct 
actgcacact 
agggtatatg 
acaagatttt 
ctaggaggac 
gtctcaaaaa 
acgtgctcac 
tcatcacagt 
tttgttttta 
tcttcaagta 
catattctcc 
tgcaaafctct 
tttctgcata 
aactgcagtt 
gcctggagct 
ttagtcagtc 
tccagagttt 
agtgctgggt 
cacacggctt 
tgtttactct 
ctgttgaagt 
ttaattgctt 
aactacaccc 
gtcactggcc 
tcccccacta 
ccatttgccc 
agtggcttca 
cagaaatgag 
ggtgcctata 
cagagtctct 
tcacggccac 
ttacccactc 



agtggtcgta 
ccctggagag 
gccgttatat 
gttagtgggt 
ggagggtgac 
cttaaaatat 
tgtgccagat 
tgggagaaat 
cattctatat 
tcagtaaaga 
cagggtggga 
ctctgtctcc 
atagtcccag 
ctgcagtgag 
tgtttctaaa 
tggtaagaga 
cccatcgtgt 
atagaagtaa 
atataaagct 
tgttgtagtg 
gatttggctc 
tggtctgagc 
tgctgataag 
ggtgtacaaa 
actgaaaaac 
aaaatgctaa 
ctaagtcata 
ttaatgactg 
ttctgcaatt 
gcaggagggc 
gtttttatct 
tggctgtgac 
gggcaggtgt 
tggtcaaaga 
agcacctgtt 
attgtgcatg 
tgcttgagcc 
aaagaaagaa 
ttctctcttt 
gagcacaggc 
cagaactcat 
aaaggaaaac 
agactttgtc 
tttatcctga 
ttctttataa 
ttcttaacca 
tttttttgtc 
aaaaagtaat 
tgcacactgc 
gatggtgttt 
tcatttctat 
ctgcataccc 
cttgaggctt 
ccagccagat 
tggacagatt 
tactgtgfctt 
ccagtagcag 
cacctcctac 
acccacagaa 
aaaagcgttg 
aggaaatggg 
tgccccatga 
tctaggcagg 
tgctgaacac 



ccaattaaca 
atgaaaaatt 
aaccctcgtc 
gcaaacatac 
tatagttaac 
tccaacacat 
cttgtttgtt 
gataaatgct 
atgtaacaaa 
gagggctggg 
ggatagcttg 
acaaaaaata 
ctacttggga 
cctacgactg 
agaaataaat 
aaggactttg 
gataagcaaa 
aaatcatacc 
tctggaataa 
ggcagccttc 
aggcaagacc 
ttccccagca 
accggaatgg 
acgatctcgt 
aaaaccaaca 
agtaggcatt 
gtacataatg 
aagttcttta 
acacaggctg 
taacctcgct 
cacagagttg 
acctgtcacc 
cccctttctc 
ggaagacctt 
ctgtcctgcc 
gtagcatgag 
caggagtttg 
agaaagaaaa 
ggaatgtagt 
taaagcatct 
tatgttgtga 
cacccatcat 
aacatgaata 
cttttataaa 
ttatcatttt 
tttcactgtc 
tattgcatta 
aatattttca 
tcccagagat 
ataaacttct 
ctctttgatc 
tcttgggtga 
gttttataaa 
ttggccactg 
aaattaattg 
gatcctatag 
gaggctccat 
aataccccac 
tttaagggga 
ttcacatggg 
ggagggttgg 
tagagggttc 
gcttcccaat 
agatgttacc 



ccccgacctg 
tatgggccca 
attaataagc 
aattagafcag 
aacaatgtat 
agaaataata 
cttggaagct 
tgaggtgata 
atatcacacg 
cacggtggct 
aggccaggag 
aaaataaaaa 
ggctgatgca 
caccactgca 
taaataaaat 
gggctcaaca 
tgccttaacc 
cacttcaagg 
tgcctggcac 
tgaatctgtg 
tggggaaggg 
gagtgaggct 
gaggctggag 
atgactcctt 
tatttaatga 
cacatgttta 
tgtgagccac 
aacatcaaca 
gggttgaaac 
gaatctcaga 
ttgtgaagat 
ccactgatct 
cctcaccact 
tcctgataga 
agaatctccg 
cctgtaatcc 
agaccagcct 
gaataataat 
aagtgtacct 
tgactttatt 
aaataatttt 
tctcccaacc 
cttactttac 
caaatatgat 
gtggctgcat 
tggggaaatg 
tattcttaag 
aggctcttgt 
gtaggaacac 
gctaatttaa 
attaacaggt 
agtagtcatc 
tgtgagcgag 
aggctcctga 
gagaaaatgg 
tgggggcctg 
gtgtccccca 
atacatgtac 
aaggaattgt 
atcacctaat 
agagttgtgc 
tcaaattaag 
gcttccccaa 
catagcacct 



taatgaatga 
ctttggagtg 
ctgggggtgg 
aagtaataag 
tgtatatttc 
aatgcttgtc 
aagcagggtt 
gatatcctaa 
tatcccataa 
cacatctgta 
ttcaagatca 
cgaattagcc 
ggaggattgc 
ctctccagcc 
aaataaaaat 
gtactagcct 
cctgtgtgcc 
gtcattataa 
acagtaggag 
tcctctttgt 
cagagactga 
gggaaaggtc 
cataaggcag 
tatactgtta 
atgattccaa 
aaaattgagt 
agctatcccc 
tacaatgcca 
ccagcttttt 
tgtctagtct 
tcaataaaat 
ccagagttga 
ccgcatgcat 
ggaggaccat 
aaggagctct 
cagctactca 
gcgcaacata 
agtaataaat 
aataaatgtg 
ctataagcaa 
ccaacattaa 
ttcaataatt 
atggtcgcaa 
gttataaacc 
aatattgcat 
gaggataatg 
atcaaatccc 
tatattttac 
agacgtcatc 
taagtatgaa 
tgaactattt 
cacttccttc 
cacttcagag 
gcaggggaat 
gctgagaggc 
aactggggca 
tattagagct 
tcactctccc 
tctgtcctgt 
gaagggatgc 
aaaatgcaac 
agagtctaca 
ccccacctcc 
tgcaccatga 



gagtgccttt 
catggtggag 
ggggggagaa 
ttctaatgtt 
aaaatagcta 
tgcggccatg 
gaacctggtt 
ataccctgtc 
atatgtacaa 
atcccagcaa 
gcctgggcaa 
aggcgtggtg 
ttgagcccag 
taggcaacag 
aaaaagactg 
tgaaccctgg 
tcactttctt 
aaagccaata 
tttaataact 
ccactaatgg 
gggcaactgg 
tgggagacag 
ttcagttttt 
atgttttcat 
ggggattctt 
tgatttaaat 
aaaatcatga 
attccagaat 
tgctaactgt 
gtaaattgaa 
cacaacatgt 
trtcggctgat 
tcctcccgaa 
tcttcagtca 
cagtaaaatc 
ggaggccgag 
gtgagaccct 
cacctgtgca 
atcattgtaa 
taaaagagga 
caaagaacat 
ttcaattttg 
tcagtgttca 
ggtctccatg 
tgactatgtt 
ccagggtcat 
agcagtgaga 
tagattgttt 
caaccttgcc 
atgctatcct 
tccaagtatt 
acctgtttat 
tcaatagaca 
gcatgatcaa 
agagatatgt 
acggcctgag 
tgcggcactt 
ttgcaaatct 
tcacttactg 
catccccaac 
agggaatcat 
gcaactaatc 
atcctagact 
ttgtttgatt 



112320 
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112440 
112500 
112560 
112620 
112680 
112740 
112800 
112860 
112920 
112980 
113040 
113100 
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113700 
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114000 
114060 
114120 
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114240 
114300 
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114480 
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115140 
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agcacctccc 
ctgcagggct 
tcagcaagtt 
aaacaatggg 
ggttaagact 
agcctggctt 
ccagctgccc 
acagcctggg 
cctagcctcc 
ctaggttcaa 
gccaggcccg 
cacttgagga 
aaaatacaaa 
tgaggcagga 
attgcactcc 
gcagcttccc 
atgcacatga 
ccccttggcc 
agcccacaaa 
ctaagtgtgg 
cttcttgccc 
acagggctca 
ctgtttctcc 
atcagccagc 
tataaataat 
aggaggatgg 
aaaaatacaa 
gctgaggcag 
ccattgcatt 
catacataca 
accctaaaaa 
aatttaggtt 
ccaggcagaa 
gtccctccac 
gcctggtgac 
aatctgattt 
tgggatctag 
ctggatgata 
ccttcaatga 
tcggaggcag 
cacagtgggc 
tagtacacgt 
cactccttat 
tgggaggcca 
tagtgagacc 
agctacttgg 
agccaggatc 
gtctcaaaat 
caatgcctat 
accaaataca 
aaacaactct 
ttatctatgt 
agtcaccagg 
tttattgatt 
tcgacagggc 
ttgctgggat 
actggggttt 
gagccagggc 
acactaaccg 
ggaggttctg 
cattgagagt 
gtgccaataa 
atggggtggg 
tctccagaga 



acagtagact gtgtttctga taggtcagca acatttgctg 
gtgccaggtg cacaaaataa acaaagccaa agacaacatg 
cagagtcaag tgggataggg aggctctctt cactggaagg 
actcaacttt ctaaccaaga gaactccagg gagctaaaat 
ggtgtggagc ttcattaaag aagaaaagat tcacccagac 
tgcagctttt aagtcatgta acctttgatg aagttatgtg 
ctaacacctt gcaggggcag ggctggagtg caaagggagg 
aggcaccacc ccactagtgc aagccgggca acctctgccc 
caactgcaag catcaatctt gcacttggaa aggaacctca 
atttagaatg atccagctcc ttgaagttct atacagaaat 
gtggctcacg cctgtaatcc cagcactttg ggaggctgag 
cagaagttcg agaccagcct gaccaacatg gtgaaacccc 
attagccagg catggtggta cacgcctgta atcccagcta 
gaattgcttg aacccaggag gcagaggttg aagtgagcca 
agcctgggca acaagagcga aactccatct caaaaaaaaa 
tccacttccc aaccacagct ccatctcaga caacaagggg 
atatccaacc aacatgtcta aggcccaacc acaccctctc 
accctttggc catgggttca tgcactggca gaaaggtagt 
gggccgggaa gtccacttgg gctttttgag attccagggt 
tctagaagag agatgcagct tctgggaggc acattccttg 
ccatggaggg aaactggcta gatgagggcc aaagcagagc 
gggaaggact ctttttgacc agatctaaga gcagcactac 
atctgtaaga aggggacatt aatagactct ccccgctaga 
acacgtaagt tcatgacatg aagcaagggc t.taatatata 
aggccaggcg tggtggctca cacttgtaac cccagcactt 
atcacaaggt caggagtttg agatcagcct ggtgaaacct 
aaattggccg ggtgtggtgg cgtgcacctg tagtcccagc 
gagaattgct tgaacccggg aggcagagtt tgcagtgagc 
ccagcctggg tgacagagca agactgcatc tcaaaataaa 
tacatacata catacataca tacatacata catacaatac 
tgagacaggg aaagagaaaa acatgttctg acaaccttgc 
ttcttgcctg ttttagaaag ggcctggaca ggagccctgt 
caaggtgtgg aactcactgt ggaagggttc tgggtgacaa 
ctcccagcac agtaggcagc acgtgtctcc attgactggc 
cagtgggaga gctgaggagc ccagggtggg gtctgaagga 
tcccccaggg cccacatcac gtgcccagag ctgggaaagt 



ctgagaggct ccatttttgg tagcttctag tttgggagtc 
cgaagatgta gctttgcagg actctctaga acatggagtc 
tgggacactg aagcccacag aggagaggtc tgtcccagtt 
agaccaggct agaactcagg acttttaatt tggaccagga 
agccctagca agtgccaggt agggtggaac tgtgaaggtc 
aagtaggaag tcatatctaa gaactgaccc ccagacctgg 



yyy ^"y^^* ^ — - 

gagaccacag gtgctgggtg cagtggttca cacctgcaat 
aggcaggcag attgcttgat tccaggagtt cgagaccagc 
cccacctcta ccaaaattag ccaggcgtgg tggtgtctgc 
gaggctgagg tgggaggact gcttgagcct gggaggcgga 
atgccactgc acaccagcct ggatgacaga gtgagacaga 
aaataaataa atgacagcag atcatcattt ttctttctgc 
ttctccaggt agtcactagg ataaaagtaa aaataatatt 
gggtcagcta ctctgttatg ttctttcatg ctttgtttct 
atgagctggg aacaagtatc gtccttcttc ctcccatctt 
atttatctat ctattcattt atttatttat tttgagacaa 
gatggcctca aacttgagct aggaactagt gtcaccaccc 
gattgattga ttgattgact ggttaatttt gaggcagggg 
tggtcttgaa ctcctagtct taagcaatcc gcccgcctca 
taccaacacg agccaccatg cccagcccct cccatcttct 
gaaaaggtaa gcgacttgcc caaggtcccc tccctagcta 
acaaacccat caaagcctgt gctctcgccc attgagccac 
ccaagtgttc tacacagtga aggtgacaaa gaggtgaagg 
tggactcact cggtgggtat gcccagaggg aagggggatc 
agctgcgctg ttagtaagtg agaactcgga agtccagact 
acccctcctt ctacctggtc tcctttccaa agccagctgt 
cgggggcggg tgtcctcctc cttctcaggg aaaatccgac 
tcttggcttc ccgtggggct gcagatccac ctagagccac 



agcacctact 
gaccctgaac 
taactccaag 
tctgacttct 
ttgagttcat 
acctctccac 
cactggtacc 
ccaaggcatc 
cctttgaaat 
acagccagca 
gtgggtggat 
gtttctacta 
cttgggaggc 
agatcgtgcc 
caaaacacac 
cctcatgtcc 
caaacatctg 
tcagagaaga 
ccaggataac 
gtcttaggga 
cctctaaagc 
ctctctgagc 
gttactctac 
cccgttgtac 
tgggaggccg 
catctctacc 
tacttgggag 
caagatctca 
taaatacaca 
atggacaggg 
cctttatact 
tcccctcagg 
gtgcagcccc 
tcaggagcag 
atccctagaa 
ggaggcagca 
acagagacac 
caagatattc 
actcagccat 
ttccttttac 
atccgagggg 
ctctgccact 
cccagcactt 
ctgggaaaca 
ctgtagtccc 
ggttgcggtg 
atgacacact 
ctctagactg 
atcagcattt 
tttaagcctc 
atttatttat 
ggtccttcta 
cccaatttct 
tctcgctaag 
gcctcccaaa 
gaataggaaa 
gagagcttca 
cggacctcgt 
gaagagccag 
ttgggtggca 
catccagtct 
tctccagaca 
gctgagccca 
cagagggcgg 



116160 
116220 
116280 
116340 
116400 
116460 
116520 
116580 
116640 
116700 
116760 
116820 
116880 
116940 
117000 
117060 
117120 
117180 
117240 
117300 
117360 
117420 
117480 
117540 
117600 
117660 
117720 
117780 
117840 
117900 
117960 
118020 
118080 
118140 
118200 
118260 
118320 
118380 
118440 
118500 
118560 
118620 
118680 
118740 
118800 
118860 
118920 
118980 
119040 
119100 
119160 
119220 
119280 
119340 
119400 
119460 
119520 
119580 
119640 
119700 
119760 
119820 
119880 
119940 
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gccagcactg 
ccagtgtcca 
agcccaactc 
agaattactg 
acctcatttt 
cgatgaccgg 
attgtctagg 
ctccccccaa 
gtaagagggg 
atgagaccag 
gctggaaaca 
tgggggtgga 
gcggggatga 
ctttcctcgc 
cttgggggcc 
gacagcacac 
gtgatatttc 
acccagcgac 
acccccaaca 
tcggggtggg 
ctgggtgagg 
acgagagggg 
ccgaaaatgg 
ggaatgaaac 
acgagagaca 
gacagtgaga 
gagagagagt 
cgggggagaa 
agcagagaat 
cactgcattg 
aggggcttct 
cacctgcggc 
gctcccccga 
gagtcaagac 
ctgctgatac 
tccgcccctc 
gctttgggtc 
acatacacaa 
cactaagggg 
gcgcaaaacc 
ttcagcggct 
caaagggtaa 
cacgctcata 
gcacactggg 
ggacagaaat 
gggtgaccca 
cggcgttccfc 
gcgaggacga 
gagggggtta 
cagcaaacga 
cctcgctgtc 
gactctcgag 
cagccttccc 
ggcccgcagc 
tcccgggccc 
gagtgggagc 
gggctcaggc 
gggagagaag 
ggtaacctgg 
tctcagcagg 
ggtgggacgc 
tgtccccgct 
actcagcaac 
atgtgggtag 



cggccaaggc 
agagtccagc 
atgttcaggc 
ccaaaatagt 
accaagcact 
gtagcctcac 
gtcacctgga 
ccctaaccgg 
tcaga'gtgct 
ccagccaccc 
cgacaatgga 
gggggtgagt 
gcagaggggc 
agctcgtccc 
cggggtcccc 
ctgagtcagc 
gctccccggg 
agattgtgcc 
ctccctcccc 
tcctggggaa 
gaagacggga 
cgggaaagga 
aaccagcagg 
tggggaagag 
gagacgcagg 
aagacagaag 
gagggaggga 
aaccagagaa 
gaatgggaag 
gaccctggca 
gtgctcgcgg 
tccagcagcc 
gggccccgag 
ctgggtcaac 
cgcctgtgac 
tttaaaaggg 
ctctcctgca 
aaaaacgcac 
catacacaca 
tcgcacagcc 
cacccgtgtg 
cgggcaagca 
tgcactcgat 
cttgaggtct 
gctggaggct 
gaagctcatc 
ctaaactacc 
tcgaacacag 
gaaagccgct 
gggaggagcc 
cccactctcc 
cggacagcgc 
cgccccgtcg 
ccgggcggcg 
tcccggagcc 
gccctccccc 
gcctgctgca 
cagggcgtcc 
ggaaggggaa 
agggcacttg 

agggggcagg 

ccacctgctc 
tcctcctgcc 
ggggctgtct 



ttgaagaccc agcacaccaa 
cagaggccga gtcctcgatc 
atgatgtgtc tgattctact 
aacgacatta gctacctacc 
ttctcatgcc tggggtgcct 
cgtacagatg aggaaactga 
gaactctgat ctccagactc 
agctaggtgg ggtggggaca 
ctacagagaa gaccaaatgc 
acccaccagc cagccaccta 



tgacacgagt tccctgccct 
cagcagataa ttatgggaaa 
tgcgactgcc tggagccagg 
aggaggagga gctccccccc 
tcccacccct ccccgaagag 
ccgccgccca cccgcccctc 
agccagcccc actgcgctcc 
gcggctcatt ccggggaagg 
gccgccgcct ccaggccctc 
acgcagggtc ctgtcctgcc 
gccccagaga cttttctttc 
aatttagggg cacagagagg 
gggcacccga gagccgaggt 
ggggagagaa agggaggcag 
gagccccgcg gggaggagga 
accgggcagg gaaacagacg 
gggaacagag agacagagac- 
atcgaatgag aacgcgagaa 
aataagacca acatttatca 
cggataggaa aggaggagcc 
gagcggcagc ccagggggct 
ccaaccccgc cagcgctgcc 
cctatcctac gccggggcgg 
cgccctgcag cctttgtagg 
caggccatga agggccagag 
ggtagaggaa gttcacgcga 
cccccgcggg agataaggtc 
acactcgcac gcgcgcccat 
ccgggcacat atttctttcc 
tcacgtttcc caccagctca 
cacactcacg tgcccccccc 
tcctgagtca cacctgcaca 
cttgcacgca caaactcttg 
gggagtggaa ggaaaagtgg 
gggacactgg cgcgagggac 
ttctcctgga aagttgggag 
gcattccccc aagaagggat 
tcctccgggt cgcttaagcg 
cccgcctcct agtggtcgag 
agcgagtgcg ggaaggagtg 
ctcggctagc agcctgggca 
agctagcggg gcgcgggcgc 
cgccccgccc cgtcccgtcg 
cagggtagag cgccgcggcc 
ccgcggggtc cccgccgtgc 
cgctgccccc tcccccgagc 
aggtaagaac gccagcggcg 
cctctttcag ggattgaggg 
aagctcagcg ctggggccgc 
gctgggagcc cgcgggcgcg 
tggacccggc ccggagcggg 
cgggggacgc tgaggactcg 
cggtgcctca gcactttctg 
ggggaggtag gaggcgcaga 



agcccggcca 
tcaaaatgtc 
gggacaatca 
acccctccac 
ggagacttaa 
ggcacaagga 
aaatttccaa 
gcaaatgtgg 
attgtggcac 
aaagtcttca 
caaaaagctg 
ccgtgacacc 
gattcccgga 
agcttcgggg 
cgcgggcccc 
agcgtctgtc 

gg a gg ca g ct 

acgccaaacc 
cccccaggcg 
tcctggaaat 
tgtttctacc 
agctgggggg 
gcccacgggc 
agacaccgag 
gagagacgaa 
agtagagaca 
cacgaaatat 
gaacgagaga 
agagccgact 
gcggcgcggg 
cagcagcccc 
tggccaccgt 
ctccgcggac 
gaagtgccta 
gggctccagt 
agccaacagt 
tcccctcccg 
ctcgcacccg 
acccatcccc 
gacatgcacg 
cccgcttccc 
agcatccttg 
catatactat 
aatcttggag 
gcggctgggg 

ggggg aaca 9 

ttctctagaa 
ggggggaggg 
aaagggttaa 
ggggtggHtg 
cacggacaga 
tgggcgtcga 
gggccgatgg 
cggccacgca 
atccggcggg 
atcgagacaa 
ggagagcgga 
tggggcagtt 
gcccccgccg 
tgcgaggagc 
gagggaggct 
ggccggctgg 
gccacctggg 
gggaaatcca 



agcctccagc 
taactgcaga 
ttgccaccaa 
acaccaacac 
tgcagcctcg 

a gggg a g tac 

ttcgtccccc 
atgggggg a g 
ctactgtaaa 
gtgggcacct 
gtagtctagt 
tgtataaggg 
cggggcttcc 
ttccgcctgc 
gggaaccgat 
tccgcatctt 
cggcaaacaa 
ccaccctgct 
caggccctag 

aggggg a g cc 

tgatccgaaa 
ccgagaaggt 
cgggagcctg 
acacacagag 
gacacagaga 
gaaaaggtcc 
gagtaagagt 
ccgtggaggg 
gtatgccagg 
cagcggggcg 
ggcaccgccg 
acccgaagcg 
gcgccgggcc 
ggtgat'gggt 
gagaccataa 
cttctcccca 
gacacatcat 
cttgtaaatg 
aagatcgcaa 
ctggcggact 
caagcccgta 
cgcgcacgtg 
tcttatagtc 
ctgtcccagg 
gcggggg a gg 
gacaagtcca 
gagtggcgcc 
gggcggggtg 
gtcggcaagc 
ggaagagctt 
cggactgacg 
cggccagccc 
ctcctcccga 
gcccggggac 
ctcagggagc 
gatgctgccc 
gggcatcctg 
ggggaggtgg 
ccagggctgt 
tcgtgaccga 
caggttccgc 
ggaagcgccg 
aagacaggag 
agtggccctc 



120000 
120060 
120120 
120180 
120240 
120300 
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121920 
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122160 
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122400 
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122700 
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tctggtagga 
aagggtgtgg 
aggagaactc 
gggggagggg 
ggtgaagacc 
agactatgga 
gggaccctgg 
gggatggtga 
atggatggga 
agggactggg 
tccacgtggg 
gtggctggtg 
gacagagggt 
gctcctgtgc 
accacggtgg 
cagctctaaa 
accaccacct 
tgcccgcagg 
gtctgggacc 
atcctttcct 
ctgcccccta 
cggcccaaag 
ttgatagcca 
cttcagcctt 
ttctggccat 
tctagaacaa 
ttccttgcct 
gcacctgggt 
cgtggtttct 
tgggcttgat 
ggccactgtg 
ttgacctcca 
tgcagtgcct 
ctgcagctct 
aggctgctac 
ctgagaaaga 
aactcacact 
taggtggcca 
ctccaccggg 
ggaggcctgg 
gggaagggac 
ggggagagag 
ccctggagtg 
cctcccctcc 
gttacagctg 
gcagcagctg 
ccactatcat 
aggagagtgg 
ctagggagag 
cggaaggaac 
ctgtctctcc 
gggctagcgg 
ggatcagaga 
ggtgactctg 
tccctgccaa 
tcatcacctc 
gaaatctcca 
tgtgcttcca 
accccacgtg 
aggggtgaga 
tgtcactgag 
acgcggagga 
gacactcacg 
ccgcacctgt 



gagatggagg 
gccagagact 
aaagatggag 
gagaaggaat 
aaggaagagg 
cagggaccca 
ggactggaat 
ctagagaagg 
gaggggctgg 
gattgggggt 
aagccctcca 
ggccagatgg 
gaggactttg 
tgccatttat 
ctgcttattt 
cctccgtcac 
ccatcccact 
ccccagccct 
ctgtgcccct 
gggaccactc 
ctttgggtcc 
agtggcctgt 
gagtcctgga 
ccgggatcaa 
atattctgga 
ccgcaactgg 
tgaccctgaa 
gcaatttctc 
gcggaggatg 
gtccctcttt 
tcatgtttct 
ggggattatg 
aggacaaaat 
agccatcccc 
aaagggcttt 
caatgaaggc 
tcccctcccc 
ccgaggtatc 
aattcacaaa 
gcctggaggg 
gtggcaagaa 
ggaaaggcac 
ctcgctccag 
tccttgcctc 
ctttgcacgg 
ttatggagga 
gggctggggg 
ggaaagagaa 
aaagaggagt 
cctgatgcca 
tggggcagcg 
ccatcagggc 
aggatttggc 
acaggaaggg 
cactttgctg 
agtagaactg 
gttctaagcc 
cactccacct 
ggtctaagaa 
agggccgaga 
tattacatca 
cagccagcac 
gtggggccca 
gtgctccccc 



gcgctagaaa 
gggggtgggg 
aaagtggtga 
aaaggttaga 
gaaggagagg 
gaatcctggg 
ggaaaaggag 
ggtatcagga 
agaggaggga 
gctgcccagg 
ttcccaaagc 
ggggtgcttt 
gaaggggagt 
ttatccggcc 
tggggtgtta 
ccgcgtcagt 
cctcagctcc 
tccctgaagc 
ccttccagag 
tgcaccagcg 
agttttcttc 
gggctgagaa 
cagtggtccc 
taccatgttc 
caggggtcat 
aagcctagga 
tagccaaaca 
tcctgtcttt 
tggccacctc 
tccaatctgg 
ggggctccct 
tgtgttgtgt 
aggtgcttct 
tgctgacttt 
cctctgtcca 
gagctgtaga 
agccctcaat 
cccctttcca 
atgaaggcca 
aaggagtggt 
agaaaagggc 
agctctctag 
ccccagcagg 
cagtgagagc 
cagtggcaag 
gcccccagca 
aggtgagccc 
gagaagtagt 
aggcaccctt 
ggggaatggg 
gtcagtccgg 
tcacactctg 
ttcttttggt 
aagtgccctg 
tgtgaccttg 
atgcttgaac 
ccaaacctca 
ctgcctggtt 
actcgggagg 
aacgcggagg 
cacccagcct 
gcaccgacgc 
aaagcgagga 
ctctggagga 



gaggatagtt 
tggggagggg 
gggaagctca 
tgggaaaagc 
aaagaccaga 
atggaggtag 
aatggaaaga 
accggcgaag 
agaggaggag 
gatgagacaa 
gcctgcctgc 
gagctcaggg 
gacagcctcc 
cggacgttgg 
cattctggca 
gcctccccag 
cacctcctca 
agcccgttgg 
cgagaggcct 
actctgccct 
tcctcaagtt 
ctttgtttct 
tcagtgaaca 
tggcctctct 
ctcttcttga 
ggtgaaagtt 
gactcagcat 
acctcaaggg 
ttaaagaaag 
gttccacctt 
tccttcatct 
tcaatgtata 
tggtttcctc 
agaggagggg 
tgaagtagtg 
gacctggtca 
ggtgttacct 
gctctgatac 
ggagtggagc 
ggtgctgagg 
acacactggg 
tcccccaacc 
cctggggcag 
cgctgcgtga 
ggccagaaat 
ccgggtcgct 
tggttggggg 
ctaatgtggg 
gccagctcct 
ccttgcctct 
aggctggggg 
gtgagcttgt 
gtcaagtcct 
gctgggcatc 
ggtaagtcgc 
cagaggaatc 
accctcatga 
cagatgaggg 
agaaagtaat 
tggtgtgaac 
acacacgcac 
agcaccgacg 
gcagcacact 
ggaacaccag 



ctactgattg 
tcagggggag 
aaggaggagg 
gtggagggaa 
tcaggggagg 
cgggaaagag 
tcagaaacca 
agggttggag 
gaagagaaag 
agaggcttct 
cacatttctt 
ccctgggggt 
gagggtgggc 
attctgcagc 
gagtgagaag 
gcccctgcgt 
gcccctgccc 
gtgtggagcc 
ctgctgcctt 
gtgggtgggt 
ccttcttcta 
gagccttggt 
gatacttttg 
tggctccctc 
ctcccacatg 
gcagagagag 
tgtggctggc 
cagtgtctca 
atcagagtgt 
gtactagctg 
gcaaattggg 
aagaagttaa 
ctaccctgct 
gtgagcagag 
gagggatgaa 
ggaggcctgg 
atgatgtgag 
tctgtgcatc 
cgtggtcctc 
aggagtgaga 
cagggcaggg 
ccccagtccc 
tgaagcccag 
attatggatg 
ggcaacagag 
cttcagagag 
agacatggga 
caggtgggga 
gcagagttta 
gagattgcac 
aaagctctgt 
ggataagggg 
tagggaagtg 
aagagacttt 
ttgctctctc 
gaggggacct 
aacccactca 
gtaagagaca 
cgtgaaacgc 
gaatggaaca 
ggggcccggc 
cagcgccagg 
gggagtgtgg 
ggcagctggg 



agtgacagat 
agggatagga 
gagatggagc 

g t ggg accca 

gatgggaaga 
aatcaggact 
gagaaggatg 
acagggaacc 
gctgagagag 
ggtaaccact 
ctctcaggga 
ggctgtgagg 
aaacaggctg 
cgctgccgcc 
ctgtttgcag 
cactggcatc 
cctcagcatc 
cttgcttctc 
tccagggagc 
agcctggatc 
caggggcctc 
actccaaggt 
gctctggaca 
ccctggtcag 
taatcactac 
ctggagtccc 
ccagccctag 
cacattcagg 
ctctctgaca 
catgacctga 
ggccacaata 
cctgtacaaa 
gtactctccc 
agggtggggg 
atgaaggctt 
ggtgctcaga 
gggtcggctc 
ttgccccagt 
gggagagaca 
acagggggtg 
actgagggcg 
accacctctg 
agccccctcc 
agctccttgg 
tcactgttat 
cctgcaggga 
acaagatgga 
gcaggagagt 
ccctcaaggc 
atccttccct 
aatcctccag 
taggattaag 
gagatcagag 
tctggccctt 
tgagctccag 
ttgcggcttt 
gggtccccac 
ttgctcctcc 
cgcacggggg 
gcagccgctg 
gctcacacac 
aggggccggg 
atcttccacc 
atgccagcgc 



123840 
123900 
123960 
124020 
124080 
124140 
124200 
124260 
124320 
124380 
124440 
124500 
124560 
124620 
124680 
124740 
124800 
124860 
124920 
124980 
125040 
125100 
125160 
125220 
125280 
125340 
125400 
125460 
125520 
125580 
125640 
125700 
125760 
125820 
125880 
125940 
126000 
126060 
126120 
126180 
126240 
126300 
126360 
126420 
126480 
126540 
126600 
126660 
126720 
126780 
126840 
126900 
126960 
127020 
127080 
127140 
127200 
127260 
127320 
127380 
127440 
127500 
127560 
127620 



59/127 



BNSDOCID: <WO 0229059A3JA> 



WO 02/029059 



PCT/US01/31488 



cacactcggg 
gagcacctgg 
gcaacctcag 
gccttggaga 
cccaaggagc 
agcctaggac 
cctggctttg 
aggagtcacc 
tatcagctat 
tatgagtaca 
catcagtatc 
aaaaaaaaaa 
gccaaggtgg 
gaccccatgc 
tgtcacccag 
gttca caeca 
atgcccggct 
aggatggtct 
attacaggcg 
ggcacgggcc 
agtgatccaa 
caagaccctg 
agcctggggt 
aggactggct 
ggctctgact 
tggcaagacc 
ccaagttact 
atgtctggag 
tagaagacag 
ccggcccaac 
tctagacaag 
ecaatttgea 
tcagaaagtg 
catgggagca 
ggtcttgttg 
aaactcctgg 
acatgccacc 
tgttgcccag 
aagtgctggg 
ttggggggct 
tcccctcccc 
ggcggccccc 
ccaggccaac 
cctgcagaac 
cctgtggctc 
agecctggag 
cttccagggc 
gcccggcaac 
cctgctccac 
ctctctgtgg 
acccttcctg 
ctccaggggc 
tctctctctc 
gtctatccct 
tgcctcttct 
cgtgtgtgta 
cccccagggt 
gcccacatca 
aagttgcgct 
gattcccatt 
tgttatgcag 
attgttatta 
aaatcatccc 
aaaatcttcc 



gcctgtcagt 
ctcacgccac 
gagegtgaca 
ccccacactc 
cccaggafcgt 
gttgcttgaa 
ggggtgcagg 
tggaaatcat 
agtgacgg'ca 
gacacctgca 
ataataagga 
gctttggctg 
gaggattgtt 
ctacaatttt 
gctggagtgc 
ttatcgtgtc 
aaattttttt 
cgatctcctg 
tgagccaccg 
tgtggtccca 
getgeaataa 
tctccaaaag 
gtaaaatgea 
ggtggcctca 
ctgeaaggge 
agaccccaga 
tccagtcatt 
acattttggt 
agatgetget 
tgtcaatact 
gtcaaacatg 
gtgetttcta 
gaagtggcca 
ccagggctct 
tgttgcccag 
gttcaagega 
atgcctggct 
gatagtctca 
attacaggtg 
ggcccccagc 
acagctcccg 
agctgcccca 
aacttctcct 
aacctcatcc 
ttctccaaca 
gagctggacc 
ctggagcggc 
atcttccgag 
ctacaggfcga 
gcccctctgc 
cctcagcatc 
tttacttttt 
tctccctcta 
ttccatccat 
gtctgtctcc 
tctgtctctt 
cccctgccca 
cggtgaagta 
tctctccaat 
ttccaaacct 
ctggagaatg 
ttattgatct 
actccatttc 
cttggctaca 



cccatgcgtg 
tacccaaaat 
caacacacac 
aaaatcacca 
cagagtgcag 
gcagaaggtg 
ctcccttttt 
aagaaagtgt 
aaggccaggg 
ctccactctc 
tggatcatat 
gatgeggtag 
tgaggecagg 
tttttttttt 
agtggtgcga 
tcagcctccc 
tttgtatttt 
acctegtgat 
cgcccggcca 
gctcctcagg 
gctgtgatcg 
aaaaaaagaa 
gattcccagg 
cttagagacc 
gaaaagtaca 
gtgagtccat 
ctccaatggg 
tgtcacaact 
aaatgectta 
gaggcagaga 
caatagtgaa 
caatggcett 
ggccacttga 
aggectttat 
gctggagtgc 
tcctcctgcc 
aatttttttt 
aactcctggc 
tgagccacca 
tggcactcct 
cctcggcctg 
tgctctgcac 
ctgtgccgct 
gcacgctgcg 
acctctccac 
tcggtgacaa 
tgeagtcget 
gcctggtcag 
gcctgccctg 
tccccgaccc 
tccatttctc 
cccttctgcc 
actccacaac 
cactgcctct 
cttcacacac 
tctgtgatct 
aaggcctttg 
gagagagaag 
tcactgggca 
gtcatctcaa 
gatgetctga 
aattattgtt 
cccaggaagc 
gagcctccgg 



cacacctggc 
cacagataca 
aaaaccacca 
acccctcagt 
aaacaagtct 
ttcagtcact 
ccccaggcaa 
agaggtcaag 
atgatgggag 
tagcccccag 
ccaaccttca 
ettatgectg 
agtttgagac 
tttttttttt 
tctcggctca 
aagtagctgg 
tagtagagac 
ccacccgcct 
aaaattttta 
aagctgaggc 
taccactgca 
agaagttttt 
ctgtcccccc 
cgacccttaa 
ggaaagtaag 
ttcacacggg 
gtgactttgc 
ggaggcaggg 
tatagggctg 
aaccctgacg 
aacaggaatg 
ttggcattat 
ggctataacg 
ttttattttc 
aagggtgtga 
tcagcctccc 
tttttcttgt 
ctcaagcaat 
tattcagecg 
gccctggaag 
cctcctgctg 
ctgctactca 
gtccctgcca 
gccaggcacc 
catctacccg 
ccggcacctg 
gcatttgtac 
cctgcagtac 
cccccaccct 
tggcgtgcgt 
tctgtctatg 
tctctacctg 
cttcacctct 
ctcactaact 
ccactccgca 
cacgtgtttg 
cagctgtttt 
gcagagccac 
atgggacggg 
tgcaggggaa 
aaatggaagg 
tattgttgtt 
aataacacac 
ctggaagggg 



tgagcagcac 
tacacacatt 
ctaagcaagt 
ctctcccagg 
tcctcccctc 
gtgtgcccag 
aactgecaga 
etagttcegg 
gccctgcacc 
gctctctggg 
aaagttactt 
aaatcccaac 
cagactgagc 
tttgatacag 
ctgcaagctc 
gactacaggc 
ggggtttcac 
cagactccca 
aaaaattagc 
aggaggattg 
ctccagcctg 
aagtaactgc 
aggaattctg 
ggcccctccc 
ggcactgggc 
cctcagatct 
cccccagggg 
tgctgctggc 
cccccacaac 
ttagtctttt 
aagagatgat 
tttttaatat 
ttgtcccctg 
tattttttcc 
tegttgetea 
aagtagttgg 
aaagacaggg 
cctcctgcct 
ggtctaggee 
cccacctagt 
atgctcctgg 
tccccgccca 
cccagcactc 
tttgggtcca 
ggcactttcc 
cgctcgctgg 
cgctgccagc 
ctctacctcc 
cagccccttt 
ccctcctctc 
tctcttttct 
tttaggtccc 
ctgcctctgc 
tgcctccccc 
tacaccccca 
ccttcagggc 
tctcacccac 
agccactggc 
agaagcccac 
gaaagaaaag 
aataccagta 
atgctgactg 
cctccaaacc 
gtgaaaatat 



tgcatttggt 
cacgcacacg 
geaatttgea 
gtctctgaac 
tgccttcaaa 
ggaatgactg 
agaaaatccc 
cctagaactt 
cctattaaaa 
cctgcttttc 

tgggggaaaa 
actttgggag 
aacatagcaa 
agtctcgctg 
cgcctcccgg 
gcccgccacc 
cgtgttagcc 
aagtgctggg 
tgggtgcagt 
cttgagccca 
ggcgatggag 
gaatgaggag 
catagttcct 
ggcacaaaga 
accagtgggc 
ccaaagggtc 
acatttggca 
atctagtggg 
gaggaactat 
gacattaatc 
cattcttcaa 
atgagaagee 
agcccccaga 
cctgaaacag 
ctacagcctc 
gactacaggc 
atctccctta 
tggcctccca 
ttttaccaag 
aagttctgct 
ccctgcccct 
ccgtgagctg 
agegactett 
acctgctcac 
gccacttgca 
agcccgacac 
tcagcagcct 
aggagaacag 
ctggtttcct 
tccccaggcc 
ctcttacatt 
ttgctgttcc 
ctgtctgtct 
atctgtcttc 
tgtctgtctg 
actctgcctt 
cctcaagtct 
atcccacaga 
accccttcta 
ggtaaatctc 
attgttattc 
tttgacaege 
accctgagag 
ccaaatt-ctg 
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ccctctccct 
ctagttatca 
caatagccat 
tacagtagtg 
gagggaaaga 
ttgcctaaat 
gccacagtgc 
ttttttttac 
catctcctag 
gtgcttggca 
aacgagcctt 
atgtgggtca 
acagtttatg 
ttttcacctc 
gatatagtca 
gtcttctcac 
tagggacatg 
tcatggtctg 
aagcacacag 
tttggacaaa 
ggatagtact 
tggagtgcag 
ttcctgcctc 
tttttgtatt 
gacctcaggt 
ccatgcccag 
atttagcaca 
actactacta 
atgctttcaa 
gggcctgagc 
ctaaccctct 
ttccaccaag 
gctggtgtgg 
acttgaagtc 
aaatacaaaa 
taaggcagga 
actatactcc 
aaaagcccat 
ctcaggccaa 
caggtggaca 
atggtaccag 
gaaaacacag 
aggtgaggaa 
ccaggcctgc 
gaatggggaa 
gtatctacca 
gtggccctgt 
tgttttcttt 
agtggcacga 
tcagcctccc 
tttttagtag 
ggtgatctgc 
gccgacactt 
ttttgttttg 
cgcaatcttg 
tccccagtag 
agagacgggg 
acccgcctcg 
ctgaatggtt 
ccacacctct 
ggacccatct 
aggagatact 
tagctatcca 
ccaaggtcac 



acttgaacct 
atctgctagt 
aacaacctaa 
aacagaacag 
agtaaaatgc 
agtgacagta 
caagcacttt 
ctcctcagac 
ggattttgaa 
tacggtaagt 
atttaacatt 
acttcaagga 
aagtgcactc 
cttgttaaca 
cctccaatca 
catcatgttg 
gaatgaatgt 
gctacaaggg 
tttggactca 
ttacttaacc 
taataataat 
tagtgcgatc 
agcctcctga 
tttagtagag 
gatccactca 
ccaataataa 
aggattaaaa 
ctactactaa 
aatctcctta 
ttgaatcccc 
ctgaacctca 
aatggctgta 
tggcccatgt 
aggagttcga 
attaggcagg 
gaatcacttg 
agcctgggtg 
agttcagtgc 
caacagcaat 
ttggggtctt 
gcatcatggg 
cctctgcctg 
acatgggctg 
cccaaagggg 
gggccaggca 
catgccaggc 
tctcccagag 
cttttttttt 
tctcggctca 
aagtagctgg 
agacggggtt 
ccaccttggc 
gttttctctt 
tttttgtttt 
gctcactgca 
ctggaactac 
tttcaccatg 
gcctcccaaa 
taaaaatagt 
aatttgctct 
gtgaaatggg 
actgagctca 
ttaacaccag 
ccagctagta 



ggaacgtgct 
tggaaaatca 
caaacatact 
accaaacccc 
acggtatatt 
atagcagtag 
ataggtatcc 
ctcagttttc 
agcattaaat 
gctatataaa 
ggtttcagtg 
ttatactaag 
agccacctca 
atggagaagc 
taaatctttt 
agcctcacaa 
tcctgaggcc 
gacagtactc 
gcaagaccta 
tctctcagtc 
cttttttttt 
tcggctcact 
gtaactaaga 
aagggtttca 
cctcggcctc 
tccttattta 
tgtaaggcat 
tactgagatc 
tcaatatata 
gatctactat 
cttactttat 
aataggaaac 
ctgcaatccc 
gaccagcctg 
cggggtggca 
aacctgggag 
acagagcaag 
tgaagaaatc 
caggacctga 
ctgaaatggg 
aaggaagcaa 
tgaactgcct 

gggtgagatc 

agctttggag 
ctgtgccagg 
aatgttccag 
catattccct 
tttgagacga 
ctgcaacctc 
gattacaggc 
tcaccatgtc 
ctcccaaagt 
tcagtcatta 
tgagacagtc 
gcctcacctc 
agacatgtgc 
ttgcccaggc 
gtgctgggat 
ctttatgctc 
atggctttgt 
gataacctgt 
cagcccaatg 
ggaggacacc 
aggaacatga 



tcctctgcct 
ggtcagtgct 
gagcacccac 
ctgccttcac 
ggaaaaatat 
ccgccaccac 
actctgccat 
tcatctgtac 
gcatgaataa 
tgcttgttaa 
aagtggccca 
gtcatgagtg 
tctcatttct 
tgaggctggg 
caaccattgt 
caacctggtg 
acacacccag 
tggagtacaa 
ggttcaaatc 
tccatttcct 
tgagacgacg 
gcaacctctg 
ttacaggcat 
ccatgttggc 
ccaaagtgct 
agaagttttg 
ttagcacata 
aaatactact 
ttagttattt 
tttctgactt 
ctgcaaactg 
gagttagtgt 
agcacttcgg 
gccaatatgg 
ggtgtctgta 
gtggaggttg 
actctgtctc 
atgttattat 
ggtcagcaaa 
aagtgtttgt 
cttcacacct 
ggtagggctg 
cgcagggtgc 
gaaactccac 
tgagttcatt 
gtgccaggga 
actcaagtgt 
agtctcgctc 
tgcctcccag 
atgtgctacc 
ggccaggctg 
gttggattac 
cagtggcctg 
tcactatgtc 
ctggggtcaa 
caccatgtcc 
tggtctcaaa 
tacaggcatg 
aagcagatca 
gcaagttatt 
accttggcga 
tctggtacaa 
aactgaagct 
cctagaattg 



catccagggc 
gatgatgcta 
tacgagctag 
agagatacca 
gtcttatatt 
ttagtgggta 
ttacaagcgt 
aatggggtag 
tttgtaaagc 
aatactattt 
acttggactc 
agtcccagaa 
acagcccagt 
ggccctgaag 
cggtgtgacc 
atagggacag 
gaagagctgg 
ttgagcaggc 
ctggctccta 
catctctaaa 
tcccactcta 
cctcccaggc 
gtgtcactac 
caggctggtc 
gggattacag 
taaggattaa 
tgggcactat 
acaaattgat 
aggaggaatt 
atttaacttt 
ggaataatga 
atagaaagcc 
gaggccaagg 
tgaaaccctg 
atcccagcca 
cagtgagctg 
aaaaaaggaa 
gaccccatcc 
ggcttgggca 
tctctacgcc 
ggccttttat 
ggctgggaga 
aggtgtgacc 
cagaggacca 
catcaacaga 
ttcaggagag 
agccagatga 
tcttgctcag 
gttcaagcga 
atgcctggct 
gtcttgaact 
aggtgtgagc 
catggttttt 
acccagctgg 
acaattcccc 
agctaatttt 
ctcctgaact 
agccaccgta 
gatctcagtt 
taaccactct 
gcaggggttg 
agtgagtatc 
cagcaaaata 
gcccaggtct 



tagtgcctaa 
atgataataa 
atgctaagaa 
ttcccatgag 
attcttattg 
cacagggtca 
gtgacatttt 
caagagcacc 
ctcttagaata 
taaaaaaaga 
catcctgaag 
attgcacctc 
tgggagatta 
accctataga 
ggaggcttat 
ttaggggcac 
cgcttgaacc 
tcatttttga 
tatatatgac 
atggcaatca 
tcgcccaggc 
tcaagtgatt 
acccagctat 
ttgaactcct 
gtgtgagcca 
aatgtaaggc 
aataataatt 
catgcattta 
tggagtcaga 
aagcaggttg 
aaataatacc 
catagttcag 
tgggtggatc 
tctctactaa 
ctagggaggc 
agatcgtgct 
aaaaaaaaaa 
tccattgact 
gaggggacct 
cctggcatga 
agaggagatg 
tgccacaggc 
caagatggag 
cagcttttca 
tatttactga 
aacagaaaca 
taaagacact 
gctggagtgc 
ttctcctgcc 
aatttttgta 
cctgaccaca 
caccgcaccc 
gtttgttttg 
agtgcagtgg 
catcttagcc 
tctattttat 
taagcaatcc 
cacagctggc 
tgaattccag 
gagcctcgat 
tgaggattaa 
caatgaatgg 
aaagcacagt 
gtctgactcc 
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agagtgcagt 
ttggacccta 
agataataat 
gtagcacaga 
gccgatgatg 
cacagagtag 
aattcataaa 
tcacagatga 
aagcagctca 
aagaacagga 
cagagagctg 
ggatgacttt 
aggggcgggg 
ttttaacagt 
ggatttggca 
gaagggaggg 
agaaattctg 
ctctggaact 
tgctcactct 
tgtcctactc 
agggattcca 
tgagtcaggc 
gggtggtggc 
ggacggatct 
agggcggaag 
tgggtgtagg 
gtggcaggca 
gggaacaggc 
agacctcctg 
cagtgaggtc 
ccccagtgct 
aagtcagctg 
ccactggccg 
caaggtcctt 
gacagcgcct 
cctctggggg 
ttcccccagg 
tattgtcata 
cttaacagca 
gaatattgta 
acagctaggg 
cttcataatt 
tctttatgtg 
ccccatttta 
taatggctaa 
ctcatggaac 
ttcatttaat 
tgcccacggt 
ccagtctcca 
gccacactgg 
ctcctgaatt 
aatacacccg 
aggttagagg 
cttctgccac 
tgcctcaggc 
gcggacctgg 
gagcacgtgt 
cagggcgtgc 
aacaacagcc 
ctgcggctca 
ttccagcgcg 
ggccgagacc 
acgcggccgg 
gaggccgggg 



tgttcagagg 
ggcgagtcac 
agtatccacc 
gcttggcaca 
acgatgatac 
acactcggta 
aatcagatgc 
aaagacccag 
tttgattagt 
tgctgtcagg 
tgaaccggcc 
agcagaggct 
ggagggctag 
ggagagagga 
acagcttgga 
gcaggacttt 
tgggccaaaa 
ttggtctact 
cccccaaccc 
ttgctgacac 
agcctacccc 
tgaggccagg 
agcagggggc 
caatgtggcc 
ccagatggga 
ctgggagtca 
tggggaggaa 
cccctgggtg 
gaacctgctg 
agggagctga 
ctcctctaaa 
atcatcagaa 
tgggccacac 
tctgcactaa 
ctaaggggag 
tggagctgat 
gagctcctct 
gtagccacca 
tttatcatcc 
catactgagg 
aaaaggcaag 
ggaccaaatc 
aaccatcctc 
cagatgagga 
caaacaaaaa 
tcatagaact 
ctctacatta 
ggcatcagtt 
tacttcacca 
cttccctggg. 
agaaactctg 
aaagtttagg 
aggcagaatc 
ggtgcacccc 
ccattctctt 
ccaacctgag 
ttcgcggcct 
accgcgcggc 
tggcctcgct 
acgctaaccc 
cgcgcgtgtc 
tgcgcgcgct 
gcagccgcgc 
cgcccccagc 



tctctggagt 
ttcacttctc 
tcatagggtt 
tggtaagagc 
ccatcctaga 
cagctctgtg 
agatgggatc 
gcccagagag 
gtcagagcca 
gaagcaggca 
agtggggagg 
ctcagcccag 
ggctgtgaaa 
aggaccggat 
tggggggagg 
ttgcagagaa 
cctggggctg 
cttctgacct 
ccggcccctt 
ttcataaaaa 
ctggatgggc 
gaggtgcaag 
ggagaatggt 
aagaggaggg 
gggggcgaga 
atgccctccc 
agcgtctccc 
aactggcctg 
accacagtga 
agtcctgctt 
aaagtgagct 
tcccctggtg 
acacccgccg 
tgtggcctga 
tagtaatgca 
gcccctcacc 
tttaaagaaa 
tttattgatg 
aaccctcctt 
aacctgagac 
ctggattttg 
cctgtgtgct 
tggaatcctc 
aacaggcaca 
caaaaacaaa 
ccacaaggaa 
tacagatgag 
aatgacagat 
aaccagaagt 
gaacctgtag 
ccccccgccc 
aaccactgac 
caaatccagg 
cttccctccc 
cttctgtgcc 
ccacctcttc 
gggcagcctg 
cttccgcggc 
gcccggcgag 
ctgggcgtgc 
cagctccgac 
ccgcgaggcc 
ccgcggcaac 
cgatccctcc 



tggaagccac 
tgaggctcca 
gtgacaatta 
tcaatcagtt 
ctgatgagct 
gaatgaatga 
ttaaagatca 
gtgcttggag 
agagctggga 
gggatgctgt 
caagggaaat 
agggagggga 
gtcaagagct 
ttgaaagcta 
aggcaatgga 
acaaaaggag 
tgggtcaaag 
cccgaggtcc 
atcgatcctc 
gaggaacccc 
ctggaagaga 
gagccagctg 
ggtgtcagag 
ctcttggcac 
acaggcagga 
ccaacctgag 
caggcagtga 
agcagagtgg 
tgccctgcac 
ccctctctgg 
gggctgatgg 
agctggttat 
ccccccgctg 
ttaggtgact 
atgtggcttc 
ccaataccca 
agggacagga 
gttgactatg 
tagcctgctg 
tccatgaggt 
aactagggct 
gggcacgtgt 
agaacaaacc 
gagagatgac 
aattaaaaaa 
aggtgttcta 
gaaactgagt 
ccaagatttg 
tctgaaactc 
acatggggat 
caccccgctc 
ctcaccaata 
atgctatgaa 
ccggccaagg 
ccactccacc 
ctccacggga 
gaccggctgc 
ctcagccgcc 
gcgctcgccg 
gactgccgcg 
gtgacctgcg 
gacttccagg 
agctcctcca 
accctctacc 



gttccactgc 
tctcgtaatc 
agttactata 
acctgcttga 
ctgtaagcgg 
ggcacatccc 
cctatcctaa 
ctgcgcaagg 
gtttggaggg 
gttaagattc 
gtggtttttg 
gatagggagg 
tattaatgca 
cattcaagga 
ccccaaggca 
aggagaggag 
gcacctgaat 
cccaaaatgt 
tgaccataca 
atttaggtgt 
acaagagcac 
gaggcctgag 
gcagccgaga 
gctcagttcc 
gcacaggaag 
gcctccgacc 
gggagggaga 
atgctcctgt 
aagaggggag 
caagccctta 
gtgccaaggc 
aatgcagagt 
ttaattctga 
ccctagcacc 
cttcctctcc 
gcctagtagc 
cccaattgtt 
cacctgccag 
agggggttat 
taaaacttgc 
ctaagtgctg 
ccagcacttc 
caggaagtag 
tggcttggcc 
aaaaaaagaa 
agcaccttca 
cacagatatc 
aaatcagaaa 
aaactgtggt 
tcccaggctc 
agagatccgc 
ccactttttc 
tcaaaaggtc 
ccccagcggg 
ccacccagga 
accgcctgcg 
tgctgcacgg 
tcaccatcct 
acctgccctc 
cgcggccgct 
ccaccccccc 
cgtgtccgcc 
accacctgta 
gagatctgcc 



atattagctg 
tctgaaatgg 
taggatctgt 
caatgctgac 
gggtgcctgg 
agaactcacc 
gtcccttgtt 
tcacacagcc 
aggcaaggtt 
caaatggatg 
aaatggaaga 
ggagataggg 
tagagaacgg 
agtggcaacg 
gaggctcaga 
gttagaatca 
tccctaggat 
ggattacccc 
tctctgggtg 
tttgagtggc 
caggccatgg 
ccaggatttg 
aggttgaggg 
tgtagcgaag 
gtggaggctg 
aggctcctgg 
gccacagtca 
tctgagaccc 
gacctcaagg 
tctctttgag 
attagctccc 
ccaggaatcc 
accatagttc 
aggcaggtgg 
tcccctgccg 
agtactttgg 
actgagcccc 
atactgtacc 
acataataag 
ctaaaataac 
agcctgtggg 
cctcatatga 
gtatactcat 
aagttaagaa 
taatggctaa 
tacatgctgc 
ctgagtgact 
ggctggctcc 
cctgccaatg 
caccccaaac 
aggggatcct 
cacagcaaat 
aaccctttct 
gtctgcaccc 
tgacttgttc 
gctgctcaca 
gaaccggctg 
ctacctgttc 
gctcgagttc 
ctgggcctgg 
ggagcgccag 
cgcggcaccc 

tgccgaagac 



135360 

135420 

135480 

135540 

135600 

135660 

135720 

135780 

135840 

135900 

135960 

136020 

136080 

136140 

136200 

136260 

136320 

136380 

136440 

136500 

136560 

136620 

136680 

136740 

136800 

136860 

136920 

136980 

137040 

137100 

137160 

137220 

137280 

137340 

137400 

137460 

137520 

137580 

137640 

137700 

137760 

137820 

137880 

137940 

138000 

138060 

138120 

138180 

138240 

138300 

138360 

138420 

138480 

138540 

138600 

138660 

138720 

138780 

138840 

138900 

138960 

139020 

139080 

139140 
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tcgcgggggc 
ggtgaggacc 
tcccgaggcc 
gtgccccacc 
tcccgtccac 
cctccctttc 
ccccctgggc 
gccctcccca 
ggcagccgct 
cagagcgtct 
ctttggggga 
tcaacctcct 
gcacctggag 
tgtttgggaa 
acttgggacc 
ttggagaagg 
cgctcccccg 
gccaagagtg 
caaccctgag 
acagccctct 
gtcacataca 
ggcactgctc 
accaaggctg 
agcgtacaca 
aggtagaagc 
agggaccgag 
gcttgaggta 
gaagagctca 
gcacttggag 
gtggtcgctt 
ggccaagaat 
ttttaagaat 
gggaggctga 
ggcagaaacc 
ctatagtccc 
gactctgtct 
gcctagatgc 
aaatgatctt 
tagactccgg 
ccagggagct 
taatgggttg 
gcctgggcag 
ttatcattgt 
aagaggagtg 
acccctccca 
cccgagctct 
ccacccacgg 
gcacacccac 
ccccaaactc 
ggatgttcaa 
tagaaaggaa 
cctcccacat 
ttgtaattaa 
cccagccaca 
cattacctca 
atggaatctc 
aatttgcata 
taaaggtaaa 
ttctagcctt 
catactggcc 
aaaatttctc 
cagagccctt 
gcagcagaca 
accttgggaa 



gccagggcgg 
agcgagggga 
ctgcgctctc 
acctctgact 
ccggggctgc 
ccctcccagc 
tgtcctgact 
gttctggcca 
gacccgcact 
tccaccagca 
tgcctcagtc 
gacactggag 
agaactttcc 
aaaaaattta 
ccgaaagcgg 
ggagccgtcc 
ccatgaggag 
ggtgctggtg 
tggtctggcg 
tccagaccct 
cactcctggt 
cctcagggaa 
ggtttccgcg 
cctgcacaca 
catcagggat 
gaactcgggt 
caatcagaac 
ggctaagaag 
agaagtttgg 
gcacctgtaa 
ttgagactag 
ctagtttgga 
ggtggacaga 
ccgtctctac 
aggtactcag 
aaaaaaaaaa 
tagggcttga 
gagaagaaga 
ttaagggtgt 
cggagtctgg 
gattggaggg 
ggggagggga 
gaaatattca 
tgtgagcctc 
ccaccagccc 
cacacacact 
gtcccacaaa 
tcaatcagga 
tccaaacaac 
ggccatccag 
gggatttgtc 
tctgagtgta 
ctcctccctt 
gtctatccct 
tgcctggaat 
tccctccctc 
tgttatgatg 
acaccttagc 
ttctctcaca 
accagttatc 
agtctctcct 
tcccaaggcc 
gaggctcagg 
gcacagccag 



ggacgcgcct 
gcagatgtgc 
ggccgggctc 
gcggtgctga 
ggctccggcc 
tcctctcctc 
tgtggcagcc 
ttaactcttc 
cctaagggcc 
gagcctttgg 
agggccaggc 
gaatactttt 
tctccactgt 
ttaaaaaatt 
gatgaggtct 
cccgccatca 
gagctgaagc 
gcacctcttc 
cctgaggaac 
gttccttcag 
cacagacagc 
acacagcccc 
tacacttgcc 
gcaaccctgt 
gaaccaggtc 
gatcagaggg 
cttcatcttt 
gacttgggtt 
tagtttgaat 
tcccagctaa 
cctggacaac 
gtggggtgtg 
tcacttgagg 
taaaaattca 
gaggctgagg 
aaaatctagc 
ggaagcaggc 
gaagaaagga 
ggggtgggct 
ggtcttgcag 
acagccagac 
gctgctggta 
tgggccatcg 
aggaagaagc 
agacagaccc 
cgcaccaagc 
caggcaggtg 
cccagcagag 
ccagattcac 
cccaaccccc 
tgagattcct 
cccccattct 
tctctaattt 
tccttcccat 
ataaaagaaa 
ccacccagcc 
tcactcagga 
agagcatcaa 
atctcatccc 
atgtatgtga 
gaagtaggaa 
aagactgcac 
ctctggctcc 
gtttcagtcg 



actgaggacg 
cccggcgctg 
cccagccctc 
gatcgaagag 
ccagtcgccc 
cccggggagc 
ccaagagggc 
cccatcccaa 
cacagcggac 
aagctccccc 
tgaccctgac 
ctcctaagtc 
ggcccctgcg 
ctattatttt 
cagaatgtaa 
aagagcttcc 
cctgcattct 
tcttcatttg 
taagcctggg 
gaaacaagag 
cccaacatgg 
atgccagagc 
tccttggcta 
gggtatgtgg 
agagaagcag 
acaggaatcc 
ctggtgtgtg 
gggatttagg 
ttggagctaa 
ttgggaggct 
atatcgagac 
atgtctcaac 
tcaggagttc 
aaaaaattag 
cacaagaatc 
ttgggaggtg 
tgaggttctg 
gaagaaatca 
gggggagagt 
gggcttgtgt 
aagagctctg 
aattagaggc 
gtccagatgc 
ctgaagcaaa 
acggacgccc 
agagccatgt 
tcccctaaat 
atcacctcca 
caccttgacc 
accaatgctc 
ctatcccctg 
gcattttcgt 
cctgcatcta 
tctccccacc 
actgcgataa 
ataccgtggt 
gcctgtaatg 
agatccctca 
aaaccttcac 
aacccaccaa 
acctcttccc 
ctctctgacc 
ctccaagcag 
tctagaacag 



actactgggg 
cctgccaggc 
tgctttgcct 
gccagtgtcc 
caccttccct 
aggccgcctc 
gtgtgtggtg 
ggctggggtg 
accagaggcrg 
agggagcccc 
ccctgcttac 
taccctggac 
tggtgaagat 
atctactgta 
ggattgcagg 
tggtggctgg 
aggtgaggcg 
tccaggggaa 
gaagacctgc 
cagttctcct 
ctttgggtaa 
aaacacctta 
agtgcccttg 
tctctctctc 
gtttccaaac 
caaattggga 
gaagagaggc 
ggtgagtctc 
gaatctagct 
gacgtgggag 
tgagtctctt 
gtctgtaatc 
aagaccagcc 
ccaggcgtga 
actccagcct 
ggaatagaaa 
tgattctggc 
gcatctaagc 
gagagcagct 
caggctggct 
gtggaggagg 
aggctgtcca 
tatttcagaa 
gccactctcc 
atcacgtgca 
agcacgtgca 
tctgacatgc 
gcgatctcac 
cacacaccct 
tgatggggaa 
aaaaaagcaa 
ctgccagaca 
ttcagttacc 
acctccctgc 
cctcctcgct 
gaccagattc 
gcttcctaat 
gagtctggta 
tccagctaga 
ccgactttga 
cctcctcaga 
atatacaggg 
acggctgccc 
agaatgagca 



gggctacggg 
gcccccggac 
cctgctcctg 
gatccccgct 
ggccttgctg 
tccttgcctg 
gctcagccct 
gggcccccca 
cttttgtctg 
acccaggacc 
cctagtcccc 
actttttagg 
caaaagaagt 
agatttgttg 
gccaggaggg 
aggtggtgtg 
cagtgtggca 
gagctgcagc 
tgtctggtta 
gcaaggagga 
atgtgaacaa 
gcaaacagag 
tgcagtgcac 
agctcctgtg 
aggctagaag 
tgcattactg 
tggggactgg 
atcagactga 
tgggcagggt 
gatcacttga 
aaaaatgttt 
ccagcactct 
tggccaacat 
cggcgggtgc 
gggtgacaga 
gatagagggg 
tagggaggtc 
ctgaggcagg 
ggtcagaaac 
gtgaggaggt 
ggctgctggg 
ggtcatagaa 
cagtgaaagc 
accaaccccc 
cacccacact 
agcacaccaa 
acactgacat 
atgcgcagac 
gagataggag 
actgaggcca 
aattcattca 
cccagcctag 
cagtccccca 
tccagctact 
ggtttcctac 
atctgatcaa 
gcctataggg 
ccaactgctt 
acgtttgtat 
gtgcccccct 
tctcagactc 
gttcttcaaa 
ccgactggcc 
tctaaccgcc 



139200 
139260 
139320 
139380 
139440 
139500 
139560 
139620 
139680 
139740 
139800 
139860 
139920 
139980 
140040 
140100 
140160 
140220 
140280 
140340 
140400 
140460 
140520 
140580 
140640 
140700 
140760 
140820 
140880 
140940 
141000 
141060 
141120 
141180 
141240 
141300 
141360 
141420 
141480 
141540 
141600 
141660 
141720 
141780 
141840 
141900 
141960 
142020 
142080 
142140 
142200 
142260 
142320 
142380 
142440 
142500 
142560 
142620 
142680 
142740 
142800 
142860 
142920 
142980 
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tggggagagg 
ttacacccag 
attttcgatc 
aggacactgt 
accaggtggc 
ccagatagac 
gcccttctcc 
ctctggagaa 
taagctgtgt 
tgtttttgtt 
tcagtgcaat 
aagttctgcc 
tcgaaggtag 
aagatccttt 
tgcaaggctg 
agtgggcagt 



actaggacac 
agtagggggg 
cctaatccga 
ggaagggaaa 
aggcccaagg 
ttgtcagatg 
aaccccattg 
acggttttgg 
gaccttgggc 
ttgtaagctg 
agcccatggc 
tccctcttcc 
gaactttggt 
ggcctttgta 
gtggagtgag 
ggagagagg 



cagatgataa 
ggacggttct 
gaggaactcc 
tttcccaaca 
tccagctggg 
ttctgaaaac 
cccctgcagg 
gacagtagaa 
aagrtactta 
gggacaatag 
aaagtgacct 
agcttttctt 
tggttctccc 
gaaacttcag 
agaggaggat 



ggtttataag 
cagccctgtt 
tctccaatga 
gactctgaga 
gtcaggtttc 
tgagcatctc 
agggcaggcc 
gggtaaatga 
acctctctga 
tgccagccta 
acatgcttgc 
ccaaccccaa 
ctttctcttc 
gtgaggaggt 
agatggcgaa 



cccttaagcc 
tccctagctg 
aatacagact 
gtccaggagg 
tatatgaatt 
ctttcacctc 
tgggacagat 
cctagttatg 
acattagagt 
aatcaatttg 
tgttattatt 
agatgtctct 
aggcccaaac 
ggcagagaaa 
gccctagcag 



tctaaggttc 
cgggctccca 
tgggactctc 
ccagggatag 
tttaatgctt 
tgtacatgat 
attcagtggc 
ttcccactag 
tctgtgggtt 
ttgtggggac 
ctctttcctc 
ggctattgct 
tccccacctc 
taagaaagtg 
aggggaggga 



143040 
143100 
143160 
143220 
143280 
143340 
143400 
143460 
143520 
143580 
143640 
143700 
143760 
143820 
143880 
143899 



<210> 16 
<211> 215980 
<212> DNA 
<213> Mus sp. 

<220> 

<221> modified_base 

<222> (1001) . . (1100) 

<223> a, t f c, g, other or unknown 

<220> 

<221> modified_base 

<222> (2123) . . (2222) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (3728) . . (3827) 

<223> a, t r c, g, other or unknown 
<220> 

<221> modif ied_base 

<222> (5168) . . (5267) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (7481) . . (7580) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modifiedjbase 
<222> (8849) . . (8948) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (10375) . . (10474) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (12270) . . (12369) 

<223> a, t, c, g, other or unknown 

<220> 
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<221> modified_base 

<222> (13438) (13537) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (15902) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (15939) . . (16038) 

<223> a, t, c, g„ other or unknown 

<220> 

<221> modif ied_base 

<222> (18223) . . (18322) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified__base 

<222> (20974) . . (21073) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (24403) . . (24502) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (27574) . . (27673) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (30892) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (30901) . . (31000) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied^base 

<222> (34443) - . (34542) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (38205) . . (38304) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (42373) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (42386) 

<223> a, t, c, g, other or unknown 
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<220> 

<221> modified_base 
<222> (42393) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified__base 
<222> (42461) 

<223> a, t, c, g, other or unknown 
<22 0> 

<221> modif ied_base 

<222> (44809) . . (44908) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (51380) . . (51479) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (56740) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified__base 

<222> (56765) . . (56864) 

<223> a, t, c, g r other or unknown 

<220> 

<221> modif ied_base 

<222> (62818) . . (62917) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 
<222> (68518) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif ied_base 

<222> (68534) . . (68633) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (74552) (74651) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (81446) . . (81545) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (88519) . . (88618) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 
<222> (93791) 

<223> a, t, c, g, other or unknown 
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<220> 

<221> modified_base 
<222> (93794) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96565) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (96570) . . (96573) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (96579) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96590) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96596) 

<223> a, t f c, g, other or unknown 
<220> 

<221> modif ied_base 
<222> (96602) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96616) 

<223> a, t r c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96629) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif ied_base 
<222> (96633) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif ied_base 
<222> (96668) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (96715) (96814) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (104447) . . (104546) 

<223> a, t, c, g, other or unknown 
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<220> 

<221> modif ied_base 
<222> (114521) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (114527) (114626) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified__base 

<222> (127063) (127162) 

<223> a, t f c, g, other or unknown 

<220> 

<221> modified_base 

<222> (139133) (139232) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 
<222> (151051) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (153242) . . (153341) 

<223> a, t, c f g r other or unknown 

<220> 

<221> modif ied_b as e 
<222> (164706) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (164708) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif ied__base 

<222> (164710) . • (164809) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied__base 

<222> (182242) (182341) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied__base 
<222> (192158) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (192192) 

<223> a, t, c, g r other or unknown 
<220> 

<221> modif ied_base 

<222> (198842) (198941) 

<223> a, t, c r g, other or unknown 
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<220> 

<221> modified_base 

<222> (199437) (199438) 

<223> a, t, c, g, other or unknown 



<220> 

<221> modified_base 
<222> (208276) 

<223> a, t, c, g, other or unknown 



<220> 

<221> modified_base 
<222> (215974) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (215976) . . (215977) 

<223> a f t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (215979) 

<223> a, t, c, g, other or unknown 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



900 
960 
1020 



<400> 16 

ttgggggtat aaacccagaa gtgggattac tgcaccatac aataatcctc taacttcaag 
caatttttcc acaatggttg tatcatttta cattcccact ggctacgaga agggttccca 
cttctacaca tcttcaccac catttctgtt tttgtttttg agtaacagct gcctaatgac 
tgtgaagtgg tatcttatct cagtgttgat ttgcatttct ctgatcatta atgtgggaag 
gcatcgtttc atatgtttat tggctgtttg tgtatcatct tctttggcga tgttgattca 
agttatttgc ttgttttttt aattggagtt ttaaaaaatt gttgttgagt tgtgggagtt 
cttcattagc tctgcatatt aataccctga tgaaaatgat taacaagtat ttgcttccat 
tttgggggct tccattctgg gctgttttta ttcttttgat actcttttga ttctcaacag 
tttaatctga ctaaaattca gtttatttct tcttttaatg gccatgctat tgacacatcc 
cgtaatcact gccaaatcca gtcatgaaga gtttctttca agagatttat agttttagct 
ctttaagttt gtcatgtctg tttcacttaa ttttgtatag tgtacaaaag tctaacttca 
ttcttttcta tatggcttgc tactagtata cgaagagcta aatttctctt tccttgagtc /^U 
tcaacctctg atgtgtagca atttcttcag aggaaaacat ggtgggaagt tccttaaaca 780 
taggatgctc catggaggtg aaatagttca tcctacaggg aagcttgtta aacacaggaa 84 0 

gtacatactc agcagctcta gtaagtgagt gaaactgact ggaggcacta ggtccctcct 
tccctacgca tatagaagct gtaaggattg ggaagagata ctgtcaggtc agctcagctg 
ctgcccggaa gaagctcaga cccactggcc tggctccaag nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1UBO 
nnnnnnnnnn nnnnnnnnnn atcactcttt actcaggcca cctacacgct gtttatagcc 1140 
tgcctttgtc tctttggcta tacttcctgt ttatgtctat gcctcccctc tttctttttc 1200 
tttctcttct cttctcatct catctcatct ttcttcaggg gggagcctgg tctagaactc 12b0 
acaaagattt gactgtctct gtctccttgc actaattaaa aaatctttta caagcatctt 1320 
ttagcaattc ttacagggaa attttggaat gttaaactct gattgttagc gggctgaaga 1380 
taacaatagc tctgatgata aattgcttgc caggcaagtg tgaaaatctg agtttgatcc 1440 
aaaaagccgg gtacagaggc caaagagtcc ataatcctag taggggcagg aatcagggat 1500 
gggtgggtcc ctggggtttc ctggtttgtc agcgtagccc aattgggaat agccaggttt 1560 
cagtgaacga tgctttctgc aagctgagag aggtccttgt tcaatctctg tgacccaact 1620 
ggagggagaa gagagccagc tctccagaag tggtcctctc aactttgtgc atgcatgtcc 1680 
atgttcacac agggaatgga taatgcttaa aaggaagacc ggcagggggt tggtaatgca 1740 
cctcctttgg tgacatgctt tcctcttgtt catgctgctc caggtgtggt cggcagcacc 18 00 
aaaaaccagg tgtatgtttg taatcccagt attctctggt cgtcagtagg aaatgaaaag 1860 
cgaggtcatc ttcgtataga gttagcaaac tctaagccag cctcggctac atgagacttt 19^U 
gtctcaaaac aaaggaaaaa tcaaggagga cggctcccga gcactgtcac ctgaagctga 1980 
cctctggcct ccacatgcat gtgcgcaaac acatgtcctg cacaaacaca cagacacccg 2040 
catctgctcc ccgacaaaag aacctgaaac cagtatactt tgagaatttc ccattcatag 2100 
ttaccattgt gtgttccttg tgnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2220 
nnggtggtgc ctttctctta cccagtctag aagggctgga ggcagggtgg atggggcact 2280 
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ttgaactccc 
gaataaggta 
cacacaccac 
atgtggctca 
aagctggaag 
tcaagagagc 
gccttgccta 
atattaatca 
atgaaaggaa 
gcagagagtg 
gccaaagata 
gacagttgat 
gcccctgaag 
gaagcctcag 
gcttcctgtc 
cacaccctca 
ggtcacacca 
aaccttgttt 
ctgctggcta 
cgctacagat 
gcatagagaa 
gacgagagaa 
acatcactca 
atctgtatct 
acaccaannn 
nnnnnnnnnn 
aaacacacac 
caggctaccg 
caacgtgggc 
gctgctgcta 
tgttcaaccc 
gacggatgtt 
gccctgaggg 
gatgaaccac 
ctggcattat 
atgtcaaaac 
tgaaaatatg 
gctggagaag 
agcagtcacg 
gagccacaag 
aggaatgatc 
ccactgaatg 
atcccagcac 
ccaggacagc 
tcaaaacaac 
tagtacctgt 
aagtgtgttc 
aactacatgt 
ctgctaannn 
nnnnnnnnnn 
caaaccagta 
ggagttgctt 
tgggcttcca 
ccatgcacac 
ggctctctca 
gtttcagtca 
acccacgtgt 
cctgatgccc 
caaaaagcaa 
ggcactggat 
atcaaaacaa 
tttcagatga 
accaaccttt 
cttattcttt 



acctaggcaa 
gagagccgcg 
acacacacat 
gtgggtagaa 
gagacaacca 
ctggagtgga 
gacaacggca 
tgtatagata 
aggaacaagg 
gcacttccca 
aacggctcaa 
actgtcaact 
gggtctttgg 
acagaataaa 
tgtggctgca 
aaccgtgaac 
atgagaaagg 
gcatgtgtga 
agggtgagag 
ggaaggcagc 
aaccagatta 
ctaagaatac 
agtgtctctg 
actacagcac 
nnnnnnnnnn 
nnnnnnnnnn 
caccagttag 
tcaggacggc 
ttcaggggct 
ttcaaggatt 
atcactcatg 
ctggaaaagt 
taagcaaagc 
ttgcccatcc 
taaaaacgat 
aatgtgtaag 
aagtaaaatt 
ggaccgaggc 
gactcagccg 
gtgattggcc 
ctggagtctc 
atgggtagat 
ttgggaggca 
cagagataca 
aacaacaaca 
tggtgagttt 
tggaaatggg 
cttccatcaa 
nnnnnnnnnn 
nnnnnnnnnn 
aacaaaatcc 
agaccataag 
agtcaggagg 
acacgcgtgt 
catctcagcc 
actcttacac 
cattgcttca 
cagtcattgt 
tatccaattg 
tcccaatctt 
ctactttcct 
gttactaaga 
tctcattctg 
ccccacatct 



aaaacccagt 
gaggaagaca 
tttttttgtt 
aaacctgctg 
accacaactg 
agacactgga 
gaacaaacat 
agagggaatg 
cacacaaagg 
gaagggagtc 
aaaactcacg 
taaccgtatc 
actaggttaa 
aaggagccag 
gcgagacacg 
caaaataacc 
ttaactaata 
gacfcttgaag 
tgcacaccta 
cagggcagct 
atagagtgtt 
agaagtctac 
gatgctacac 
tgctgtttta 
nnnnnnnnnn 
nnnnnnnnnn 
agaaagttaa 
tcactgccac 
ctaaacactg 
cccaaagctt 
agcagtcggt 
tgacagaaaa 
taactggcag 
tcctgacacc 
agaaacagaa 
atggcatcga 
aaaattggaa 
agagataaca 
gaaagggtgg 
caggagtggt 
ccatctattg 
taactaaaag 
gaggcaggtg 
tagagaaacc 
aaaaaaacaa 
gagtgagtga 
agaaagagaa 
atgcaatgtt 
nnnnnnnnnn 
nnnnnnnnnn 
tgtaagataa 
gagttcataa 
agaccaagga 
aaatacgtag 
aacatataaa 
agaatggcca 
gtattaacaa 
cttactgaga 
agagttacaa 
taatccaacg 
acaaagacag 
agcttacttt 
gactaacttg 
ccaatgtttt 



gatctctggg 
ttgggggttc 
ttaaatttac 
tctaagcctg 
ctagcagcca 
tccccagcag 
caaggctggc 
gcacagacag 
acaaagaacc 
ataaaataga 
gaaaacaggt 
tagaactcca 
ctgacgtggg 
tgagctgagc 
gtgcttcctg 
tcctctctac 
cagcactcaa 
ctcacgggcc 
taatcccagc 
gagactgact 
ccccactatg 
tgtgtagaag 
ggagggcgtg 
gtatgcgctc 
nnnnnnnnnn 
nnnnnnnnnn 
tcaggccgaa 
tggcaactct 
caatcaaagg 
agatgtattc 
accggggtga 
gttgattcat 
gagactaggt 
taaatactaa 
gtatcaatag 
acacatgcag 
agaaagacaa 
gcaacgtgtg 
ggcagcctct 
gccaccttca 
atatatgaaa 
aaatttaagc 
gatctctgtg 
ctgactcgaa 
aaaaacaaaa 
gtgagtgtgt 
tgtgcatgtg 
taattatcta 
nnnnnnnnnn 
nnnnnnnnnn 
agcctaagac 
tccaggcgtg 
atcaacaagt 
atgttcactc 
gcctgcatta 
tcatactccg 
aacccagaat 
ctactaagtc 
cctataagga 
tcttcagcag 
acacaagtta 
agtagttgtc 
agaagtatta 
tgtctttaat 



ccagtgtgtt 
tatgagtatg 
aaacattaaa 
gtacgagttc 
gaagcactgc 
gcaagcctgc 
agagctgtcc 
aacaattcaa 
tagcatacaa 
ctgaattcat 
caaaataacg 
gcaggcacat 
agtgacacca 
gtcagtgctc 
ctttagctgc 
attgctttta 
tacttaaaaa 
agttatgccc 
tgctgtggaa 
caaactgata 
caagaagccc 
cactgctagg 
tgaagtattg 
ctccactaca 
nnnnnnnnnn 
nnnnnnnaat 
tggcggcttt 
gcctgaacaa 
ttgtgtgtgg 
atcatactca 
cctattccct 
taggcaggct 
ttgccattaa 
tgaaagaaca 
tcatgtgttc 
aaactgttta 
tttgcctaaa 
gacatacgga 
gaaggaagtt 
cctcctgcct 
ttcacagtat 
ggggtggtgc 
agttcgaggc 
aaaacaaaat 
caaaacaaca 
gttagagaga 
tgtttctggg 
tgagttgaac 
nnnnnnnnnn 
nnnnnnnaac 
aagacacttc 
agagcccgag 
ctcgactttg 
acacacagaa 
tcagaacatt 
tctacaactt 
aaccagctgc 
acaaggtagc 
ggagtttacc 
atttcataac 
gaattaagaa 
tggctaaaac 
attcctaagt 
ttattatagg 



gtttgcaagg 
tgaaggggtg 
ataggctgta 
aatccctgac 
ttgctaacac 
aagaagatgt 
aggactgttc 
cacacggggt 
gaaagcctaa 
taaaacaaga 
tcacccatct 
ctccaggcat 
tctatggacc 
attgcttctg 
catgacagac 
ccaggcattt 
cataaacacc 
aatgccaggt 
tcagcaaaag 
gaggtgggag 
tgggtttcag 
tcacacagaa 
cttcctgatg 
gctcctcacc 
nnnnnnnnnn 
taatcaaaga 
cccctgtatc 
agcccgcagc 
gggtgggggt 
caggaaagcg 
gtagaaatgg 
ctttgcccaa 
tctgagacaa 
atggattgag 
tttctcccat 
gggaacatgc 
gcagctcaga 
tctggggcag 
aaggtaaata 
caaagtctga 
gttttagaac 
aggtctttta 
cagcctggtt 
taaaagctca 
aaacacccta 
ggggcgggga 
atgtagacaa 
catcttcatt 
nnnnnnnnnn 
aaaaataaac 
ctggggctgg 
ttcaggtccc 
gtctctagtc 
gactgcacct 
ctaggttcta 
ctcctgatct 
gtagatcctc 
actctggatc 
ttcattatag 
ttccaagtcc 
ctctgcagcc 
tgtatccttt 
aaatacttca 
gcaattcatt 



2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 
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5700 

5760 

5820 

5880 

5940 
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6060 

6120 
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tcctatctag 
gcactgatac 
tttaaaacaa 
atggtcctct 
ctggaattga 
ctccaactcc 
aggtttcttt 
aagttctgta 
ggcaggtagc 
actgggacaa 
agctaagggg 
aacgcacata 
ggctctccag 
aagaccctgc 
tatcatgttc 
gagaaaataa 
ggagtggggg 
atgagaagaa 
gcacagcctg 
tttacagtaa 
gtctgctgct 
ccagcactgg 
ttggcaacat 
nnnnnnnnnn 
nnnnnnnnnn 
taaagtagat 
aggaggaaac 
ccgaaggtcc 
tacggtgtac 
gaaaaaaaaa 
gcacttggga 
gagttccagg 
caaacaaaca 
tgacaaaaga 
tgtggtggaa 
aatcacacag 
aatcacagac 
tggttattca 
gaaccagggt 
aggcctatgt 
atactggaaa 
ggatgtcagg 
tcctgcagat 
cttatccagt 
actccttgtt 
ttgtgaaagg 
nnnnnnnnnn 
nnnnnnnnag 
aattcccagc 
acactcagaa 
aaagaaaaca 
agatcattgg 
cacaactgtc 
acatgtagac 
cgtgtgtaat 
ttcccatgag 
gtatatacat 
ttttttattt 
tagcatgtaa 
ccgaagactc 
tttgttgtag 
catctactgt 
cggcaatgtt 
acttccgacc 



ttccctgatt 
aatttagatt 
gattttaaaa 
gactgaattt 
accagggact 
catctcaaat 
gtcctcctgc 
ttaattagtg 
cagccacagt 
gatgcctaac 
tccagggctt 
aatgcaatat 
agcaagctgg 
ctccaggtgt 
atgtacaaaa 
gaagaattgc 
gatgacagaa 
gggtggggca 
gctgtagaat 
agttattcta 
gagaacctga 
ggaccaggag 
aacatcatgc 
nnnnnnnnnn 
nnnnnnnnnn 
agttgcaagc 
cagacaggtc 
tgagttcaaa 
tcatatacat 
aaaaacctaa 
ggcagagaca 
acagccaggg 
aacaaaaaag 
caagttctat 
cacgtattag 
gccactggat 
tgtgtggaaa 
aacgaggatg 
gatttactgc 
tcctaacttt 
aggtgcaggc 
atgcagcaag 
catccgtctc 
ttaacaaaat 
attgtacttt 
gggaatgaag 
nnnnnnnnnn 
agagctcaat 
atcaccatgg 
acataagtgt 
tgttfccaaaa 
ctgcfcctttt 
tgtaactttg 
aaaacacctg 
ataacacaca 
gttttggttg 
ataaacctct 
tttttttttt 
catcttgcaa 
aagatcacaa 
taagcgacct 
aatctgttct 
cgtctgcatc 
tgtagttccc 



aaaacagtag accttgctgc 
attaaataca aaaccctaaa 
aaagcatgtg ctacgcttcc 
ttcccctcat tctgcttcat 
tattcatgct aggcaaatgc 
atcatttcca aagacatttt 
agctatgact tcattccttc 
attgttttca ttctctctgc 
gcttaattaa gcaacagccc 
atcatctgca gataaagacg 
gggcacgctg agtgtgagca 
ggatgtggca atctacctct 
ctagcaagac aagccatatc 
aactcccaag caaaaggatg 
tgtcaaccgg catacacaca 
aaccaaaaaa tgtaatttga 
ggtgaactga gtggaccgag 
gtctgagcaa gggttcagca 



gctgggcttt atcacacaga 
tcaggctatg ctacttcaat 
caaagatgac ctgttagaac 
ctcgagggtc accccagatg 
ttcagaaatt aaaaacaaaa 
nnnnnnnnnn nnnnnnnnnn 
catgagatag ttaataaact 
atgaagaaag acaaaccact 
cagagagatg gctcagagat 
tcccagtaac cacatggtgg 
taaataaata aataaataaa 
ccaatcagcc aggcgatggt 
ggtggatttc tgagttcgag 
tgatacagag aaaccctgtc 
gaggaagcca gacaggatgc 
aagtgttagg gaaaggggga 
aaggctctgg gtatcctgtt 
gtctcaggca gagaaagatg 
atatttgtaa ggttgtcatt 
ggacaacaga aatgggagag 
gcacgtgtat agggtctaca 
ggtaatgagc ttattacagt 
taggctagaa aggtactaac 
cctacctctg tatgtagtca 
tgfcatccatt attcccagtc 
gctgactgca agctcccaag 
accctgtttg cttgggatag 
aagaataann nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn 
ggttaggagc actggatgct 
cagctcacaa ctgtctgcaa 
aggcaatcta cgtaacataa 
tatacaagta atggggctgg 
ggaggttctg ggttcaattc 
gtcctgtggg agctgatgcc 
catacataaa ataagttttt 
ctggcttaac ttcctcagca 
cattctatca ccgaaaaaaa 
ccctaaaata aagttttctt 
ttaagaaatg ggaacaacag 
gtacttaacc gtattctata 
aaaaaaaaaa aggacccaga 
cttccttaga agatactaca 
ctgggattcc aacttgtttc 
cggctgagct cctcgctgcc 
gctctacagt gcaagcgagt 



atgccattat 
acacaaaaag 
ttctgccact 
ctaatatgtg 
tctaccatag 
cttggtctct 
agaacactcg 
tagaatgcac 
aacgatgtca 
aactactggt 
ctggagtccg 
aattccttct 
agtgagctct 
atggctcaca 
tgcacacaca 
ggacacataa 
ggaaagctgt 
atcaccacgc 
attattcagt 
agaacaagcc 
tgtctgccaa 
cagggagtta 
nnnnnnnnnn 
nnnnnnnnnn 
gaagaaagcc 
tgagcttttc 
taagagcact 
ctcacaacca 
taaataaata 
gacacatgtc 
gccagcctgt 
tcaaaaaaca 
actttatacg 
ggacaacggg 
tccgacaaac 
tgatagattg 
gtcacccagg 
agggatgtga 
gggagtggga 
tactatgcac 
tgagggtttg 
atcccttctc 
catcctctga 
cccagggctc 
agtgtgccct 
nnnnnnnnnn 
nnnnnnnnnn 
cttccaaagg 
ttccagttcc 
aaataaataa 
aggagatgtc 
ccaccaccca 
ctcttctggt 
aaaaaagtta 
ctgactgttc 
aaaaaaaaaa 
ttctaaaagt 
ttctgctcac 
tcagctcaac 
ctggataatt 
gtaatgctga 
ctctgctact 
ttgttaaacc 
ggataaggaa 



cctcatggag 
atgatttttt 
aagcctacac 
cttttcaaac 
agctataccc 
tatttagatc 
tcttagcttt 
tttcaataaa 
ttcactacat 
gtcaggagac 
ggtgcccaga 
ttaagacagt 
gggcttgacc 
aatctcaggc 
tgaaaactgg 
ttgcaggcgg 
gctagcggca 
tttactgtct 
atgtgctatc 
tgaaaaagtg 
gtgtggaatt 
gaggccagtc 
nnnnnnnnnn 
nnnnnnnnnn 
atacaaggag 
ttttgtcgta 
gactgctctt 
tctgtacagc 
aatcttaaaa 
tttaatccca 
tcttcagagt 
aacaaacaaa 
tgaatggaat 
ggttcatgtc 
aggcactccc 
actttttaac 
atagagctga 
gaaccatttt 
tatgtagagg 
agcctggaag 
acagcccctt 
aggctatggg 
gtggctccct 
tggctccttt 
ttataaacat 
nnnnnnnnnn 
nnnnnnnnnn 
tctccagttc 
aggggattca 
atgagctgga 
tcaatgggta 
catgacagct 
gtgcagacat 
cacatacacc 
accatacgga 
ttagaagaaa 
acatccttat 
actgtatttc 
acacttacta 
aaacgtttct 
agaaatgaca 
cctcccttgg 
tccttcctga 
gcgcatacct 



6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 
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6780 
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gccgtcfcttc 
ctgccctccc 
gagagagggg 
attaaataca 
cccatgtaaa 
tttttactgt 
gagacaacag 
rinnnnnnnnn 
nnnnnnnnnn 
ttcggtccct 
gcttggagca 
gtcctggaga 
ctgttaaata 
ttttaaggat 
cccaacttgc 
gtcagggggc 
tcaggtgacc 
ttttaatggc 
caggctcgca 
gtggttataa 
ccagcccttg 
gaggtatttt 
ttctgttata 
aggattattg 
agtcatttga 
atttaatcca 
agagatgcat 
gaaaagtctt 
ggaacttagc 
acaaattgaa 
ctgtttccac 
ggcatttttg 
tctgaagagg 
tgcacacaga 
aaactcaggc 
ttaagaattt 
atccataaag 
agatggttca 
acttcccaga 
nnnnnnnnnn 
nnnnnnnnng 
cttctattcc 
actcttggac 
gaafcgtactt 
cctgtaggat 
aggcagaata 
ctttagctgt 
ccgaaactct 
acacatggtg 
ttcaaaaaga 
aacaaaacaa 
acggacatgc 
taggtatgat 
ttttaacccc 
cactggcaag 
gattattaga 
aagaataaaa 
gcaggtgatc 
nnnnnnnnnn 
nnnnnnnnnn 
agaaattcaa 
taaaaaactc 
caaaatatat 
cttgatagca 



agggtgttga 
tcgaccaggg 
ctcagtctcc 
gctgtttaaa 
agtctgaggc 
aattctagtt 
gcaaatattc 
nnnnnnnnnn 
nnnnnnnnnn 
cgaatttggg 
gcagaggggt 
acagaaacfct 
taaatacctc 
caggcacaga 
ctttctgcta 
tacgtggagg 
tggcttggca 
cccttcccct 
ggtccaccag 
gcatgtgcca 
agtttactgt 
gtcaagtttg 
gcttagtctg 
agatggacag 
aaaacgttgg 
aagtaattcc 
agcaggcggg 
aagtattggg 
agacttgagg 
aggaaaaaaa 
aagaaaacag 
tgacttgtca 
ctcttttaaa 
aggcagcttt 
cctggctagg 
ttataagact 
cccatcctgc 
cttatagtta 
aacacaaatg 
nnnnnnnnnn 
aattcagagc 
ttcattacat 
catccaaggc 
cagfcttgtac 
ctcagcactg 
gtcataaatt 
tcttctagag 
gattctaggg 
cacatacata 
cattcaaatt 
aacaaaacaa 
ctggtaccca 
ctaccacttg 
taagctgtct 
catagcatgg 
agtaacgtac 
ataaaaaccc 
tctgtgtttt 
nnnnnnnnnn 
nnnnnnnnnn 
agaaacaaat 
aaggctttaa 
caatttaatg 
ctcttataat 



cgatgaactt 
tgttcagctg 
aagcctttcc 
actcctccgc 
tcttcctaaa 
tctggggaaa 
cagaaaggcc 
nnnnnnnnnn 
nnnnnnnnnn 
tttcttctgg 
caggaactcc 
tggttgcagc 
cattatcttc 
gactccatgc 
ttttttaaat 
aattgtttct 
gcaagcacct 
gctctgaggc 
tgagcaccag 
ctgtgtcaaa 
ttgtctgtgt 
acaaaattag 
gtcttgaact 
gctgtgtgac 
ctcatgacac 
agcaccagag 
gccccactcg 
gactaaaagc 
ccagcttggg 
aaaaaagctg 
ggttactttt 
ttggagaaat 
tttagtttaa 
ctagagtctt 
caagctctta 
ctgtttttct 
aggtcattgt 
attaaacatt 
cacattcttn 
nnnnnnnnnn 
tgatatgtag 
tactgttaat 
agaaggaagg 
aggaggccct 
gtggggggtg 
caaagtctgg 
aacctggtgt 
ggatctgaaa 
catgcaaccc 
cttcctcggc 
agaatcatta 
gggagattaa 
ggtgctggga 
ctcccaacgc 
taatctgact 
tagatttaat 
tgccagccat 
gcaaggccag 
nnnnnnnnn n 
nnnnnnnnnn 
gccaaataaa 
atggtaagat 
caattatgta 
tcacatagaa 



gtggacctgg 
cgtggccagc 
ttaacccgaa 
cctgcagaga 
tggtatctgt 
gacctttctc 
cccattttct 
nnnnnnnnnn 
nnnngcacar: 
gacgtggtag 
aggctggcat 
attaatgaac 
tcataaacag 
cagactacca 
tgtatcagtg 
ctcctaccaa 
ttacccctaa 
aggctctccc 
gtttctgctt 
ctcagtcact 
ggtgatttgt 
ttgagtatgt 
catgatcttg 
catgctcggc 
tttacaggtc 
gctgagacag 
gttaggttaa 
aggaggatct 
cgctgtggga 
gaggaagtga 
ggcaacaaat 
gatttaatta 
ttttatctca 
ttcactctct 
ggacagtgtt 
ttctcaggtc 
aagtaccggc 
ggattggatg 
nnnnnnnnnn 
nnnnnnnnnn 
tactaactcc 
agtggtaact 
ctggcaaaaa 
tggttcattc 

gtcctggaaa 
gatccccagc 
accctcttct 
aaacaaccca 
tatatagtgt 
atgttttgcc 
aagaagacat 
acctgggtcc 
ctaaagattc 
tcagggcgcc 
gcctggagct 
gatggtaatc 
ccacaatcta 
nnnnnnnnnn 
nnnnnnncga 
tacacatatt 
atcatattct 
tattcaggag 
gaaaaaaaac 



cagacacagt 
atgaacgctg 
agctcatcac 
ggaaagcagc 
ttctcacagt 
ggtctttagc 
ttttaaagct 
nnnnnnnnnn 
cttgtgaagt 
catgtgactg 
tagctgcaga 
tagaagaatt 
tgttgccttt 
ctcaaccact 
gccaccaaac 
gtgggcccca 
gccatctcat 
tatatagccc 
gtccttacct 
aagctttgcc 
caaaccactt 
aggtcttttt 
cctcaacctc 
tgtgtgtttt 
ttccatgttt 
gaggatctca 
tatcatcact 
gaagttcaag 
ccctgttttt 
atgtgagtgt 
aggtgctttc 
agacttttct 
ttgttagtgt 
cctccacagc 
agctgtagct 
atgatacagc 
atgtgtgttt 
gaagacatgt 
nnnnnnnnnn 
nnnnnnnnnn 
tactcaatga 
atgtaccaaa 
tgtatgatga 
catttctggc 
gtgaaggggc 
gaggactaaa 
acatggtgcc 
gccctctgta 
tatacataaa 
ttaccaaacc 
ttcatgtatg 
tagctcccct 
cctgcaaaag 
ttgtaacaca 
agattttgag 
tcaagcaaca 
ccagaacttg 
catagcacgt 
nnnnnnnnnn 
ataaatctac 
gtaataaaga 
tggatgaaaa 
atctctggtt 
catgaaaact 



tgctggccag 
caaaagcaga 
aaggagaacc 
atcaatccgc 
ctccaaatca 
cccgtgacta 
tctannnnnn 
nnnnnnnnnn 
gtccacatct 
tcactccagt 
gctggagcag 
tttttgtctt 
ttatttaagt 
gagctacacc 
atggggagag 
ggtttcaaat 
tggcttcatc 
tggctggcct 
ccccagcact 
aagccatagc 
ttgttccact 
ttctggaatc 
acgattattg 
agcatgcatt 
gatatgtttt 
aggtcaacct 
gacttcagga 
gtcatcttta 
aaaccagaaa 
tcacatagtc 
tttgaaggct 
actgagtgcc 
ggtgtgcttg 
tcctggagtc 
tattaagttt 
aggaaaatac 
agcataatga 
agttttggtt 
nnnnnnnnnn 
nnnnnnnnnn 
atcctttgtt 
gagtcaaata 
tctgggatgg 
aatgcataga 
gggaggttaa 
cgattaagag 
tcacgactgt 
gatacagaac 
atattttttt 
tcaaaaacaa 
tctgcccacc 
ggaatggaga 
cagtaaatct 
gcatgatgag 
cttaatgctt 
aaattaactg 
agaggcagag 
tgcagtannn 
nnnnnnnnnn 
acatgtaaaa 
gataattgtc 
gatctaatgt 
ggcttttgaa 
gccaaacatt 



10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
• 11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
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attagaatac tccacagatg gtattttggc agcacataca tcgaagggct gtgaaagatg 13860 

tgtagatcat ccacgccttg ctagggagag ggcgggtgtg tgtggggggt atagctgttt 13920 

gggaaaataa cctggtaatt cctcattagt taaatcatag tcagaacctg gactagcaac 13980 

ttctctctaa aatacattca ccctcagcat ctgcattgcc aggaaaccac tcctagcagg 14040 

atctgtacgt ggatcaaggt agtagcatct gcatttaatt gacattctcc taaatgcttt 14100 

aaattatctc tagattactt atagtagcca agatgatgca aattatgtta cactgtatta 14160 

tctggggcgt aacaagaaaa tgtctctact caggttcatt caggtgcagt acttcccctg 14220 

aatacttctg aatacacgga tcaagaagcc acagaaagag ggctaaccat atacaagcat 14280 

atagtacact aataaccatg tacaaccata tagtacacta atattcagtg cattactcaa 14340 

aatgcaaaca gatggaaaca atccaacagc ctgtaagctg aaaaacaaga taagcaaaat 14400 

gtgctgggcc tagaggccca ggtctataat tccaactaag gtcgaggcag gaggatctca 14460 

agttcaaggc cagcctagac aacttagcaa gaccttgtct caaaacaaaa agtaaagagg 14520 

ctgaggatat agctcagtat agagcatctg cttagcatgt gcactgacag ccgtatcaca 14580 

gaggaaaaaa aaaaataagc aaaatgtgat ctgtctgcac aacaggatat cacagccccc 14640 

taccacaggg gaacgacaca gtaacacaac aaaaacttag ccctgaaaat actatggtaa 14700 

ataaagaagt gtcactgagg atcaggaaat gcatgactcc atttacatta tatagaaatg 14760 

agaagatcag tgagcctcta ggactcaaga gatttgggat tggcagctaa agggtactgg 14820 

gtttctttat gggggtaaga aaacattcta aacttaactg tgagaatgac tactcaacaa 14880 

tgtcaagtgt tcaaaaatca tacttttttt tttttttggt ttttcaagac agggtttctc 14940 

tgtgcagtcc tggaactcac tctgtagacc aggctggcct cgaattcaga gattcacctg 15000 

cctctgcctc ccaagtgctg ggattacagg catgcgccac cattgtccgg ctcaaaatca 15060 

tacttttaaa aattgcccag tgactcatga atacaatcag aggcgggaga ggacagtggc 15120 

aaactcagga taccagtgtc ttttatgtct gctgcccaac tatcaatttc ccatagttac 15180 

cagagaactt tttggtttgt ttcatcttat ttgttgcttt tggtagaatc tcaatatagt 15240 

aagatacaag gctggcctca tactatatag ctgaggacga ctttgaactt ctaatcctcc 15300 

tgcttccatc tcccaagtgg tgggattaca ggggtgtacc gctatgccca gcaagcacaa 15360 

agccatttga accacacccc agccttttca gagaaacctg tacaagcctt agtgccttag 15420 

catattaagg caacaaaaga cataatgcgt ggctaccata gagtgtttgc ctaccatgtg 15480 

tgaggctcta ggctaaatgt ccagcactta taaaaaagag ttaaaaacac tcatgactca 15540 

aggatgacta tgcagtcttg tgtacaaagc cccgcattca atccccagca ccgtgcacat 15600 

caggcaggct ctgtagagga cccagcttaa ggtcatcctt aggtaagtta gaggccttag 15660 

atggctacat tagatgagac cctttctcat aaacagaata aataatttaa agctcctgat 15720 

caaacactat gccttcccat cacactcaga ataaagcact ctactggccc tttaaggact 15780 

gcccatctgg aagagaaacc taagttacat tccttgcttg tgtcatatgt gataacaaac 15840 

tcactggaaa tacgaaaata cagtcttaag cttggtcaga aagcttcccc agcaacatga 15900 

tntcagagga cataatgcag aaagtggaca aatgcaaann nnnnnnnnnn nnnnnnnnnn 15960 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16020 

nnnnnnnnnn nnnnnnnnaa tcagaggaca tctttcagga gttagttctt tcctccctct 16080 

atagctttca gggatcaaac tcaagtgtgt actgagcgct tatgcccagt gcgccatcgc 16140 

accaggcctg cttctttgtt ttttatgggt ctgaatcaat tagcaccatt acaacaatgt 16200 

tgacaatcag caagtacctt tctctacctg gctagtaaga gaagtaagtg cctttggtgt 16260 

gtgaacgcag tttctcttgt gaagtgcatg gacttgatct ttgctcacaa cgttttttag 16320 

gtccttaagt tgcttgggtt ttatgggaaa ggctcttggg ttttttgaaa agattttact 16380 

acaacttgat ataatcatta tttttaatcc tttaaatagt atgacttatt ttaacagatt 16440 

aatattgaac tgttctttca ttcttacata ataaatcctg ccttaaaaaa taatcctctt 16500 

agcttccttt ctctattttc aaatttgttt tatatttttg catgattttg aacatttata 16560 

aaagtaggca gacaacacag tagaaccaag tccccatata gctgtgcaca tagcttcaga 16620 

ttattgcctg ataataggtc ctgttttgtc tctgttttct cacagggatg tgttattgtg 16680 

tgtgtgtaca catacataca tatatgtatg tatgtatgta tgtatgtatg taatgaactc 16740 

cctttaacaa aacaagtact gggctgggaa gacagcacag tfcagttatgt gtttaaccgc 16800 

acaagcatga caaccagagt tgagatcccc accaaccgca taaaaagctg ggcatagtgg 16860 

cattgacctg tagccctggt gctggatgaa agctggggag gcaggtagat cggcagagct 16920 

tactggcaac aaatctgccc agtaggtaag ctctgggctc agacatccta tataggaaaa 16980 

agatgaaggg cgaggcgcag cggcacacac ctttcgtggt agtgcttgag aggcaggggc 17040 

aggccagtct ctgtgaccag cagcctggcc tacatgtcaa gttgcaggac agccagagcc 17100 

accacctact gagactgtct cagaaataag ttttttaaaa aattgagatg aaggagctgg 17160 

taagatggct tagaaggtaa aggcacttat cactaagcct gaagccccga gtttgaccct 17220 

ggaccccaca ctgtagaacc aactcctcca agttcttctc agacctccag cagagcacaa 17280 

gtgtatgcag acacacacac taagtaagtg aatgtaaaaa acatgacgta gtggcactgg 17340 

cctttaaacc cagcattggg aggaagaagc gggtggatct cttgagtttg agaccagctt 17400 

agcctacata aggaatttca aggcagccag ggctacctag aaagtagctg tttatgaatg 17460 

aatgaataga aggaaggaag aaagagagac agacttaaaa aatatatgct ggagagtaac 17520 

agaagaggac accggcttgc tggtgtcttg acctctggct tgtacacata cacatgtgta 17580 

gtgcatacac ccacatacaa ttgtactcag acacacacaa acatgtactc attcatatac 17640 
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tgcacacctc 
ttccaaccac 
tgtatacatt 
tctggaagcc 
cgacagcagg 
agccttgtag 
ctatcatcct 
aattatttta 
agtgtccaca 
gtaagccacc 
nnnnnnnnnn 
nnnnnnnnnn 
cagacttcct 
cactcaaaaa 
gatgcacaca 
gcccatgtga 
attgggattg 
aatgttttca 
taagtaaagg 
ttgcttgtaa 
ataattgatg 
tgctttgcta 
taaatgatga 
aacgtccata 
accatagtct 
cattaacatt 
tgagagaaat 
aataatatta 
gtagtctcag 
aatgtaagga 
aaccttctaa 
aattgccaca 
gcctaaaaat 
tcctcctcct 
tccttttctt 
taggtaaccc 
atattaaagg 
aaaaatatac 
tgtacctagt 
agaggctttt 
aagcggttga 
atatttggtc 
ttgaggtttc 
ctctgcctgc 
tgcctcaatg 
ttaccgtgat 
ggaccaggga 
ggggtgggca 
tcagatgaag 
tgttcccacc 
ggagagatgg 
cccagcaacc 
gtgtcagaag 
aattcttttt 
acataaccct 
gaagctcacc 
nnnnnnnnnn 
nnnnnnnnnn 
gatctcatga 
ctgagcatct 
tatatgatac 
aaccatcact 
tctggaactc 
tcccaagtgc 



aacactcaga 
tcatatgtat 
caaagttgag 
tgctcgcagc 
tcatttgtct 
tagtcacctg 
catcttgcta 
attttatgct 
gaactcagaa 
atgtgggacc 
nnnnnnnnnn 
nnnnnnnnnn 
gtttctctca 
gcagggagcc 
tccacagtca 
actttagaac 
tgttaaattt 
tccgcggaaa 
tttatagttt 
ggtttccttg 
gttaggagta 
taattcaaca 
tatctgtaca 
gcatagatac 
tataatatgc 
tttttcatca 
gaccattttt 
gcctacatat 
actaaaccag 
tgctgtgtat 
aacagtctta 
ttcaagctat 
atctaaaggt 
cctcctcttc 
catcctcctt 
gtaacttact 
tgtgfcatcac 
gtaaataaag 
ctaaggccaa 
cttctcttgg 
ggagtgggag 
cctagttgat 
aaaatttaat 
tgcatgcaga 
atcataaatt 
catggtgtct 
gtggcacttt 
atgagaccct 
atgtagaact 
ttgctgcccc 
ctcagcagtt 
acatggtggc 
acaggacagt 
aaaaaagaat 
gactaagaca 
tcagcatgaa 
nnnnnnnnnn 
nnnatctaag 
cagaggttgt 
ctccagccca 
atattaatca 
gctgtcgttt 
actctgtaga 
tgaaaacaaa 



aaatgaaaaa 
gggttgtaaa 
tcaggatcca 
tgtctttttc 
ctaagtgtta 
tgttctctgt 
tatttcattc 
tatgaatgtt 
gaaagtgtca 
tggaaaccaa 
nnnnnnnnnn 
nncctttggt 
tggaacagtt 
ctcgtacaaa 
ttacccttct 
tggcttgttg 
atagattcat 
aggttgtctt 
tctttatatc 
gttggtattg 
taaataaaag 
tcttttcatt 
gagtaatcct 
acatcttata 
agcgtgcttt 
caaatacaca 
cttaatctgt 
ttcttctgtt 
gagcttccta 
atctgccagc 
ccacttagag 
aatattgtta 
aaacactggt 
ctcctcttct 
tcttttctta 
atgtatgtag 
catatccagc 
atggatgcca 
aatagggatt 
tctcactaga 
gtagtgtgac 
agaactgttt 
gctagaccca 
gctctcagct 
aactgtaagc 
cttcacagaa 
taggaggaat 
cgtcctaacc 
ctcagatcca 
aattaaatgt 
aagagtactg 
tcacaaccat 
atacccacat 
tgctttggtc 
acaagtgagg 
gcttgtcgaa 
nnnnnnnnnn 
tacactgtac 
gaactcagac 
aaataattct 
acttataagt 
tgttttgttt 
ccaggctggt 
tgtgtgcacc 



acaggtacca 
tgcttatatc 
acgtaaactt 
tcctcgtacc 
gtttcccatc 
aaaatggctt 
aatatatgta 
ttgcctaagt 
catattctgg 
atccaggcgc 
nnnnnnnnnn 
caaagatcct 
tccccctgag 
tgagacttcc 
tcagagcctt 
tgtttcataa 
ttagggagag 
gccacttact 
agtcttgcac 
tgtataaaca 
gtagaatttt 
tttatgccaa 
ctgatccagt 
ctgttgagta 
ggttcaagac 
ttgaccttta 
taatgtaaaa 
ctctttttca 
tctgtttctc 
taggccttat 
accatgttca 
tctaagggaa 
aattgctgtg 
tcctcctcct 
tttttgaggc 
accaggctag 
aacacttgct 
attactaaat 
ttttttctac 
taggaaggag 
cgagaatacc 
agaaagatta 
gtctttcaag 
actactctag 
aagcctccaa 
atagtaacct 
ggccttgtta 
atgtgagaac 
cctgcaccat 
tgtacttata 
actgctcttc 
ctgtaatggg 
acattaaata 
atggtgtctg 
aaaggtgttg 
gcgnnnnnnn 
nnnnnnnnnn 
tgtcttcaga 
ctttggaaga 
tactagtaac 
acatgattat 
tgttcttttg 
cttgaactca 
accacctctg 



tttacacctc 
tgtatgtgtc 
ggatagtagt 
ttttcccctg 
ctctctcttt 
tgccgtgtta 
tatattacaa 
atattgcacc 
aactggaatt 
cannnnnnhn 
nnnnnnnnnn 
cagtttcgac 
atttactagt 
tagctatata 
tgtcatgtca 
aaagtgtagt 
ctgacaactt 
tgggctttct 
atttcctgtt 
ttttctccca 
aaaattgctg 
tctacttgac 
atttgcacat 
catatatatt 
agttgccctg 
tcaaattttt 
tttttataaa 
actcttagaa 
tgttcttaag 
atacaaaagg 
aacatatggg 
tatcttcact 
tttgttgatg 
cttcttcctc 
atgatttcac 
cctctgtctc 
ttgagatggt 
tgttacttcc 
tttgcaagtt 
agagaggagg 
cagtaggctc 
ggaagcatgt 
ggagggggcg 
tgtcaagcct 
ttaaatgctt 
gtggtgattt 
agaggaagtg 
cactcttctc 
gtctgcctgg 
agaattgttt 
cagaggtcct 
atctgatgcc 
aataaataaa 
ttcacagcag 
tgtgacactc 
nnnnnnnnnn 
nnnnnnnnnn 
cacaccagaa 
gcaatcagtg 
atggaacaat 
gcacatttat 
aggcccggtt 
atgatctgcc 
gctatcactg 



cgtgttcggt 
tgtatatttg 
gggttgatgg 
tttgtttcta 
tgctgatggt 
tttcaatatg 
gatagattaa 
ttgtgtgtct 
gcaggtggtt 
nnnnnnnnnn 
nnnnnnnnnn 
tttgattacc 
ggaagaaagg 
attaggccga 
agtgtatttc 
tgggctcttg 
tacagtatta 
tttatgccct 
agatttattt 
attacatacc 
tatgaatgac 
atgtttaggt 
cttactttct 
taaggtattt 
tgttcctcaa 
aaaactatct 
aatagttata 
tcagagtatg 
tcacttatat 
cacccatcac 
cctttgaggt 
tctagcagat 
ctgctcttcc 
ctcctgcttc 
catgtagccc 
ctgagtgctc 
tagaggaaaa 
agtcaaactt 
ggctccatta 
gaaggagaga 
atatatttaa 
cttaggggct 
gtctgtctct 
gtgtgcttcc 
tcttttatag 
taatatgcct 
tgtctctgtg 
ctattggcct 
aagctgcctt 
ttggggggct 
gagttcaatt 
accttctggt 
taaataaata 
taaaacccta 
tggatctctg 
nnnnnnnnnn 
nnnnnnnnnn 
gagggtgtca 
ctcttaactg 
caagttttat 
catatcgtgc 
tctgtgttgt 
tgcctctgcc 
ctgtcttttt 



17700 

17760 

17820 

17880 

17940 

18000 

18060 

18120 
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ttttttttta acagttattt atttcgtgca 
catggtatgc atgtggaggt cagaggacaa 
tgggttctag gaaccaagct caggttgtta 
gccatcctgc tggccctact ataggtcctt 
caaacaaaaa ataaacctca aaaaacaaaa 
aagtcctgaa cgaatgtgtt ctgttgacat 
aagccttcct ctggtcctgt gaggatgctg 
tttctcctct ggatgactgt aaccccagat 
accctgccta cttaagctgt acacaatgaa 
gacactgtcc atcagtgaga gcacagccta 
cgtttccaga accaagcctt gctggataga 
gataaaaagt tttcaaggag ggccaggtgt 
ggaggcagag gcaggcaaat ttctgagttc 
aggacagcca gggctataca gagaaaccct 
aacaaacaaa caaacaaaca aaaagccaaa 
ccaaaaacca aaaaacaaca acaacaaaaa 
agaatttcac tgtttaaaaa aatggctgaa 
aggcattcat gtgtgtagtg gtttaaataa 
tgcttagtca ttgagtagca gtactagaga 
ggagtggcct tgttgcagga attgtgtcac 
tcacatccaa agacattcta 
acacaccttt aatcccaaac 
ttgaaagtga cgtttaatta 
agaggaaaga gaggcagcat 
accagggcat ttgtacagag acaggttgca 
agagaatgag aaggagccag gagattagga 
gcattttagt cagaaactga gagaagtcaa 
agcagcaaca gctgagttaa actagccaac 
ttactcagca gcaaatctca gaggctaaaa 
ggctagacgc ttccagggct aggcctaggt 
gacaacagtt aatacagaag actatgtaca 
ccagcgtcat gcctgtctgc attgttagga 
caacttttcc agagatggtc cactgttttt 
gggaaggttc caagagactt catcccagga 
gtcactgtta tttagtataa tgactcctga 
tgcagatcaa cataaagatg aactttcaca 
ttataaaatt cctgatttga tttaaataat 
gttggtttgc cagaaagatg taataacgtc 
ggtggcagat gcctttaatc ctagcacttc 
gaggacagcc tggtctacag agtgagttcc 
gtcttgaaaa acaaaacaaa aagaaaagta 
agctttcaca actacaccaa aaagagacat 
tattcattct agatcaggtc caaggatgaa 
catggataaa ctgttgtttt gagcttaaag 
acattctact gtaactttcc tttttataaa 
ttgtgtctat aaaggtatag aaggacagag 
tgctccatcc acccaaataa atatgtgtgt 
atgtatgtat atacatacat gtgtaggtag 
acacttggga agttgatgag acaagtgaga 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nrmnnnnnnn nnnnnnnnnn nngaattcac 
gaaccgtgtg cctctgcctc taaagtgctg 
tagtttaaat gtttcttatt tttttgttta 
accatgtgga tgcatggtgc ccttagagtc 
gagtgacaga gggttgtgag ctgggacttg 
ctcttaatag ctgagcctta tctccaggcc 
tttattttgt tttgagacag ggtgtgtagc 
tcccctcaaa gtaaagatcc tcctgcctct 
accacatgct cagcacagcc atgattcata 
gccttagaga taaataaatt attttttttt 
aatgcacact gtagctgtct tcagaccccc 
ggttgtgagc caccatgtgg ttgctgggat 
ggtgctctta cccactgagc catctcacca 
gtaaggtggg atcatctcag tctccgggaa 



ggtctatgtg 
aatgaaggta 
agagtgatga 
aagagggagc 



tgcatgtgtg tataagcatg 
ctttcaggag ttagttcttt 
gacttgcatg gcaagtgcct 
atataaaaag atcatatgcc 
ggaccatata atattgtggg 
gtctgtactt cagacccatg 
atagtctgtc taaaaactag 
tgttcctctt cagagactgt 
tgagctgagt ttccaggtta 
gcctaacagt acacatgtct 
gcatatttgt ctgtttggct 
ggtggcacac gcctttagtc 
gatgccagcc tggtctacaa 
gtctcaaaaa accaaaaaaa 
aatccaaccc cccccaaaaa 
gtttttgagg tttaatttat 
taatatttca ctatccattc 
aaatagcccc cataggcttc 
gggaattgaa ggtgtggcct 



tttgaggtcc cagcaacaag 
atctggctgg aattcagaca 
aagttagttt ataaaaagaa 
attagagaaa gatctgctgt 
atggcagagg gagagggagg 
gagctagaac aggtgaagac 
cagattgcca atgttaatag 
attgaatcag ttagcttgga 
agaatccaga aagaactaga 
acatcttaga' cctagattag 
tagcagacgg agagagtaat 
gacatggata tgaacctctc 
gtcatctagg aaaggctaag 
tgcatggctt ttgagaggcg 
ttgctgctta attacgacat 
gctttagccc atcctattgg 
aattaatgct gtttagatga 
tttaggaaga aagctttagg 
agaatcaaga atagaatgtg 
ggaggcagag ataggcggat 
aggacagcca gggccacaca 
agtaaaggct gcataataaa 
gcttgggaca aatttgtgat 
gccacaagtg tgtgatatga 
aataaaacac tgctttgaaa 
ttttatctat gagataattt 
agaaagaaat aaggtgtggc 
gtgtgtatgt atgtatgtat 
gtatatgtgt atgtatataa 
ggttgggccc ccnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn 
tctgtaaacc atgctggcct 
ggattaaagc atgtaccacc 
tgggtctttt acctgtatgt 
cagaagaggg tatcagatcc 
aacctaggac ttctaaaaga 
gtcccatgga tttggggggc 
tcatgcttga atttactatg 
gtctacagct gctaggattc 
acaataaaaa aagaaagaga 
taaagattta tttatttatt 
cagaagaggg agtcagatct 
ttgaacttcg gaccttcgga 
gccccgagat aaataaatta 
tcttgcctgt tactccttcg 



taacgtatgc 
cctcccactg 
ttaccacaga 
gggcaaaaac 
ggagtggatg 
ggaattggca 
agatcacagc 
ccaccaagct 
cagcacagta 
gctttcttca 
tatttcactt 
ccagcactcg 
agtgagttcc 
accaaaacaa 
aaaaaccaaa 
tgcatgtcac 
acgtatttgt 
tacagttgaa 
tattggagta 
gttgctctga 
tgcccttaat 
gcacccatgt 
cacagagcag 
aggggttttc 
agaacaagcc 
gctaagcaga 
aaggagtttg 
aaagatgagc 
actgcatgga 
aagccttgga 
agccacttct 
ggcaggcaag 
agctctgaga 
gccttttctt 
gcatatttcc 
ataaatgatt 
agatagttta 
gctgggcagt 
ttctgagttc 
gagaaaccct 
gaatacaatg 
caaggaaaaa 
tgaacaagac 
ttaactatca 
tctaaagaac 
atctgggctc 
gtttatctat 
gtatgcatga 
nnnnnnnnnn 
nnnnnnnnnn 
tgaactcaga 
acaacccagc 
atgtgtgtgc 
cctggaactg 
gcagcaggtg 
tttgtttcat 
aagccctgac 
gaggtcttgt 
gacctaaatg 
tattacatgt 
cattacagat 
agagcagtcg 
taatgtatgc 
ctctcccttc 
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tattcatgct 
gtggcaggga 
ctgggtggaa 
aattttattt 
ggacctgcca 
tttacttcca 
aaatagtgac 
cccgggaagc 
tggtcagctg 
gctgagtacc 
caaaggatac 
aggaggcaga 
gatgggatta 
aaccccaccc 
gagaaagtcc 
gccctcgaag 
cctcccaccc 
tagaacttgc 
gggagcaggt 
tctctgacag 
ttccatctca 
tgccccagct 
cagttgtcaa 
atcccaacac 
gatcaccgca 
aattaccgtg 
aatagctgcc 
tgtcagagct 
tccccataag 
aaccagtaag 
caccaaacca 
gtcttttccc 
ccccacccct 
ttttgtagac 
cccaaatgca 
tctagctgca 
tggacacgca 
tgggtgggta 
nnnnnnnnnn 
nnnnnnnnnn 
gtttggttct 
ctggccccac 
cactactctg 
gctaacaagc 
tctcccacag 
taaaataact 
gagggtggtg 
gaactgcccc 
gacaggtgtg 
tgacaggaca 
aaaagccatc 
gtgtcctggg 
agtcagtgag 
tttttttttt 
agacagctcc 
tgctgggatt 
tctctccagc 
gaggagagtc 
ccataactac 
ccctcrtggcc 
aagagggcag 
aaaggagatg 
ccctttggct 
cggcgctcac 



tgggtaactg 
ccactgcgcc 
gtagagggct 
aaattttata 
acggttgtca 
tactttctac 
agatgacttt 
ccaggctaaa 
agggaccctg 
caaagtgctc 
aggcatgagg 
gctccaagag 
gaaaggcatg 
ctgggtgtcc 
caagaccagg 
actagcttgc 
tcgtctcttc 
cagtgctcac 
atgcctcaga 
ttcctctcct 
aaaaacaaac 
tggctgggaa 
ttaattgtaa 
aaagaaatga 
cattgtatac 
cagtgattca 
cttgaccccc 
tcctgtggga 
aggagtcacc 
atgtgagcag 
aaaacaaaca 
acataaagtt 
ccccgagaca 
caggctggcc 
gggattaaaa 
tacgtatcaa 
aactgtatat 
gggaagtggg 
nnnnnnnnnn 
nnnggcagga 
gagcacctga 
tgcccactcc 
ctcctctccc 
tgggcagtgt 
tgagacactc 
agggggactt 
acagctcagg 
tggacaacct 
accccagtac 
aggttggagc 
atggcctgag 
gacctcttga 
gtcaagggaa 
tttttaagat 
agaagagggc 
tgaactcagg 
cccctctcag 
atgctgaagt 
tgactccccc 
ccacccattc 
gcaggggcag 
aagaaaaggc 
cttcccccta 
cggagagaac 



gccctggctg 
cacagggctc 
gcatatataa 
cagctctata 
agggtaatgg 
attaccccac 
gaaggaaaat 
tgtaatctgg 
gtgaccctct 
ctctgggctc 
actgaggcct 
tccaaccaag 
tttgcaaata 
cttccctgcc 
ctggctggag 
tcgcccacac 
ctccaacctc 
tgtaatgtcg 
tctccctcta 
cctgaatcct 
actgggaagg 
ggggtaactt 
atttcaaaat 
taaacattca 
atgtttttga 
agataaaaac 
ccacccccac 

gggtggtttt 

cctgtcccac 
ggatgatacc 
aacaaacaaa 
aagcacacaa 
atgtttctct 
tggaactcac 
gcatgagcca 
aagatgacct 
gcctcagtac 
ggggagggta 
nnnnnnnnnn 
tcctgtgttc 
aggggaccaa 
caccctctgg 
agatccacgg 
cacctccctc 
atgtccactg 
gtcaatcact 
gacctgcatt 
ctgcagcaac 
ctagtgccag 
tggtagatgt 
ccagcctgcg 
gggtagcatg 
cagctctagg 
ttatttattt 
atcagatttc 
accttcggaa 
tcctgatgcg 
ccttcctacc 
acccccattc 
ttccccagcc 
cttctgctgg 
ccttcccatg 
aacctctcac 
tgttgagaaa 



attgatgaga 
cctcaggatc 
cacgatctca 
taaacgaagg 
cgatgtagtg 
aaqtggcaag 
tgaatcggta 
gtcaggcctc 
tgggccagca 
caagggcctg 
gacgaaccca 
aggcaggaag 
ctttcaaatt 
tcttgccaca 
ctcctgatag 
cgccagatgt 
agtggagggt 
tccctgtagg 
ttcgctgccc 
gcttgttggc 
tgttgagaga 
acaatgctct 
agctagtaga 
aggcgatggg 
aatgtcaggc 
tataatttta 
aaggtccttc- 
ggtgtacaca 
gatgccatgc 
tgtctcttat 
caaacaaaaa 
agtagccacc 
gtataacagc 
agagacacag 
cgaactaacc 
agttggccat 
aggggaacgc 
t g gnnnnnnn 
nnnnnnnnnn 
atgtgcaaca 
gcaggctgat 
catccccatg 
agccataggg 
aaggttcctt 
gaagatattg 
acactcttag 
ctaccaaata 
tgaaccctct 
gtgagagagt 
ttgggccacc 
agtctcctct 
gtcaccaaaa 
ttctctgtgt 
attatatgta 
gttacggatg 
gagcagtcgg 
acagggcagc 
ccaccccttc 
ccttgcttcc 
ccacctgatt 
gcctgctcac 
ggtccccatc 
ttttgagtac 
agggagaaca 



gctgatttcc 
ctcagtacag 
actttatttc 
aactattgaa 
attttttttc 
tattatttta 
aaaagaaagc 
ccaggcctgg 
gggacgggga 
cacagagact 
gctatcattc 
ctttgggacc 
tacgatgcac 
cccaatagct 
gttccaccct 
ctgtgtcttt 
cccctgcttc 
tgtcatggtc 
tttcaggctg 
atgcgaacag 
cagagagcat 
actgcccagt 
gaggatttta 
tatgctaatt 
tgtaccccat 
aaaagctaaa 
ctgtttgtcc 
gacactcctt 
agggccacat 
aacaaacgga 
cagggttggt 
atttatttat 
cctagctgtc 
agattcacct 
agtaccccag 
cactggaaag 
cagggccaaa 
nnnnnnnnnn 
nnnnnnnnnn 
ctcgatgcaa 
gcccaggcca 
atgaacatgg 
tccccagatt 
tcctgctctg 
tagccattaa 
ccccggactt 
gccatgtgtc 
gtggtctcct 
gctagggcca 
aaagagaaca 
gcagttggga 
tcctacaagg 
ccctcacgga 
agtacactgt 
gttgtgagcc 
tgctcttaac 
aaaggccttg 
cgaacccctg 
actgatccgg 
gtacctggtt 
tcactggctg 
ttgccaagac 
ctgctggccc 
gagaactcag 



ccactgccct 
agctgcacag 
tttaaataaa 
ggttcagcaa 
ccccttccat 
aaatgaaagt 
tgagagacca 
ggtctcaaga 
ggagccggaa 
gtgtgggaat 
gtcctagaac 
cgagatgggc 
actcactgga 
gacatcactg 
cctgcagagg 
ctctcttttg 
ctggggaaag 
ccccattact 
tctcagtttc 
gctcaatatc 
gggtaatggg 
agggtagctg 
gatgttccca 
gctctgatct 
aaatatgtac 
aacagaagga 
agccacttaa 
cctccctcct 
gcgtgatatt 
aaaaaaacca 
ctgtccctgt 
ttgtcccctc 
ttggaactca 
gcctctgcct 
agctcttgac 
agaggcccat 
aaaatgggaa 
nnnnnnnnnn 
nnnnnnnnnn 
gctgtgtagt 
cgggttcttt 
ccacagatca 
catctctgca 
agcagcagtg 
attcctgtgc 
ctgactcata 
cctgatggag 
agttcttctg 
cactaagggg 
ggtcagtagt 
cactcttgca 
acagatcaga 
cctttttttt 
agctgtcttc 
accatgtggt 
cactgagcca 
tcccagatct 
aacatcagcc 
tcctcctctt 
gtccaacttg 
tagaaatgag 
ataggtgagt 

ggg a 9 atcca 

cgttcctccc 
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tctccaccct 
actgaccact 
caggacaagc 
ggagtgttag 
cttgaccgca 
ctctgagagg 
tttcgtctta 
gaccttgcgc 
agttcttcct 
gcatgatgcc 
ctttctcagg 
cagccagaaa 
gaactacaag 
ccagctggcg 
cctacccagg 
cctccacaac 
tttttctcgc 
cctggacctc 
ttcatccaga 
gctgcaccac 
gtgttctcct 
ctctcctccc 
gtaaatattg 
atcaaggcta 
gaggaggttg 
cagtgaggct 
cagccaaagt 
caccaccacc 
caccaccacc 
nnnnnnnnnn 
nnniinnnnnn 
gaacatctct 
gtgtgatcct 
tgcctacaca 
ccctttagct 
ggctcttagc 
ctcccatttc 
cttggaccac 
ccccctggag 
ctgtcctagt 
ccatcaccat 
ttctgacttc 
ggtccttaaa 
tcgttccttg 
taatccctca 
caggggcact 
gcagatttag 
tggctgagcc 
tctgggtagg 
agtaaatgag 
ggtggtggga 
tgaggtggga 
caagcaggga 
ggaaatggag 
atgcaaggcc 
ccaggacagt 
gggggaggag 
tcccatgtgc 
tgagaagaca 
tgttgctcct 
gacagtgcag 
ggctggctgg 
gaaatcttgg 
gaaaggtcag 



tctggcctct 
tcccactcag 
tgagtacgag 
ttcctacctc 
ccaggctgcg 
tgggatttcc 
cactgtggat 
atctcccttc 
gctcccatgt 
tggcccccac 
ctgccctggt 
cttctttgtt 
cctcatccca 
gaagggggga 
ccttatttca 
tgtgttcaag 
caagagtagg 
tgctcctcac 
ccccttgcca 
ccctccatcc 
ggaccagctt 
acaggacctt 
acctgtacat 
cccagaaccc 
ggcctggcaa 
cagagaggct 
cccaccacca 
accaccacca 
accaccacca 
nnnnnnnnnn 
nnnnnnnnnn 
ccttttttcc 
tctgcctctg 
atgtcttagc 
ggcttctctt 
cccagagcct 
cctctatctc 
aagacaaaac 
tcaggtctag 
gtgccctcac 
cctctcatgt 
ccagaaaaat 
attaagtcag 
ttcactctgc 
tctatagcag 
gccactcagc 
gcttcttgtt 
tcaggaggct 
tagcaactga 
atacccttgc 
gtctaagtca 
aatctgatgt 
caggtgctag 
actctaaagg 
ctggctgcaa 
gatgccatcc 
ggtccgggga 
cccaaatcag 
agtgggagcc 
cctcctccca 
gggaggacac 
ctcatggcta 
aatctgaatc 
ccctgcagaa 



cccagatttg 
acctcagctc 
gagcccccgg 
acgctcgaag 
gaacctggac 
ataagctttt 
cagtcctggg 
atgcctttgg 
agatgttgag 
cctagcaaag 
cctacccaaa 
gaccaatgaa 
cttctgctcc 
ggctctcagc 
ggcaccagct 
aaccttcagc 
cctccaggtt 
caccccagga 
aggtgccagc 
tacctcttca 
gctccccacg 
tgctctcagc 
ccggttaaac 
tgcctgagcc 
gccttctaat 
gtgtggcctg 
ccaccaccac 
ccaccaccac 
ccaccaccac 
nnnnnnnnnn 
nnnnnnnnnn 
tttttgagac 
cctcccaagt 
agcccttagc 
gtgaagggct 
tccttccttc 
cctaagcaac 
tgcagcctgg 
gggaggaaga 
ccatcctata 
acacgtgccc 
atctgatctg 
acaatccatg 
ttttggcagc 
gtctcccggt 
tgctcaggga 
tcccttccca 
gatttatcag 
gacaggagga 
agatgggacc 
cagatctttg 
gcatggggaa 
ggggatgtct 
gcagaaagtg 
taactgggtt 
tgtccaaagg 
gaggggtcaa 
gaggcacaga 
agtacatgtg 
gagccaagct 
ctctctctcc 
cgccgctcac 
agcctgagat 
cagaactctt 



cccccgcccc 
tgcctcaccg 
agcagtgcca 
gccaagcagt 
tcaaaacata 
tttttcacct 
ttcaaattta 
ctttaccctg 
gacccaaata 
ccacctgacc 
ggctctgaga 
tgactggccc 
aagttctgat 
ctagagagcc 
cttctaaaag 
cagggcctca 
ttggagttct 
cgctgtgaag 
tgtctacttc 
ggcttcttag 
ctcccccagt 
caacccccgt 
attgatatgg 
tggagaaggg 
gatccctcaa 
tgtaagggcg 
caccaccacc - 
caccaccact 
cacctcatct 
nnnnnnnnnn 
nnnnnnnnnn 
aatcttacta 
gctgaggtta 
agcaccaggg 
atgtcttctt 
aggttaaaca 
gaccttttct 
gctgtgtgtc 
cagggttcac 
gcacgcacat 
tccctcgcca 
agaagttagg 
ggacatgaag 
accaccagca 
gggaatttta 
gagacccctt 
gcccctccca 
agaggtgctc 
gatggtcacc 
cctgaagttc 
ttaccacgtg 
acacagaggt 
gaatgttcca 
agggtgtgca 
ggggaggcag 
gcagtgtcca 
gattctcccc 
aactgggatg 
tttcagatta 
gccttcaagc 
actggctcaa 
cccctggaca 
accccataat 
tggtccccac 



ccagcatctc 
tgaaacaggg 
tgttcctgta 
agactgctat 
gcagctgtgg 
gtacatttag 
aagccctcat 
tcttggtaat 
agaatctctg 
tgttgttcat 
gctaatctgg 
agacaccttt 
ccagggtgct 
ttcctttcca 
gtccttctgt 
tctccaatct 
agaggtttct 
ctgcaggctc 
ctctgctgcc 
cgcagcacac 
gcagccagca 
tcagcttgtg 
gggccagaag 
gtttacagga 
cataggggat 
cagagtgggc 
accaccacca 
accaccacca 
acccatacta 
nnnnnnnnnn 
tctatgaggg 
tatttaggct 
caggcatgca 
gtcaggaagc 
ccccttccta 
gcaccagttt 
gctctgactc 
ctcgcacatc 
gactcagaaa 
gctgatgtgc 
gacacatgca 
agtctgccat 
ggcaacaacg 
ggaaccaacc 
gggacctctg 
agaacaacaa 
tcacaggcaa 
agagaggcac 
ctgggcatcc 
ctcccggggg 
gttagactga 
ccaatgctgg 
ccccaagcca 
gcaggggctg 
tcattggcta 
agccagattt 
ctctgagtca 
ttgtggtctc 
aacccagtcg 
cacatggcag 
ggacagtttc 
gtttggggtt 
tgtacctccc 
ccatccccct 



cttcagcctg 
accttgcagg 
tccagaacag 
ccatgggttc 
acctcactca 
tcttcattct 
cttgcaagag 
tcatggcaga 
taaatactga 
ttcatccagc 
gctggcaggg 
ggacttacgg 
tcggggaagc 
tcctcagccc 
tatccctaga 
ggatatatga 
cctggagctg 
cctgaataaa 
caagcagcag 
gcagcacacg 
gggcctagct 
ttcagtgctg 
accctttccc 
gcagataagt 
tatccacagt 
tccagagtca 
ctaccaccac 
ccaccaccac 
anttgaggct 
nnnnnnnnnn 
ttacatttta 
ccccttgaac 
cagtcacatc 
cctcaactgt 
gcatggagac 
ttggtgggac 
tcatctggca 
attcctgtgc 
agaccactgg 
cccctccgct 
tcactaactt 
catcagctat 
agaagactcc 
tggctctccc 
tgttctcatc 
agaaatcaat 
cagcctccct 
ctctggtccc 
tctaccagga 
cgggggtggt 
ggactgaatc 
ccaagagcta 
caggaataac 
cacaggacac 
gccaggggca 
ctaggctcca 
aggttggcct 
acatccaagc 
gagacaaaca 
tgaatatgcg 
aaggggttca 
tttccctcct 
aacaccccca 
cagccctgga 



29220 
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29340 
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32460 
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32700 

32760 
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ggctgaactg 
ttcctttgac 
agggcagcca 
atgacctcat 
aacctagcag 
tctgaaaagc 
tcactgctgg 
ggtactgctg 
cctttcccaa 
ctaatagagt 
ggttcctaaa 
tgctctctgg 
tgtcttcccc 
gcctgcactc 
ccagcaggct 
tgtgattgca 
tccaagacca 
tcttcattgg 
ggggtaggtc 
tggagagact 
gctcactgca 
tgggactgta 
tggcccagtg 
cacatagcaa 
ctnnnnnnnn 
nnnnnnnnnn 
catgccttta 
cattgtctac 
ataaataggg 
ccccagcaac 
tcaaggtgga 
agtatgctcc 
ggtaacatgg 
ttcagtggcc 
tttatgctgt 
aataatttaa 
acatgtgtgc 
gggttgttag 
ttaatcactg 
ggctcagagg 
ccacatggtg 
agacagctgc 
ttgttttttt 
tgaaactcgc 
ccaagtgctg 
caccaccacc 
gaaaatgaca 
tgcataaaca 
ggtttggatt 
agtagaactg 
ccaaggggaa 
ggctgcttca 
gctgttgatt 
tgttagtaat 
tcctcttcct 
gcagagcccg 
aacagcttca 
ggctacactg 
tctgtagagc 
gggcccccgt 
gcgggaaggt 
cattccactc 
tcttgctgaa 
attaacacca 



atgggcagct 
tcagaccact 
gaccccacaa 
ttgtctccca 
gcctcaggca 
agcgatccga 
tggaagcaac 
agtccagccc 
caccattcaa 
cctgggcctc 
ctgcaggggc 
ggtgcccctg 
aggctagacc 
tgaggggctc 
tcagattaca 
tcattgtctg 
cacgttggcc 
gaccttcatt 
atgagtctga 
gtgagccagc 
ggtgggagca 
cctgacagct 
cttatgagca 
ggactcgaaa 
nnnnnnnnnn 
nnnnnnnnnn 
atcctagaac 
atagcaagtt 
ccagttggca 
catgtaaaaa 
gacagacgga 
cagccagtga 
tggctcagtg 
actaaaggtg 
ggtacacaca 
aatgttatgt 
aatgcctgtg 
ctaccatgtg 
agccatctct 
ttaagagtat 
gctcacaacc 
agtgtactca 
tgtttttcga 
tctgtagacc 
ggattcaggg 
ccaccaccac 
cctagcattg 
cacataaata 
gtgtcccccc 
ttagggttta 
tctctggctt 
ccacatgccc 
tccccagata 
tgtgtgtgga 
catccacctg 
ttcatgttac 
aacaggagaa 
aagctaccgg 
actgagacat 
gtccgtgtcc 
gaggcggacg 
gaaagcagcc 
gtcgggtgat 
cacacacaca 



aaggtccaga 
ccctgccgtg 
accagctagg 
gctcttttgg 
gcccactgct 
atggagtcag 
ttcaggaaga 
cagagcttag 
aaccccagtc 
tgtcccccca 
tactataacc 
ccactcccag 
aggcgctgcc 
caggaagcag 
gctactcttt 
gtgttaactt 
accctcctct 
gggacatgat 
acttgaacat 
cggtatggag 
aggtgattct 
ctgaaggttc 
ctggctgctc 
ctgcttggaa 
nnnnnnnnnn 
nnnnnnnnnn 
tcaggaggca 
ctaggctagc 
aaatttagct 
gaagcaaggt 
tccatggggc 
gagactgaaa 
agaaaagata 
gaaggagaga 
cacgcacaca 
gtatgggtgt 
aaggctagaa 
gattctggga 
ccatcccctc 
tgactgctct 
atctgtaatg 
tatacataaa 
gacagggaga 
aggctggcct 
tttgcgccac 
actacctgac 
tcctctggtc 
cataagtaaa 
aagggaaaaa 
ggcctggtgg 
catcttttcc 
agaagcaagg 
tttgtagtag 
ggtgccctcc 
cactccgaga 
aagtatgtaa 
ggaagaagca 
ccgccttcct 
taagtactac 
ctcagaacaa 
ggctggaggc 
ctgtgttcca 
tcaaacaacg 
cacacacaca 



cagtggctgg 
gcttacatca 
ctaaatggtg 
taagagagaa 
ccttgctgta 
gctgagctgc 
gcccagccta 
ctccctgctt 
cagcctctcc 
attctctctc 
ctacctccac 
tccctctagc 
ttacctgtgg 
tgaggggagt 
tcttaaagtg 
taacccactg 
cccaccacat 
ggctaacttc 
ctgaaaggat 
atgctgggtt 
tctcccctcc 
caaaggacag 
tcgcagagga 
ctccagctcc 
nnnnnnnnnn 
nnnnnnnnnn 
aagcagatgg 
caggtacatc 
tgccctccta 
gtggtggcac 
tcactggcca 
aataaataaa 
cttgtcatac 
accaactcca 
cacacttgtt 
tttgcctaca 
gaggacatca 
acaaaaccat 
aatagtatat 
tccagaggtt 
gaatctgatg 
gtaaataaat 
cagggtttct 
tgaactcaga 
cgccaccacc 
tatttaactt 
cctacatgac 
cttagtctgg 
ttgggtccca 
gaagtggcca 
ctttgctttc 
ggagcagtca 
taagagacag 
ggggacgggg 
acaggaaatg 
attcataagg 
aacattaagg 
ggaatgtata 
tggggtccag 
agaggaaagg 
ctgggctggg 
cttgggtgag 
actgtagctc 
cacacacaca 



ctcttggaaa 
ggaggtgcaa 
cttattgttc 
tgagaggaag 
agggaaccag 
aggaagctca 
tgggactata 
cccaccaccc 
tactggtcta 
ccctctcatc 
ttccttgcac 
cagggagcct 
ttgcggcagc 
agctgcctct 
acctgactcc 
ctgcccttcc 
ctcccttgga 
aggggcactt 
tggctgagag 
cttccaggcg 
tcacctggaa 
tggggtgggg 
cctgagttct 
agagaatctg 
nnnnnnnnnn 
nnatgtcagg 
acatctgaat 
ataagaacct 
acacaagaac 
ttgcttgtga 
gccagctagc 
taaataaggg 
aagcctagca 
aagaattgtc 
acatgcatat 
tgcatatctt 
gatcccctcg 
gggttttcca 
atttctgggg 
ctgagttcaa 
ccctcttctg 
aaaccttttt 
ctgtatagcc 
aatccgcctg 
tccaaggctg 
ttaaaggcag 
cccatgtgca 
ttgttttgga 
gagtagtact 
tggctagaga 
aggtcctaag 
tggctgggac 
gtgaggaacc 
gccctcctgg 
gtgactttgg 
accagtttct 
aaaagctctt 
atcagcttcc 
gattctgcct 
ttggttaagg 
gctgcttcct: 
cttcacgggt 
tgtggagatt 
cactccctgt 



gcctgtctct 
gggctgcagg 
gcaagaggcc 
ctggacagag 
caccgatggt 
ccttccttgc 
gctcctccgg 
accaccacat 
cagtgagcgg 
tgttcacctt 
ccctcttttc 
cttccatatc 
ttctctcaca 
caaccagcgt 
atttggaatc 
gccatgtggc 
tctttatctc 
gggccagcct 
gcaggctgca 
cttggctctg 
aatgaaggaa 
actagagagt 
gttcccagct 
acgctatctg 
nnnnnnnnnn 
catggtggtg 
ttgaggcccg 
ttctcaaaaa 
ccaaagtcaa 
tccagcattg 
tggtctactt 
gttaggaaga 
accctcaatt 
tcctgacagt 
gcatacaatt 
tctgtgcacc 
gagttacaca 
aaagaggctc 
ctggagagat 
ttcccagcaa 
ctgtgtctga 
tttttttgtt 
ctggctgtcc 
cctctgcctc 
ctgctgcggc 
ccatctcatg 
aacacatacc 
aatgtgctat 
attggaagag 
gacatgccaa 
acagtcacaa 
cgctaatgca 
ccacagcaag 
ggcaggacgt 
cagagcttaa 
ctccatatga 
ttattgcaga 
ctctgggggt 
atgaagagga 
tgatagtcta 
gccccctctt 
ttgccagtaa 
cagagattcc 
ttgtgtaggc 
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35340 

35400 

35460 

35520 

35580 

35640 

35700 

35760 

35820 

35880 

35940 
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36240 

36300 

36360 

36420 
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36540 
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36660 
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36780 

36840 
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tgattttcaa 
cactctgtga 
acctggcctg 
acacacacac 
taatggtgtc 
cagcaaggtg 
catgtcctgc 
cagcacagct 
gctctcttct 
catacttcct 
tgtgagccag 
gctgtaacgg 
tactggcagt 
aatgaacaca 
aagcaacttg 
actaagggga 
ttgctccccg 
tgacactgcc 
acgggtttgc 
aaatgactct 
aagtggaatg 
cattgaacca 
ctgtggacac 
nnnnnnnnnn 
nnnnnnnnnn 
gagagagaga 
attccaacac 
taagtttcag 
cacacacaca 
tggggaaaca 
gccctcactg 
gaactcagaa 
tccccccttt 
ctgggttatt 
ttcatcagtt 
aactggtgtg 
gagcagtctt 
acaataaaag 
cctcgcctga 
ctccgcatgc 
taaatacttg 
caaagttcgt 
agattttggg 
gtaatggtgg 
tgagttcgag 
aaacccctgt 
aaatgttttg 
aaagtttagg 
gtgagacttt 
agagtccagc 
aaaggcattt 
gagtgtgctg 
ctcacataca 
gggtcaaaag 
gtgccacact 
tgagaccccg 
tctgcaagat 
caaacctagg 
agatgaggag 
gcgcttctgc 
agctcgggac 
agcccafcagg 
acgaaccgag 
agccttgtct 



gaaagcaagc 
caggctcagt 
agagaatgtt 
acaccaagga 
acatgttccc 
gcaggctcca 
agagacttgg 
tggatagtct 
atcatggtca 
gtcaagggca 
aaaaaacaaa 
ggccccagcc 
ccttccatga 
cgtttgtttg 
gagaggaaag 
ctcagggcag 
tggcttctta 
cattgtgggc 
ccagaggcca 
aacttgtgtc 
tcccccagag 
gaaatgaaca 
agcgactggg 
nnnnnnnnnn 
nnnnnnnnnn 
gagagagaga 
ttgagaggca 
aatagccagg 
cacacacaca 
tctgagcaga 
gaggtgtctg 
ttcagtttca 
atctaaaatg 
tgcatctaca 
tcctataccg 
gtgctacttg 
gactactcag 
aaaagaaaag 
aagcctccac 
accatggctc 
taatgattag 
gcacattgac 
atgcttcatg 
catatacctt 
atcagcctgg 
ctctaaaaat 
taggctgcct 
ggactctaca 
gcagtgggca 
tttccataaa 
gcagtctaag 
actccacaaa 
tgtccacatg 
cagcaagaga 
ctgagcacag 
cctcactatg 
gagcccatca 
caggcaggat 
ttcacatcaa 
cagcacctga 
cccagtgagg 
aagaacaatg 
gactgcacat 
gacagcatgg 



tagaagtgga 
gacctacctg 
ttgggctgtt 
ctgagtctaa 
tgagccaccc 
agactcagcc 
taagttcgcc 
tcagttctcg 
agttctgggt 
gctgggcaat 
acaaacaaac 
ttgtgctccc 
gtagggagtt 
tggttccgct 
ggtttatttc 
gaactgaagc 
gcctgctttt 
tgggcccccc 
gtctggtggg 
aagttgacca 
actgcctgcc 
ataaaatgga 
cttcccagag 
nnnnnnnnnn 
nnnntggcct 
ggataaaagg 
gaggcagggg 
gctacacaga 
cacacacaaa 
ggaagaaatg 
cacaggagaa 
aagcactgac 
agtaagaacc 
aaagaggtct 
ctgactacac 
ctcatgatcc 
tgaatttgag 
gtggcgagtg 
acggaagggg 
gtgcaccccc 
tttctgaaga 
ctgtcgaccg 
ttgcagctac 
tgatcccagc 
tctacagagt 
cactaattta 
taacaaagcc 
agcatcctca 
aggttcagct 
acctgtgcag 
cccggtgaca 
tttgtcctct 
cacacatgca 
tgccctgtga 
gtggtactga 
tagcccaggc 
tgcccaccat 
ggtggactcc 
tctcccgtat: 
ccaatgcaga 
gtgctggggg 
tcagctggcc 
gaagggatcc 
gaggggaggc 



gtacctcaca 
agaacttata 
ataggacaca 
tgggaggtgg 
tacaaagaaa 
ccgagcgcaa 
ttcgaagctc 
gtcgatgtca 
cacggtcagg 
ctaacccaac 
aaaaaaacac 
cagcctcagc 
ttcttctcag 
acggtttcta 
atctgacaat 
ggaagccgtg 
ttatgctatc 
cacatcaatc 
ggcattttat 
cacgaatcag 
agcactgctg 
tcctttgaac 
cccccatgga 
nnnnnnnnnn 
catcaggtat 
ttagcccagt 
gagctctgtg 
gaaaccctgt 
taagatcttt 
gggtgcgcag 
cacttgcact 
aggagcagtg 
aaaaaaacaa 
tggggaggaa 
aggggctgaa 
caacacttgg 
gctagcctgg 
aggtggccca 
aaagccagct 
acacccaccc 
acaatatttt 
tgtagtacgg 
tctgatgaac 
acttgggagg 
gagttccagg 
aaaaaaaatt 
caatggccat 
ggaaggccac 
ctttatgtag 
ggcctctagg 
tgagctcaat 
gatctataca 
tatacatgcg 
aaaacgtctc 
tatcgttcct 
tggcctggaa 
gttattgaag 
tgtaatctca 
gcgttggagg 
tgcagatgct 
gaggactgga 
aaaccaccca 
atggctccag 
cattggtcct 



gtgacttgtg 
gccaagatgg 
tagagataca 
ttcttcattc 
gccacaggac 
agtggccttg 
agcttcagct 
tgaatgaact 
ggcaggaaga 
agagatgcgc 
caacagctgc 
ctgggctgta 
cctaaaaccc 
ttgtgataaa 
tcgcagggtg 
gaggaacgct 
cagaaccact 
actaatctag 
caattgagtt 
ggcctggttc 
accatttgct 
agatgtgttg 
atcannnnnn 
nnnnnnnnnn 
cagagagaga 
ggtggtggca 
ggccagtttg 
cttgaagaga 
aagaagaaaa 
gacacccacc 
cagcttgccc 
actggggacc 
aagtgtttgg 
acccagttct 
ggtaatctca 
aaggtaggtc 
gctacatgaa 
tcaggtaaaa 
tctacatgtg 
acccacccac 
cgttgatctt 
gatccgctcc 
agtgttgctt 
cagaagcatg 
atatccaagg 
cctttctaaa 
tcagagaagg 
agaaagcaga 
gaagaagaga 
caaagccctg 
ccttggaacc 
tgtatgcacg 
catgcgtgca 
attcagtctc 
gattgatcga 
ctcacagtga 
caataccatg 
ggacttgaaa 
ctggagtggc 
cacagccaac 
ggagctgaga 
gagctcccag 
atgcatatgc 
gtggaggttt 



agctatgagg 
ctgaagccag 
cacacacaca 
ccctcccctg 
tcagttctgt 
caaacatact 
tggggacagt 
cagaggtgag 
agatgatgct 
acaggttagt 
cttcccctct 
ggctactggt 
acagaagttt 
acatgactga 
tcttctcatc 
gctttctggc 
tgcccaggag 
aaaatgaccc 
tcacccttcc 
ttaggagctg 
ttgtatagag 
atcctagggc 
nnnnnnnnnn 
nnnnnnnnnn 
gagagagaga 
cataccttta 
gtttacagag 
aacacacaca 
gaaaggatag 
ctcagaggag 
tagggcgtca 
ccaggttgaa 
gatttggaat 
acccctggaa 
atgttttcat 
agaagttcaa 
aactcataaa 
gtgccaacca 
gtcctctgcc 
ccacatgaca 
gtttagggaa 
aggaagctaa 
tctgggccag 
tagatctctg 
ctatacagaa 
cctatataac 
ctcaaaagag 
gcctgggcca 
gtcaacagtc 
tgttaggggc 
caggtggaag 
tgcacgcaca 
cccacacaca 
ccatcatcca 
tcagttgatt 
tcctcttgct 
ctctataaag 
agtagaaggg 
tgttccctgg 
catcagactg 
cgggattgca 
ggactagacc 
agcagaggac 
gatgccccag 
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40380 

40440 

40500 

40560 

40620 

40680 
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tgttggagga 
agaggcaaaa 
agaaaaaaga 
acccccaagc 
atacgtgcaa 
gtgcacatgt 
ccaacctaaa 
tgtaacatta 
aaggcagcca 
gctgcagggg 
ctgtaactcc 
aggtgtgcac 
catgtggatc 
ggtggtgcac 
aagaccagcc 
gtctcaaaaa 
gaggcagagg 
ggacagccag 
acagaaaact 
cttccagggg 
acagacatgc 
aactaataat 
tagcacttgg 
agtgagttcc 
ataaaaaata 
aggcgttcca 
tccctcagct 
tctgtgagtt 
aatagaaaca 
gactcagtgg 
aaaatttatt 
gggagcctag 
aattaagagc 
actgtagctg 
agccaccatg 
taagtactga 
ctgagaccaa 
ttctccagtt 
gagtagacca 
cagatgggaa 
tggtgctgtg 
tgattattgg 
tggggtttgc 
tcttcgccat 
cagtcctggt 
ctattaatta 
ggaactctac 
catgtgtgac 
ccaatgttct 
ctgatagggt 
cattctagct 
ctcctggggc 
cacagatgca 
agatcaccct 
gacaacactg 
ttgctgcgga 
gagagagtca 
ggtctcagga 
ggattcaatc 
tttttgtgac 
tgtttgcgct 
ctcatctcct 
cctcagtagc 
cattttctcc 



tgctggagcg 
gggatggggg 
tgtctctgaa 
cccacagggt 
tccatggcat 
aattccatca 
gacctacacc 
aagttagaaa 
catcaaaaca 
cctgaattct 
agctccctgg 
acatacacat 
tgggaggagc 
gcctttaatc 
tggtctacag 
aacaaaacag 
caggtggatt 
ggctacacag 
ccagcccagt 
atccaatgcc 
acataggcaa 
ttaaagaaaa 
gagacagagg 
aggacagcca 
aggccaagta 
tagggctagg 
gcatgtggta 
caaggccagc 
tcntacctag 
ctaagagcat 
taaatgttta 
gtttgagtcc 
ataagaactt 
tcttcagaca 
tagttgctgg 
gccatctcta 
gaactaaagc 
ctgatcagct 
gaggccccat 
tttcagacca 
gctgccacag 
ctatgccagg 
cctgtctgag 
gtgaggctcc 
gctggagtct 
attaattaat 
ccatcataca 
cgcatgcatc 
acttctaaat 
ccttgcctgg 
gtctggaggg 
tgcacctgaa 
tgatgggaaa 
ctactggaaa 
gcagtccaac 
accagacaac 
gagacagaga 
ctgagggtat 
tgaacacagg 
acacccctaa 
tatgtctaca 
ctgaaaccac 
aggcagcaga 
cacccttaag 



gtggggcagg 
agaaggcaga 
agtaaaaagt 
ggaaggagag 
acgcagaaac 
ttcaggagac 
caaagaagga 
tctctcttca 
gtgtaaatca 
gttctcagaa 
gatccaacac 
caataaaaat 
tgagaggcac 
ccagcacttg 
agttagttcc 
ccgggcggtg 
tctgagttcg 
agaaaccctg 
ttccagcaca 
ttcttctgac 
aacactcaca 
aaataaggaa 
caggcggatt 
gggctacaca 
attcttggaa 
aatgaggctt 
cctttaatct 
ctggtgtaca 
agcccngcca 
tgactgctct 
ttacttgtat 
cc'aaatcacc 
ctttttaaag 
caccagaaga 
gaattgacct 
cagctcttat 
tgggagattg 
cccgtaccag 
ctaaacagcc 
atgtcatttc 
ttggtgtggc 
aaccagtctc 
gccatgggac 
ctggggtcac 
actgagattt 
cacgtgacaa 
tggcgactat 
agccctgact 
gcatgcacca 
catgcatgaa 
ccctcaatcc 
aaaggggact 
gccttctgga 
ggtggtttca 
agacagacga 
aaggtacgag 
cacacagaga 
agtctactgt 
aaaaaagggg 
agtcactgcc 
cagtctcggc 
tgcagcaagt 
gcagagctct 
taccctatct 



agtgggtggg 
tgggatgggg 
acttgtcact 
aaatgagtcc 
gcaaagacag 
agaagcagca 
caaactataa 
cactgtgtag 
acgagaaaac 
cctgcatcaa 
ccatttctgg 
caaaaccagg 
tgggtgaata 
ggaggcagag 
aggacagcca 
gtggcacacg 
aggccagcct 
tcttgaaata 
cacagggtga 
ctctgtgggc 
aaataaaata 
gccgggggtg 
tctgagttcg 
gagaaaccct 
gaatcccaag 
agcgagagca 
aggctctccc 
tagctagttc 
aanaaagggg 
tccagaagtc 
tattatttaa 
aagaaaaaat 
aattcttatt 
gggcattgga 
caggacctct 
caggttgata 
aaaaatgcag 
gggtctaacc 
tgcctgcagc 
ccaggacatc 
aggatcaggg 
tcaagcagca 
tcttctgttt 
ccagccgtcc 
accattaaac 
ccccagcgtc 
gaagaacaat 
gctaaaagtg 
aacgcctccc 
gctgtggaca 
actgtgttcg 
gcttaagagg 
agcaactggc 
gtctaggttc 
tctaaacttc 
ctactcccat 
gagacagaga 
agagcatttc 
gggggggact 
tacttccctc 
tcccacttcc 
gacttgtgtt 
gtcttctcgg 
tctctgtctc 



taggtgggga 
gggttgtgga 
aagcatgagg 
cacaagttat 
acaatgagtt 
gagttgttgg 
ggaaaaaggt 
atactgtaca 
cagaaacaac 
gccaagagaa 
cctccatcag 
gatgaagggg 
caataaaaaa 
gcaggcgaat 
ggtctacaca 
cctttaatcc 
ggtctacaaa 
aataagcatt 
ctcacaacat 
accaggattg 
aatctagcaa 
gtgtcgcacg 
aggccagcct 
gtcttgaaat 
gggacactaa 
acttcgctgg 
gaagcagagg 
caggacagaa 
agacctgaga 
ntgagttcaa 
atttaaataa 
gttatcattg 
tattttatgt 
tcccattaca 
agaagagcag 
aaatttaatc 
accaccaagg 
aggcctgtgt 
agctcctctc 
aacacagcag 
gctggcattg 
gctcttctcc 
gatggtcgcc 
ctgctgcctt 
agcaacgttt 
catatgggtg 
gtgacagaaa 
gacaagcccg 
acaggaccag 
cgaggcatta 
cgctgttcca 
gctcctacca 
tgccaaaggc 
tgtgggattc 
caaggcacag 
acaacataca 
gagagtctaa 
cccagcatat 
tcgattatct 
accgccatga 
tcctcccctc 
gactgccaca 
agcttcttct 
tgcttgttga 



gcaccttcat 
ggggtaagaa 
atatgagtca 
tttctgatct 
gggtgtggtg 
aaatctaagg 
ggtcgaccaa 
aggaagagaa 
tcaagagaag 
tcaaaactgt 
catcactcac 
tagggaggtg 
tttggtgcat 
ttctgagttc 
gagaaaccct 
cagcacttgg 
gtgagttcca 
tgttgctgtt 
cataactcca 
catacagtgc 
aaaaaatttt 
cctttaatcc 
ggtctacaaa 
aaataaataa 
gtgtatataa 
tgtatgaaag 
cagaaggatt 
agggcgatat 
ccagagagat 
ttcccagcta 
ataagtaaat 
ctaataatca 
atgtaagaac 
gatggttgtg 
tctgtgctct 
tcgtggagcg 
ccctgctcat 
ctgcttccct 
tctaggtgga 
ccaaatttat 
gcacagtgct 
tatgccatgc 
ttcctcatcc 
gactccatgc 
ctctaaaata 
tggaaaatga 
atgctaacat 
aagcgaaagc 
aggtgcagct 
ttcgcaagaa 
gcaccagtgc 
agcctactgc 
tctggacaag 
caggaaatta 
ctggtagaac 
aaaaagcaga 
agagagtcag 
acaagaccct 
caaattctcc 
agtaaagagc 
tgcttctgtg 
cggaaactct 
ctcttgtcgc 
tccttggacc 



40740 

40800 

40860 

40920 

40980 

41040 

41100 

41160 

41220 

41280 

41340 

41400 

41460 

41520 

41580 

41640 

41700 

41760 

41820 

41880 

41940 

42000 

42060 

42120 

42180 

42240 

42300 

42360 

42420 

42480 

42540 

42600 

42660 

42720 

42780 

42840 

42900 

42960 

43020 

43080 

43140 

43200 

43260 

43320 

43380 

43440 

43500 

43560 

43620 

43680 

43740 

43800 

43860 

43920 

43980 

44040 

44100 

44160 

44220 

44280 

44340 

44400 

44460 

44520 



80/127 



BNSDOCID: <WO 0229059A3_IA> 



WO 02/029059 



PCT/US01/31488 



cttttccttt 
cgcccccact 
ctctagctcc 
aaaatattca 
tgctatgact 
nnnnnnnnnn 
nnnnnnnnnn 
agagcagcca 
accaccacca 
ggttggttag 
agaaactcct 
aaatgtgaca 
acagtctgta 
cctttgcaag 
tatcagattt 
ggaaagccta 
aacagcacac 
gagtacgata 
tactgtgtag 
acagagtcat 
ttcaaggcta 
tggacatctg 
aagcatcacc 
aggtggggtg 
gtctactcca 
tctagttttc 
caggaatgca 
ctttcctttt 
tatgtgtacg 
tctgggttac 
aatcaccccc 
acagacaagg 
acctgagttc 
ttgtcctctg 
gaataaataa 
tgggcaacaa 
gaaaaaacaa 
gggataaaaa 
tgtctacaac 
cctggaggta 
ggagtactag 
aaatttagaa 
attgaggttc 
agcatattgg 
catattcaga 
aaaaattctt 
cagctacagt 
tcaggctggt 
gttcaaat.cc 
cttctggagt 
aaaaaaaaaa 
gtaacccaac 
aaagagaaca 
cgagacacca 
agctgtgtcg 
tgaaatccaa 
acgccagagt 
tgcacaggag 
caattcccag 
aatgcccttt 
tgcatacata 
cgggcgtggt 
tgagttcgag 
aaaccctgtc 



ctatgaacaa 
gtggagaaaa 
atgacttaaa 
gtcaactcct 
cccagatcca 

nnnnnnnnnn 

gtgttcttaa 

ccaccaccac 

tttagtctgt 

gaaagtgaag 

tgtaaataca 

gcctgaatta 

gagaagggca 

tattgctttc 

atcttcatag 

gtgtatgagc 

gcacgtgtat 

gtctcagggg 

cctgccagcc 

actcactgac 

tgctggtagg 

aaggagttcc 

ggtgagcctt 

tgaccaaaat 

tttctttctt 

agagctctgc 

agattttatt 

tatcagtatg 

aggagtttcg 

acccccaccc 

gcctgcaaga 

actgtctgga 

ctctccccat 

aactaaaatt 

taacaccgag 

acacatttcc 

caccctgaca 

atgacctgtc 

gaagctgata 

cctaggggct 

aatgcactac 

cttcttccaa 

gattatagat 

aattaactca 

acaaccatcc 

gtacacacat 

gagatggctc 

cagcaaccac 

gtctgaagac 

aaactatgaa 

agaaaaacac 

gcagaggatg 

aaatcacaat 

ttaaatacca 

gctactgggt 

taactcagag 

agatggctca 

cacccacgtg 

tcacagtacc 

cacaggcaaa 

ggcgcatgct 

gtcagcatgg 

ttgaagaaaa 



aatatctcct 
cccagggcct 
gcatctctgt 
tgccattcaa 
tccacttggt 
nnnnnnnnnn 
nnnnnnnngg 
ctgctgagcc 
caccccacca 
gagacaggag 
gcctacaaaa 
tgctaaaata 
cagacacgac 
gaagcttcca 
tctttctgtt 
cccatgtatg 
accacagcat 
agagttcaga 
tggaacttgg 
cagctttctc 
tctgagttat 
aacaccatcc 
accacctcaa 
gtgatttatc 
ggtgacggtg 
ttttcttttt 
caagtgagct 
tcgatttatt 
ccatgggtat 
agctgtcctg 
agccttcgag 
tggctcagca 
acccgcgtaa 
gtgtgccatg 
agcttagtaa 
agtagaaagg 
tgtacatcgt 
aaaattaacc 
acccagggaa 
cagaagccat 
gtactgccca 
aggtttgcac 
aaggacttta 
attctcataa 
atgtgaacca 
ataacaaaaa 
ataaataaat 
agtgggtaag 
atggtggctc 
agctacagtg 
gaactatgaa 
ttgcctggcc 
agggcatcct 
gataaaagag 
cgcaggaagt 
ggactgaggg 
agcagcttgg 
gtggttaaga 
ggcacttaca 
aggcatgtac 
acttacataa 
tttaatccca 
tctacagagt 
taaaaaaaaa 



taaaggatct 
tgcacatgct 
gctgtcaaat 
aatggatgac 
agcccaggaa 
nnnnnnnnnn 
ttactgggat 
atctctccag 
ccaccccgcc 
ctgtcccttt 
cagccacact 
attcgttaag 
acgagccatg 
tccttgctag 
tttctttatc 
tgagcacttc 
aggtgtgtga 
ggagaactga 
gctcagcctt 
tttttctctc 
cagaactgct 
ccactcggct 
ccagcaaata 
gttaggatct 
gctgaccaaa 
cttttacccc 
acattcccag- 
tgtctatgtt 
acagaaacct 
tgggttctca 
ttcaattctc 
ggtaacgaag 

agggggaagg 

gaatgaacag 
cttttatgtt 
caagcatgta 
tctgggagtc 
tggggaggaa 
gtcagggcag 
ggagctgctt 
cagtgggcta 
acaggccaat 
gcttgcatta 
aaaaaagaca 
gaagctctga 
tctgatgccc 
aaataaatat 
agcacccgac 
agaaccatcc 
tacttacata 
ctacaagaag 
tgcgtttggt 
gacttgagtc 
atcaacaagt 
gaagaggagc 
aggacaacag 
gtagctttaa 
gcattactcc 
atcatccata 
acagtgcaat 
aatactaagc 
gcacttggga 
gagttccagg 
aaagaaaaaa 



cttctagttc 

cagcaggagc 

atacacttcc 

ctcaaagcca 

tgaactcann 
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ttgaactcag 

ccaccaccac 

ccacctgctc 

tctatagtgg 

tatttgttgg 

tcagtgaaca 

acagaggctg 

caaatctttg 

atatttgttt 

agcatatgtg 

gtaccacagc 

gagagtccgt 

ggtggcaagc 

tgttatgtct 

tgtgagagca 

tggatgacga 

tttacctcct 

catgggagtg 

aagaaacagc 

acggtactaa 

atctgttttt 

tgtgtgtatg 

gagaaggcca 

aatgcagcag 

atcatcacaa 

ctcgtgtcat 

gaagaatcaa 

ccctcccaaa 

gaaagtggtt 

tgtcactgaa 

tgagttaggg 

gctgtttatt 

aaattcaaac 

ctggcttgct 

ggctctccca 

ctggtagggc 

tgttgacata 

tttagattcc 

aagtaagagt 

tcttctggag 

ttaaaaaaat 

tgctctttcg 

gtaataagat 

taataaataa 

tcaggaatag 

ctctagcacc 

aagtgacaag 

tgggctttat 

tgagtctggt 

ctagctcaag 

tgagactctg 

tcttgcaaag 

actttagttt 

tacatacata 

agataaatct 

ggcagaggca 

acagccagga 

atcttaaaag 



agggtccccc 

tccatccagt 

agcccttacc 

gagtcagcgg 
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gacctttgga 
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attcctgatt 

aaggtgaata 

aaaatactgt 

accttaaaac 

aaataagacg 
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atttgttggg 

tgtgaacacc 

acatgtgtgt 

cttttcctcc 

tcctttatcc 

atccactctg 

ggagtaactt 

aggggaaaaa 

atacatggat 

attacagctg 

tacacctggc 

ggattgaacc 

ccatttcttt 

tgtgtatgct 

gaagagagtg 

caccaccacc 

aaacacacac 

aagcctgaga 

ctctatgatg 

cacacatcaa 

tttacatgcg 

cagcattgaa 

tttctatgct 

tcagctttta 

aggtcagaag 

ctagcacaca 

cagcaatcat 

cattttctca 

aaaactagcc 

cacataacac 

taaaactatg 

tgtctgaaga 

atatgaaaaa 

aaagtccaga 

ctgactccct 

ataaatctta 

ggctgggggt 

accaacgtag 

tgataatcct 

ctgaataaag 

aacacaggcc 

gcccacctgg 

cccaggccag 

gacctgagtt 
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catgcatgca 

taaaagaagc 
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ctacacagag 
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gactggagag 
aggacagcca 
aacaacaaca 
catgatggtt 
gcatatatgt 
aggaggctgg 
acatgaaaaa 
tgtgaatcat 
cactccccag 
tgggaatgta 
gattgtggtg 
ggttcagtga 
ttggaatgct 
ttgtcttaac 
cttgtgaaaa 
acataggatt 
agagctggat 
gttgagaaca 
ggttttaatt 
atgtctgtgt 
cccggaagta 
aaagagccac 
tccagaggtt 
ggatccgatg 
tcaaaaaaga 
ctgtccctta 
tacacgtgct 
gcacacaaga 
aatcgggagc 
atgaggatgt 
aattgtacac 
acagacatgt 
agccaggtgg 
gaggctggag 
tcagttccca 
tctgatgtgt 
ccatgataac 
aaggagtaat 
tgttttttct 
tgtcctatgc 
ctttccacag 
agttctgagc 
ttctttcttt 
tttctttgta 
tctgcctgcc 
ctgggctcaa 
attactttgt 
ggctataata 
agtcctgctc 
acacagaaca 
ggaaagagaa 
nnnnnnnnnn 
cagctactga 
ctcatcttgg 
cccagcagcc 
gctcctgctc 
tctcaattag 
cacccccacc 
cttacccttc 
attgccacac 
tctcaccatt 
ccagcccggt 
cagctgtcag 
attagaaacc 



atggctccac 
ggactataca 
acaaaaccac 
cacaaccatc 
ggtgtacata 
gtttgtagcg 
aaaaagacat 
aagctctgaa 
cactgcaaaa 
actcagccga 
gtgcatacct 
catccttggt 
gtctcaataa 
ctctgacctc 
cctaccccag 
tttatactag 
atggtggtgt 
gctagactat 
tttttctttg 
atcacttgtg 
ttattacaga 
ctcgggctgg 
tggagatcaa 
ccctcttctg 
aaaaagaaac 
atgttaafcaa 
aaagcaagtg 
cttttctgct 
tgaggaagat 
tctagaactc 
tttaaatgaa 
gtgtgtgtgt 
ttgtggctca 
agatggctca 
gtaaccacat 
gtctggaaac 
attgaagcta 
gttagctttt 
tacccctctt 
agcccaaatg 
gttatgttag 
tcaattcctt 
ctttctttct 
taccctggct 
tctgcctccc 
ttcttaacat 
ttctgatctt 
tagactctcc 
ctatattttt 
tttttttttc 
atgagagaan 
nnnnnnnnnn 
ttcctcctcc 
acttctgcgc 
ccccagttct 
tgattggtgt 
ggaccctcac 
ctatacgcac 
taatcctccc 
ggacacacag 
cacaagcaca 
gccaagtatc 
cctggccccg 
cgctagctcc 



agttaagaac 
gagaaaccct 
ttgttcttac 
agttgcaggg 
catgtataca 
cagaggtaga 
ttagattcct 
agtaagaata 
gatcatgaaa 
tgcagtgctc 
ataagctcac 
cacatagaga 
ataaaagaaa 
tgtgttcatc 
agtctctgtt 
aaaaataaaa 
ctagttatag 
acggtctcaa 
tgtttgttac 
tgcctgaaac 
aggttgtgag 
agagatggct 
atcccagcaa 
gtgtgtctga 
agccacctct 
cacaggtata 
tgaaccttaa 
caagagtatc 
atggggagtt 
aaaaatggtg 
tgcaataaaa 
gttaaaacat 
aaaaataatg 
gcggttaaga 
ggtggttcac 
agctacagtg 
gacttgctac 
tgtcctaaat 
ttactttgct 
ccatcttcct 
gagactggag 
ttctttcttt 
ttctttcttt 
gtcctggaac 
aagtgctggg 
"tgtggagaga 
ttgctgcagt 
ttgcaatact 
tcctaagctt 
ttgtctatag 
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tccctcctcc 
atgtcctcag 
cttcctgaaa 
cacctgctgg 
cctctgatat 
atacacactt 
caccggacac 
gagatctcat 
ccacttctgc 
cccaactaca 
gctggcgggc 
ctaattgcca 



acttgttctg 
gtttcgaaaa 
agaggacttt 
atccaaggtc 
ctcatataca 
ggtaaaaaga 
gcatcacacc 
agcctagtat 
ccagaaatgc 
acctgtcgta 
tacgtggaaa 
atttgaggcc 
gaaagaaaaa 
aagtctcctc 
cagaggaccc 
tgaataagta 
catccagaac 
caaacaaaag 
ctaacgtgta 
ccaaggaagc 
cagccatgtg 
cagtggttaa 
ctacatggtg 
agacagctaa 
ccactctccc 
tgatactatg 
atacattatg- 
tgtatgaagt 
ctaatggcta 
ataactcagc 
cttgtctcag 
ttcttggcat 
ataataacaa 
acactgactg 
agacatctgt 
aaagtcattg 
tactgctgag 
ttgtttcctt 
ttcccctctc 
gcctcagcct 
tctgctcagt 
ctttctctct 
ctttctttct 
tcactttgta 
attaaaggtg 
aaagtattgt 
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ttcctccctc 
cccagacctt 
cttctgaggt 
ataggcttgc 
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agctgcatca 
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caaggagatg 
ctccagcttc 
tccccaaaat 
gcctgctcct 
gttctgagct 



aggtctacag 
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ggtttgattc 
ctgtcttctg 
taaaataaaa 
ctctagcttg 
atatccaaaa 
gcactgtaag 
agatcctctg 
tacagagcac 
gtagaggcag 
agtctggtct 
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aggctccagc 
tgtatttttt 
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taagggatct 
tggttgtttt 
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ccttgtttat 
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agtgagttcc 
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ggagtgagtt 
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tagaagatgt 
atacagtttt 
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ggctctttct 
ctccctctgc 
ttaacagagt 
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nnnnnnnnnt 
tcatgctctg 
ttctcctctc 
acctcctttg 
actgttcgtg 
ctccacctcc 
ccagggactt 
ccagagggct 
agagggtagc 
caggaagtag 
tgccagcctc 
ctagggtgta 
ccggctgccc 
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gaggcacaca 
tagacctaga 
gcaatctgcc 
ccagcccggc 
aaagagtctt 
ggaggacaag 
agaggggtga 
actcttgtag 
tgggagacca 
tggcctctcc 
ctacctctag 
ccttgggaag 
gcaagagaac 
tggttagaaa 
gcccagggcc 
tgcaagtttc 
gcctatgagc 
gttagggttc 
gaagaggaag 
ggtagctgcg 
gctagacacc 
ctgcaggcct 
tgtctagcat 
cctcagtcta 
ggatctgcgc 
gaagcagggc 
taggggcacc 
ctgctgttgc 
atggtacaga 
agtgggcatc 
ataagaaaag 
aaaattactt 
taactggtga 
gataatgctt 
gtgtgtttgc 
ttgctctcca 
aagacaggct 
gcctcgaact 
tgtaactaca 
ttgaaatgac 
ccaaaggtcc 
gatctgactc 
taaattaatc 
ctgccccaag 
catgctaatg 
ttcttttaag 
gtaccgtgtc 
gttgtgagct 
taagtactct 
tctaaaagta 
gttgaacaaa 
acagactttg 
tcagcaagct 
aggtggggct 
ttgcccctcg 
ctctctctct 
ctgcttcaac 
cctttttttt 
aggctcaggc 
ggatggaaag 
cactcatgtc 
ctaagtggag 
cccctccaca 
actaaccaac 



tagaggaaaa 
gaggactggc 
ccgtgggtgc 
ccattcattc 
gggagctaaa 
agagagaaga 
cttgggtgac 
ggtaggtcag 
acacttgtct 
agcctctgct 
ccaaaactgt 
aaatgaagat 
agcttctaca 
gagtcacagg 
cctcctcact 
aagtcaggac 
ctgttgtcag 
ctcacactgc 
ggtcggttca 
gctgcctcca 
accagattct 
gcccagaaca 
cctggctaaa 
gacaacactt 
taggtacagg 
cacctctggc 
ctatccccaa 
agatggagtt 
gtatcccagg 
acgaccacaa 
gagagaacat 
ctgtactttt 
agtggaataa 
ttacacctct 
tcatgggtat 
cttttttttt 
ttctctgtgt 
tgcagagacc 
catagctccc 
tgtttggggc 
tgagttcaaa 
cctcttctgg 
ttttttaaaa 
tgctgggatt 
tatcccaggc 
gtttattttt 
cttgcagtac 
gcatggtggg 
taactgctga 
ctagggatgg 
gggctttgtg 
actcttctag 
gtacaaacaa 
actggctttc 
tctttcaccc 
ttaaacagga 
cttctgagtg 
tttttttttt 
ctccctgggt 
atggagccag 
cactgggaga 
ctgagcaggc 
tggctcctcc 
agaatggatg 



ggctgagagc 
atgtggccag 
ctcccagcca 
tttattcagg 
gggtcccttc 
aggaggaaga 
aggacacagg 
cctccaacca 
ctccttttcc 
tacaaatgtg 
gtccacaagt 
tccccatgcc 
ctggtgatgg 
agctgagagg 
tgggggagct 
caaaggtctt 
gggaatctcg 
ccacaaagag 
cagtgaagga 
caccagcctc 
gctcagagat 
gcagatgact 
gggaacactt 
ctagaaacct 
acagaaagtc 
tcaggcacac 
gctccttacc 
taggcagcca 
tcaggttggc 
agctcatgtt 
gaggaccaga 
gagacaagag 
taccacttac 
ctgtgtgcac 
atggagtcag 
tttttttttg 
agccctggct 
cacctgcctc 
tcttttttgg 
tggagagatg 
tcccagcaac 
agtgtctgaa 
agagaaagaa 
acagagctgt 
tgtcctcaga 
atcttatgtg 
cctcacagtc 
tgctgggaat 
gccacttctc 
cggatgtgcg 
catgccaggc 
tagtagtgtc 
gctatgggcc 
aaaatgaccc 
tctcccagag 
tctggttatg 
ctgggattat 
tttaaagaat 
accagcataa 
aggatctttc 
tactgtagta 
tcgtatcctc 
ctgcagaact 
agcatgtgtg 



tgagccaggc 
ggaccaaacg 
catccatttg 
tggcataaaa 
cttgcctcag 
ctcctggcag 
ttggggacct 
tgaagtataa 
cacccagggt 
gagggaccct 
gccagcccac 
tgccttcctc 
tccttccggt 
ctgatccagg 
gaactggggg 
gcctccatgg 
ctgttcctgg 
gggcacgcct 
ggagagggac 
gctgagctcc 
cccacgaagg 
ggtcagtgct 
gaacccagcg 
tttttttttt 
tagaggagca 
tggcagctcc 
cagttggctg- 
caccttggtg 
tagtacctcg 
gttcttaaag 
cagcacctgg 
ccagaaattc 
ccctttgcaa 
acataagcat 
aagtaggtaa 
gtgttgccat 
gtcctggaac 
tgcctcctga 
acacagggtc 
gctcagcggc 
cacatggtgg 
gacagctaga 
atgatggcta 
acaacaagcc 
ctctctatgt 
tatgggtatt 
cagaggaggg 
caaaccctgg 
caccttgagc 
ttcatgtgcc 
aagcactcaa 
tccactacag 
ttcctgtcct 
atagaggagc 
agtcagctat 
tggccctaac 
ggtgtaagcc 
ccatgcagtt 
tgcctgcagg 
ctctctgaac 
ttcaaggaag 
tcaccacggg 
ggcaatgctg 
gtgccacaca 



tggcatggag 
tggcacagag 
cccagaactg 
atcactacaa 
tccccaagat 
tgttggcatc 
gaatgtcttc 
caccaaggcc 
gtctggaata 
aagttaggga 
aaaagatcac 
caggccccac 
cccaccctat 
tggggactca 
tagtcttcct 
aagtcagctt 
agctggggca 
atggtgtcct 
attcgattca 
atggtagact 
tctgggccct 
gccccaaggc 
gttgattgga 
tttttttttt 
tatcaaatgc 
cgtactctgc 
agggtggcca 
ggcctctcct 
gacacgttcc 
gggaaatgag 
acctgtctgg 
agggttagca 
ggtgacatgg 
atatgtttgt 
acatcagtcg 
ctttttgttg 
tcactctgta 
gtgctgggat 
tcatggatcc 
taagaacact 
ctcacaacca 
gtgtacttac 
catacttctc 
caagtttgtt 
aattcagaac 
ttgcctgagc 
caccatttcc 
tcctctgcaa 
ttttcttcct 
tggtttatgt 
caactgagct 
cctgagttct 
tgcctctcag 
cacagcaaac 
aattcgagtt 
tatcttcaac 
accacactca 
aggacagcat 
cgggtcctct 
ctcaatgtcc 
aagcaacagg 
ctacagagaa 
ggcccggctt 
cctgggaccc 



gtagccctag 
agggctcagt 
tgacgtcaaa 
aaactttaca 
tcctggcagg 
tccaaatacc 
agcaagggac 
agtctaagct 
tgtctaaaga 
cttgcctaac 
cccctgagcc 
cccacctgct 
cccacaaagc 
ggatgctgct 
ccatgcgggg 
tgtcattctg 
gcgcgctggg 
ccatgatgaa 
tggctacgct 
gatgttgcac 
ggaacaattc 
tatgtggatc 
atctgttaga 
ttttaaatca 
tcccatccag 
ccagaccacc 
ccaggtccag 
gcagcgaggg 
actcaaaata 
ccacctacag 
agtctgggcc 
tgctttcact 
gaccaaatga 
atcggtgtga 
tcttcctaca 
ttgttatttc 
aatcaggctg 
ctaagatgtg 
caagctggct 
gactgctctt 
tccgtaacaa 
atgtaataaa 
tctcgtctct 
gtgttttaga 
gaccttgaac 
atttgtctgt 
ccctgaactg 
gagaagccag 
cctatctcga 
gttgctaagg 
acacatcccg 
ctatctgctg 
ttctctccgc 
agtaggaagc 
tttttttcct 
ttcagtcttc 
gctcacacaa 
ggaaatgacc 
gccagtgggg 
cacagtgaga 
aaggtgagag 
gtctggctgc 
gcccagtcaa 
cagcactcag 
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acagctgggg 
gaacaaacaa 
tactctagcc 
tggcacggag 
atctatgaag 
tgggctccgc 
atattttctt 
aaacccttca 
ttgccagcca 
tcgaagagga 
gtggaagaaa 
caggcagagg 
ctatnnnnnn 
nnnnnnnnnn 
ggaaaaagga 
ctaatgattg 
atataaaaac 
tttctgtctc 
aagaggaaat 
acagaaacta 
acagggtttc 
tcgaactcag 
gcgccaccat 
tttaaaaaaa 
tgcccttgcc 
tcagcactca 
tacataacaa 
aggggccact 
aactgtcctc 
tttgtaattt 
tatgaacatt 
caattacgtg 
gtctattcgt 
ctcccagttg 
aaggattaca 
atctttgcat 
agggagtagg 
tccctgtggg 
aggctctaag 
ctattcactc 
tgagtccttc 
caatccaagg 
taactgcagc 
taatgcgaaa 
gagacattca 
agaatctata 
gagaaaattt 
gtcaactttt 
gctccttcct 
gtaggctttg 
gaaagatgtt 
tgaattgfcat 
cagagagaca 
ggttcgtggt 
ggttagttag 
ttttttacaa 
ggaaatatta 
gtagccgatc 
tcgtaggtat 
gagtggggat 
ctcctctggt 
ggaggatctc 
attttgttta 
cttgccagcg 



cagaagggtc 
aactagcccg 
aaggagtcac 
cttcatccaa 
cactgtgtgg 
taaaacccca 
tgaggagafcg 
ggagctacct 
aaggcaaagt 
tatgacaccg 
tctttctggg 
gcaggagcca 
nnnnnnnnnn 
nnnnnnnnnn 
atcttaaatt 
atgtggaaag 
acaggctgaa 
aggttcctac 
aaaccctttc 
taactaatat 
tctgtatggc 
aaatccacct 
tgcctggctg 
aaaatggcca 
ataaattatg 
tgagcctgag 
gatcctgtcc 
gcaaagacag 
ctctctgtaa 
taattagttt 
ttcaaggcac 
ccaccagcaa 
agttctgatc 
agtgttctca 
gtgttaacta 
aaattctttt 
tctggccact 
aagggcacat 
tggtcctgag 
ttctcatttt 
tattgttttt 
tacacttttg 
tgatatggta 
tattgagtaa 
actctgacac 
aataactcac 
tagcaactgt 
aattatgcag 
gctgcctacc 
gaggggcgga 
aatgcagttt 
ttacatgccc 
acatagttct 
gtttcctttg 
gattcctggc 
aaccgtttgc 
tttccctcct 
ttacatgtat 
gcagttagaa 
ggccttttgt 
ctatcactat 
tgcttctgaa 
aagagcaggt 
ccttgaccac 



atgagtccaa 
tcctgttatc 
ggttaggcta 
tcagaatgtt 
gaactcctct 
gcacagagaa 
aacctcaaag 
acccagggac 
ccagactgac 
cgtgcatcat 
tgaggctcgg 
tgtaacataa 
nnnnnnnnnn 
nnnnnnnnnn 
agaagatgcc 
tcacgaggga 
caaaccacaa 
cttgacttcc 
ctccccacat 
agttggtttt 
cctggctgtc 
gcctctgcct 
gttttctttt 
tggcatafcaa 
taaaaatttc 
gcagaggcag 
aaataataac 
tatgacaaaa 
ttttcagttt 
tttaataata 
ttgatacata 
tgttagcttc 
ttgttatatc 
ttgaatttca 
ggatagactt 
taatcaaagt 
gatgagtgtc 
ctgtgttgta 
ctagctaact 
gcagagcact 
cttttactta 
ctaaagaatc 
catacatatt 
tttttaaaaa 
cagcacccta 
gaagctgaca 
tctaatagag 
acgctgatta 
tttaaccgtt 
caggcactca 
gtagaattat 
tgtccctgtc 
tatatttaat 
ttcattacat 
acgcggacag 
ctttaagagt 
gcttaaaccg 
catacctatt 
gcctctagtt 
gtgccccaag 
cctgccactg 
cttctttggc 
aggctgttag 
tgttaacttt 



agcgaacttg 
tcagccacag 
gaagcaaggg 
cagagcaata 
cttcccttac 
cagttacctg 
aagctgtgtc 
cagactttac 
ctgtatctca 
gtccaccgac 
gtcaaacttg 
aatcagcctn 
nnnnnnnnnn 
nnnnnnnnnn 
ttgataagat 
tggtgtcacc 
agcagtagtc 
ctcagtgaag 
agtttttggt 
ctttttttgt 
ctggaactca 
ctgcctccca 
ttttttaata 
tataaaaaga 
catatggaca 
gaggatggtg 
aacagtaata- 
ccactggccc 
ctaattttta" 
gaaacaagct 
cctcagattt 
cttttttccc 
cctttttatt 
ttagcagctg 
tgacaaagaa 
tcctcaaaag 
caggccacag 
acaggattcc 
gcccttggct 
taccaggtag 
aatcccattt 
actcattttt 
ctctcttcct 
tcaatactca 
ctcagttcct 
ttactcactg 
cctgccagtt 
ttcagacgac 
ctcagtgctc 
aaccagctaa 
tgactaaaat 
ccccatagca 
tttttccttt 
ttgtatccag 
tgacaccaca 
cagtgttttg 
aagctgtgat 
tctggcatat 
aaatgaaatg 
gctgttgtgt 
ccagaaaacc 
ctgagaaact 
agcttgggtc 
tgttaaccaa 



tgtaacattg 
atgatgggcc 
aagccttagc 
agctttgaaa 
gagcagggcc 
gcacgtgaca 
ctggatagac 
ctcccagtac 
ggttgctcca 
actgtgaacc 
gtcctccaga 
ctgcctgtct 
nnnnnnnnnn 
nnnnaaccaa 
tggcatgtag 
ctgggcagat 
ctcaatggct 
gcatgtcaca 
tatgatgtta 
ttgttttgtt 
ctttgtagac 
agtggtggga 
catttataat 
agtgcttaca 
tcagtctcaa 
agctcaaggc 
• atttcataca 
tgcctaattg 
cataactacc 
aagtgctaag 
gccctccagg 
taccatctga 
gtttccctgg 
tttcattaat 
ctatgagaac 
cctctctctg 
tacaggtgtg 
tgtcttaaca 
ttcccttgat 
ctatgtcctg 
gaaatgcgcc 
atatgcaaaa 
attatccact 
attttttaga 
gagccttcct 
tgttgcagtc 
atcagtagtt 
aaattgttgg 
attagcacat 
gcacttagag 
tgagtcattt 
acagataata 
gtcgaacatt 
actagttact 
gttgtctgat 
cacatttcac 
cataatttaa 
gtttgtctat 
ttgcgtgtgt 
ttcacacagt 
ctgctgtgtg 
ccataaccaa 
ttacatgtct 
ctcatctttt 



tcagaccctc 
caaggatcag 
tgagacagct 
cccgacttcc 
ctggtcctct 
aaaactcaat 
acagcataat 
caggcctcgt 
gcaggaacca 
gctcatccag 
aacctgcagc 
tgcctagaac 
nnnnnnnnnn 
ggcaggtctt 
gtatgtctca 
ggcctggggt 
tctgctttag 
tgagagttgt 
tatgtcaaca 
ttgttttgag 
caggctggcc 
ttaaaggcat 
gcattttaga 
aatcaccatg 
gcttacaatc 
cagcttagtc 
tagaactaga 
tattttaaat 
atgtattctt 
aatattttca 
tgagcagtac 
ttctgtttca 
gttccaacac 
ggcacagaag 
atatcttatt 
ttcccatctc 
cgtggttctg 
agccttgctc 
taccagataa 
gaagtacgaa 
agggacactt 
tgtcacctat 
aataggtgac 
aataattaga 
ctgccggagg 
atttttttct 
gagaatgcaa 
tgcctgcacg 
gttccagaag 
aagctctgat 
ggattccctg 
ggattgtctg 
ttcacatgat 
tctgataagc 
cgtttcccac 
ccagattatt 
gcctttctag 
tacaaagacc 
gatgaacctg 
tgttttctgc 
ttccccgcgt 
atcagttagc 
cccaggtcca 
gctgcctgtt 
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ttttgggggg 
ctctcttccc 
ctttgctttt 
agccgtgcac 
atcactgcct 
acaaaagagg 
ttctctgaga 
cctgagactg 
ggggttttgc 
tcgttctagt 
gggagataat 
aaagcacccc 
tttgagtggc 
tggggaccag 
agccatgact 
acaaggacgg 
aacgagagca 
gtcagcctcc 
gccaccgtcc 
gagccagcct 
ggacaggaat 
aagggcaatg 
tgagggcagt 
ggagcacttt 
cccgtgtttt 
ggctagtgtc 
gactaggcca 
ggtaatctct 
ctgagttctc 
gcagctgcag 
gtaaatctct 
accattcctg 
acaggttgta 
ggaccaaaaa 
aaagttgtgc 
gaagaaggat 
tccaaggagc 
gttcaattcc 
atccgaccca 
catgagagca 
cattcatgcc 
agtaaatgga 
ccagcatgtt 
tgccctctgc 
gtgctccctc 
aacctagagt 
ttctcaaatg 
agatgattat 
atctagaatt 
nnnnnnnnnn 
nnnnnnnnnn 
tgccttaaga 
ggagttctgg 
gtgtgtatgc 
tgtaggtcct 
tacaggcccc 
ctcacacctg 
aagccagcct 
cccgttttaa 
agcttcatta 
tttattactt 
acagtggctg 
tgtctgtcat 
caatgtacat 



tttttttggt 
agcctctcaa 
tttaaaatag 
atggtatgag 
agaggcccct 
ttgtctgagg 
tttcagagca 
cagagcagtt 
acacacttaa 
agccaaccaa 
cttaggagtg 
actggtaatc 
acacagacat 
agtttgcgat 
cagatgggct 
gacaaatgtc 
tcattaccaa 
tgaaccactc 
gggcagcaca 
ttctttccac 
cagcagatat 
catccttggg 
gagccatatc 
cttggcttct 
gttgtttttt 
tatcactgag 
gcccctcggt 
ctccagggcc 
agtacttctt 
ctcccagatg 
gctgttccca 
cttcccttct 
tttcctaata 
aaaaaaaaaa 
atcaggccta 
ctcaagtcgg 
tatttcagag 
atctctctgt 
catagttttc 
gacccttggg 
ccaatgcaaa 
attggctgtg 
ttgttttgtt 
tacctctccc 
gatcctgctc 
tcacttcaga 
cagtgatatt 
cctttgagcc 
tattctaaag 
nnnnnnnnnn 
nnnnnnnnnn 
gttctttgtt 
ctgttgtttt 
gcgcgtgctc 
ggggatcaga 
ccatggacaa 
tgaccctacc 
ggactacaca 
aacactgggc 
acggcctagt 
tttctctgga 
ctgccaaagg 
ctaattctag 
aaaataataa 



tttgtttaag 
atgttagaat 
agttttgcca 
gaactcacat 
ttcttccttc 
cataagacta 
gtgactacct 
tccaaaaatt 
ggtgcgtaag 
agagcagttg 
ctatgcttct 
cctttttgat 
aatggttagg 
gagtcatgtt 
tcaggcccag 
ctgacgcaca 
cagcaggcag 
ctgcaggccc 
gaggatcgca 
taccctgctg 
gggcccagtg 
ccaataaaaa 
cctccatgcc 
ttcttcctgc 
ctgagatgat 
ccaggacccc 
agagtaggca 
tcacgagagc 
tgactgccgc 
ggaagccttc 
ggggcagggc 
tgtaactcct 
ttttccctat 
aaaaaagccc 
gcacttgggt 
aggccagtct 
atggtttaat 
gctggcctca 
cctcagcctt 
tgctactgcc 
atggtctcca 
ttgaaaattg 
ttgtgtttta 
aggtgtgcca 
actcttgaga 
gcccaggctg 
tacaggccta 
atatctctct 
ctgactggcc 
nnnnnnnnnn 
nnnnnnnnnn 
tatgggaatt 
ttaatcttta 
aacttgcaaa 
ctctgttagg 
ctttttcttt 
accactcatg 
gtgagtaaag 
aaggtgaaac 
tatctttaaa 
acatttaaaa 
actggagttc 
gggatctgac 
ttaaaaaaat 



ccaagatcag 
tacaagcatg 
tagcaacaga 
atgtgtgaat 
cttgcagttg 
ccttcaataa 
tcaataaaac 
ttagacaaag 
taaataaact 
aaccaaagca 
gcgtcctaca 
tatttttttt 
cttcggtgtg 
ccatctgatt 
ctgctcagtt 
tttccttctg 
atacgccttg 
aacaccagtg 
aaaggacagg 
tgcctcacac 
cctctactca 
aggtcttctg 
ttgtagacag 
cccagcagct 
agggcagagt 
aacccagcac 
ttgactctct- 
cggatgggcg 
tgtggggcct 
tgttgtaaga 
tccagctaaa 
ccctacatgt 
aagtgacaca 
tagaaattta 
actggtaacc 
caagtgacac 
ctggggaccc 
tcagacacac 
tgacccagag 
aggtaatggt 
caccagagca 
taaagccaag 
ggcagcgtct 
ccatgctggg 
cagtcttcct 
gcctcaaact 
cacctggctt 
tctgcttctt 
tcagaattgc 
nnnnnnnnnn 
nnnnnnnaca 
tatgggaatg 
ttcatcttgc 
attgcaaaat 
cttggtggta 
gaaaacctgt 
aggaagaggt 
gctatctata 
aaaagtcaat 
actgtatgca 
gtactttagg 
actcccaagc 
accctcacag 
gaaataaaat 



ttatatggcc 
catccctcag 
aatctaacct 
ggaagttcat 
tcgtgctagc 
aacatgcaca 
atggacagac 
ggtaggatga 
gagctacact 
cctagacttc 
agtattatga 
ataaattcta 
tgctcattca 
tctgtcggat 
catcttctgg 
ttcttgcact 
ccacaggcat 
tgtccttcac 
agattctgca 
ctcacatgaa 
tcctctgagt 
gctgtaataa 
cctatcctga 
tggaaacgta 
acccaacctc 
caccatgcca 
tggtgtgact 
ttgctgagag 
gtcacgctga 
cttgcagagc 
ggttgtcagt 
cgcccggtcc 
aaatcttaaa 
cttgctcaaa 
ctagcactca 
cccatctaag 
agattgtgga 
tctgtagtaa 
ggaagagcca 
gtagacactg 
tggcattctc 
ggtcaagaat 
ctctgtgtag 
cctaagcgcc 
tctactctgt 
tgagatcctc 
tccctgatag 
cctctcttcc 
catccttctg 
nnnnnnnnnn 
tagtgtcaac 
ttccacagaa 
gtgtgtgtgt 
tcagtctcct 
ggtgctttac 
ttctggcttg 
aggaggacta 
tactcaccac 
taatttcaca 
ggttagtact 
ggctggagag 
acccaggtgg 
actcacaggc 
accaggcaag 



caggctgagc 
catacctttc 
aactaagcat 
agagaccggc 
tgactgtact 
gacagtttgc 
ggtttgctta 
agaaggctgc 
gacaggatgc 
aaacatcgtg 
aactgtctag 
gtcttggggt 
ctttgcttcc 
ccggctgcag 
ggaatagatg 
tccagggtct 
cttccctgtt 
tggcactgtc 
ctgctatggt 
aaggataagg 
ctttcctgga 
aaaagcccgt 
aaatgagcga 
tccactttca 
atataggcta 
gtcacgtgat 
aggaactgtg 
gcagcagagg 
gacactgaga 
gctcatgcag 
cgccaggaga 
tgcagaaaac 
ttacacaaag 
taagtcatca 
ggaggctgag 
agatcaccat 
ttttctgtct 
ctgtgggaaa 
cagtggagag 
gagtcttcaa 
attagaaata 
gaagccttcc 
ccttggctcc 
ctgtgcatta 
atccccagat 
gtgtcccagc 
attcctagta 
tgcagggttg 
cctttagnnn 
nnnnnnnnnn 
tttcaaattc 
cccatccagc 
gtgtgtgtgt 
ctttccaccc 



tgagccatct 
ggtgtgatag 
acagaattgg 
atggcaagac 
taaagtcaat 
tggtttcaat 
tcagttaaga 
caatcacaac 
agtggaacac 
gtggcacacg 
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cctttaaccc 
ggtctacaga 
aaaaaaaaaa 
tcccagaggt 
aacgcaatct 
ataaaataca 
cctttaatcc 
ggtctgagta 
gggccctagt 
ctgacacttt 
acacaaacaa 
agtacgtggg 
gatagtcttt 
aaactgcaaa 
ctaaaatatt 
gttataaaag 
aattcagaac 
aaatacctcc 
tgtcccaggc 
aaaacttgcc 
agtctctaca 
cacctgtgat 
cctccaacta 
gaatgttgcc 
gatttgggct 
tggtagcaaa 
aaaaaacaaa 
agttcagcca 
cgactgctct 
atccgtaaca 
catataatca 
gttaagtgtg 
aaaaggatgc 
acagaatcca 
acgcacattc 
gcgcacacac 
atatgttcaa 
aagaattaat 
caaaaaaatc 
aacactagca 
gctgaaaaat 
cagcgttttc 
taaaagaagt 
gaaacagagg 
aaacaaacta 
tgatggtctt 
ctcccgatcg 
ctagggccat 
gtcccaccat 
ctgccggggt 
ccccagcgtg 
ccgcgcggca 
tggcgggacg 
gagttgttcc 
aatggcaacc 
agcagtgacc 
cttggcactg 
ctgagagttg 
catgtggagc 
tgaactgagt 
ttcctaaaag 
ctgatctgtt 
ttaacaccgt 
gggacccagg 



cagcactcag 
gtgagttcca 
aaggacttta 
cctgggttca 
aacaccctct 
taagtaaatg 
cagcacttgg 
aatagagcct 
gcttccattt 
tgttcgctaa 
aacaaaacac 
cttgagtrtca 
aaaacaaaac 
tttatctatg 
tttatttaaa 
taaagaatta 
aatccaagct 
attctgacaa 
ttttcatgat 
agatactggg 
atctcctgaa 
gattgctagc 
gccccttgct 
caaagccata 
agttactttg 
agaaactgga 
caaggcctgg 
acaaaagact 
tccgaaggtc 
agatctgact 
ataaataaat 
gagttaaaat 
tctgtggtag 
cagcggagtg 
gccatcccca 
actctttttt 
ttgaaaatga 
aacatgcagg 
tatggggttt 
agagaggagt 
tttaaaatgt 
tctctcgtga 
aaaacagccg 
ccgccgaaga 
aaaaaactcg 
agagaccccg 
acagcggctc 
cgagcccgct 
ggctgcaccg 
cccgcgggct 
agcgcggtgc 
ggcagagcac 
tgagggcggg 
cccgccagct 
cagggaacag 
tgggtacttc 
tgggataaac 
tggccgcaag 
tccagagtgg 
taaagtgtgt 
cccaaatggc 
tgtaatccca 
tgagttcgag 
cgtggtggca 



gaggcagagg 
ggacagccag 
aattgggctg 
attcccagca 
tcaggcatgc 
agtcttttaa 
gaggcagagg 
tgtacttcta 
tccactgttt 
tcatcaggca 
aacactacgg 
aggccggcct 
actattttat 
aatgaagtct 
gaaaactcag 
tttgcaattg 
tcacattcca 
ctgtttcccc 
tttagtcatt 
atatcaccct 
tgaaatacaa 
tgtactagca 
cggcttcagc 
cataagtgaa 
ccaaccaact 
actatcaatg 
caaggagact 
attccagggc 
aggagttcaa 
ccctcttctg 
ctttaaaaaa 
atacttcaac 
aggtgcctgc 
ggaagagaaa 
agatgtcata 
aaaattcaga 
ctttgaagat 
tgaagggcta 
gccggtaaaa 
gttagcagaa 
cttctgattt 
ccagagagat 
aaaacaagtc 
tagacagcgg 
tgagttcttc 
ggatgctccc 
cctccccggg 
cagggtcgtc 
ctaactcccg 
gctcaacgat 
gaggggcggg 
gtgactcttc 
gcgctgggtc 
gtcggaactt 
tcggagctct 
ttcacgcagt 
aaggtaaatc 
gaacccagcc 
gggtcaaaaa 
gtagaaattt 
ctatgcaaaa 
gcctttggga 
gtcaacctga 
ttcgaggtaa 



caggcagatt 
ggctatacag 
gagagatgga 
ctcaaatggt 
aggttacatg 
tgtatactag 
caggtggatc 
cttatcacta 
gcttaaccac 
accaatggtc 
atcctggcat 
ggtctacata 
ttatgaacaa 
aagtaatact 
caaattattc 
ccttaagggt 
gttcaacatt 
tcttcttttc 
cacacaacta 
aaagggggac 
agtagctaat 
gaagctaaac 
ctcattttta 
caaaaaggag 
cagtaacaac 
gcctctagag 
gtgagaagag 
tggtgagatg 
atcccagcaa 
gagtgtatga 
aaaaaagact 
tggtgatgga 
tgtgtaagcc 
caacgtccca 
catatgtgta 
cttagaggga 
agagggcaga 
taaactagtc 
taaggaaaaa 
agaagccaac 
ggctactggg 
gtctagtagc 
caaaaagttt 
catgtttatt 
tgccagtacc 
ccgcggccgt 
tcctctttgc 
tccttacctc 
cgctcgcgct 
tggctagagc 
cctagacgcc 
cgaggccggg 
gcagtgcgcc 
tgcccgccca 
cccctggtaa 
gcgtgcccgg 
agactcagtc 
tcaaggatgg 
tcaatcaggc 
gtcaggtgga 
gtattggaga 
tgtagaagca 
actaaatgag 
aagcaggcag 



tctgaattcg 
agaaaccctg 
ttaaaagcat 
ggctcacaac 
tagacaaaac 
aagctgggtg 
tctgagttgg 
cagttacatt 
tggggcctga 
tctacactcc 
ggtggaacat 
gcaagttcta 
aacatgtaaa 
tcaatattgg 
aaacaacctt 
ccaaggtggc 
tctacagccc 
ttctaagctg 
gcaaacatta 
tgaaagtagc 
atttaccaaa 
actaaatcta 
caaacaaggg 
ttggagtctc 
tgagcfcgaac 
caaaaatata 
tgtgctgact 
gctcagtggg 
ccacatggtg 
agacagctac 
attccagtgg 
ctaggtgtcc 
cagctacctg 
gagttgtcct 
catactacac 
cataaaggat 
tcgaaggaag 
tgcagagggc 
gttgtcaaca 
aagctcaaac 
aagccactgg 
aataatgagt 
ggggtgatgg 
tgtcttgttt 
gggttgcctc 
ataatttcct 
accgctccaa 
gatggccccc 
cttgcaccgc 
aactgtgcgt 
gatagccacc 
ttcgaggcct 
tgtgtcagcg 
gtcctttggc 
ctgctgctaa 
cgccggtgcc 
tccgccctct 
tagacgcgat 
tttcgagagg 
ttccagtgag 
gcctggcgtg 
gcaaaagttc 
accctgaaaa 
atctctgagt 



aaggcagcct 
tctcaaaaaa 
tggctgctct 
tgtctatatc 
atccatatgc 
gtggtgcatg 
aggccagcct 
ttataacttt 
agcttttgtg 
atcaccatca 
ctttagcccc 
ggatagtagg 
gaaagaaaaa 
aaatagcttt 
ataaacgttc 
agcctcttaa 
taacgtattc 
cttagatgtc 
tctagggact 
tgcaggctac 
taaacatgta 
gaaactcagt 
aaagagtttg 
caaatgcatg 
aggaacactg 
tttaaaaaga 
gaaattgact 
taagagcacc 
gctcacaacc 
agtgtactta 
ggatggaaaa 
agagtcgggc 
agctcaatcc 
ctggcatccg 
actggcgcac 
ttgctctgat 
ctcagcagga 
cttggctcga 
tgaaacacag 
aattaggtcg 
tgacttcggt 
taggaggatg 
agaaagggag 
tcttagatgt 
cagcatcctc 
ccctgacgct 
ggccgcgctg 
tcgctcaggt 
ctgagcttct 
gccgatccgc 
gcattggcta 
agtggcggga 
cggtgctact 
ggacagacag 
atatagtcaa 
aggcccagag 
tgagttccac 
atgggccaca 
cgatgcggtt 
gatagtgatg 
ctggctggct 
aaggtcaccc 
atcaaaattt 
tcgaagccag 
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ccaggctaac 
ggctatatga 
atgaaaatat 
aggtaaacag 
tgtacaacca 
tggtaccaca 
agaaatgatt 
cattaataga 
accggtggat 
gcaggacaca 
tcttcagtaa 
catttggtct 
gggcatttag 
ggtaggcact 
gctgtcgtcc 
agctcattca 
cttctcaagt 
nnnnnnnnnn 
tccaagtgcc 
tctgtttttg 
accaggctac 
cactaccacc 
ttatacgtgt 
ggtgaaaagg 
tagctgcttc 
attgaagaag 
cagtattgat 
atgtgctgag 
ctcgcctgca 
aaccaggtga 
tcaccttgcc 
agcaccagcc 
ctggtgccct 
agcagctcga 
ccaatgccat 
tcagcagcag 
ctcggtagct 
gggatttgca 
gcctagattc 
atgtagagtt 
tgaggaagcc 
tccctggaat 
ccgatagaga 
ggcatttttc 
ctgctgagag 
cccaggccca 
ccgtgtgttc 
cgctcttctc 
ttttccggaa 
acagaagagt 
tgagctcttc 
ttggccaggg 
acagtgcttt 
ctgcttcact 
aggtctgcat 
ctgtctgctt 
tcggcccttt 
gattgcaaat 
ctagtaaagg 
gctatgtgaa 
tcagttggaa 
aaaaaataga 
tgaggtcact 
cagccttgat 



ataagatccg 
actgaaagaa 
ttaacttcat 
gtctagaaag 
caacctacta 
ccacagatta 
ttggttgcgt 
aagctgaatc. 
gaatggaagt 
attrtctacag 
gtgaatggaa 
gaatatctct 
cagttactgt 
aactgctttc 
cagtgctgtc 
gtcctaatgg 
tctnnnnnnn 
nnnnnnnnnn 
actttactgc 
agacagggtt 
cctcaaattc 
acccatccag 
gtttatcatc 
attacaaacg 
tttcattaga 
tctcttcctc 
aactgttctc 
ctgtagcaac 
ggaccttcag 
gattggctcc 
ttactgagca 
acaagatggc 
tgaaagtctt 
gcaccagaaa 
catctgattc 
cagcaagtcc 
agcctgagct 
gaactcgatg 
tgaagttgac 
ggttgttaaa 
tcatctaaca 
gcccggctgt 
ccttgcaacg 
ctttgaatcc 
cttcttgtca 
tgctacctta 
agtgcggccc 
cctcccattc 
tgagacattt 
ctccaggtgg 
aaactggccc 
cacacagtgc 
aaagcatgtt 
gatagacttt 
ggagcttccc 
aatcacacct 
ccctggtttg 
ggcatttggc 
ctaaacctac 
ggctgagaat 
gcacccttgg 
atacaaggaa 
tgaagggaag 
caaatctcat 



gtctcaaaaa aaaaaagtaa 
agacctggag atcaaaacag 
agttgctgga gtaagaagtc 
actggaatag tagccatcta 
tagtttctca aacagttcca 
actctccttc agcatatcaa 
gtcacttggt aggatgaaat 
ttcatgttct gactttacag 
cctacctact taagaagtgg 
cagagacttt gagcgacatc 
acttcaggga aattttggtc 
tttttatagg agagagtagc 
tggttttcac gtttctccct 
ctcttttggc atgtgttata 
actcatagga tctggtgcag 
ggagaaagaa ccctggcact 
nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn 
aatgtgtcac cacatccaga 
tctctgtgta gccctggctg 
actgagatct gcctgcctct 
ctttctatat tggttttcct 
taaagtcctg gtcccaaaag 
cttatcaatt ctgttcaaaa 
attgctatca gaattcactc 
atcaggtcag tgactcctta 



gcttaggaat aaaaatagaa 
gaatcttgca caaactctgt 
caacaggtct gcatggccca 
ctcgccctag gcctcagccc 
gtcctcaata aatgactgag 
tatgtaggcc agtgagacca 
cctgatggtt taagctgtgt 
ggctgtctga cttccatggg 
ccagcttaag gacctaagct 
tgagtgctgc ctgggctcgt 



gagagctgag agaaagaaag 
ttagaccctg acatggtagg 
ctttagccta gagtcaagaa 
tgttggttgg aaaatggatg 
cctcttttac tgaaagagaa 
tgactccgtg agataaagag 
tggtctctcg tgtccaggac 
atagagcaag ccattcagca 
gcagagcaca ccgtactggg 
ggtatgctac cttaggtata 
ttgccttgtc tgtttggttc 
tgcattcctt gcccccagag 
ctcttctcac agggaactgg 
ttgaaactcg ccatgggcca 
aagtcctatt caatgggtag 
tctgtgtttt cctaggtgct 
gaaactcttt tatttcctct 
aagctttgaa ttccttcctg 
agggaggatt tggaggcagc 
cccttggctg cctcagtccc 
cctcttattc cttttaaagc 
agtgtttaga gacagttttg 
agtgcacagg accgcctcca 
ccctgctttg gagattaaaa 
tggcatgcac aaggccctgg 
gagtaaccct aatgagctgg 
tctagcccca atttagtatt 
ggtatacatt ggtaagaaaa 



taaaaataaa 
aaaactgagc 
tggaaaactt 
ctggtatttt 
aagaatatgt 
cagctataga 
ctcgattttc 
agctgcggca 
gcttcactcc 
ttccttcctt 
tggaaaatgt 
tttatattct 
agtctgtgat 
cttaaggaat 
gttgtgtagc 
tggttagttg 
nnnnnnnnnn 
nnnnnnnnnn 
gttctgtgtt 
tcctggaact 
gcctccagag 
atggcgtttt 
gagatgagag 
attaaacctc 
aggccttgtt 
gctcaagtac 
ctgacttcca 
cagcagggac 
gaagcttctc 
ttcccttgtt 
gacttgaatt 
gactgtgacc 
ctgctgcgcc 
ctttgtgtgg 
ccgagaaggt 
ggtgtgactc 
gactcctctc 
aatctgtttt 
aactaatgat 
ttagaagccc 
aacataagca 
gcattttcac 
tagatctgta 
gttgttgcgg 
ggaaattgaa 
gccggagttc 
tctcttgcca 
ccttaggcta 
ctaaagtctg 
tccagtgttg 
gcctttcttt 
tctgtgtatg 
ttaggacaga 
aggatgtgga 
ctcacccgcc 
tgctctctca 
agtggttttc 
attgttatgg 
cagtggagag 
aaggaagctg 
ttctgtccct 
tccctcaccc 
ttttgtggct 
agggtttgaa 



aagggagaga 
cgtctaagaa 
tgggcaacta 
gatctctgtt 
ctgggtgaat 
aaaccccaga 
tagaactatg 
gcatggatct 
cgaagtctgt 
cttcttccct 
tctgccttgt 
ttatagtatg 
tactagaatg 
gtagtatctt 
tgcccctaga 
agacccanaa 
nnnnnnnnnn 
nnntttagtt 
tgtttatttg 
cactctgtag 
tgctgctgta 
aaaatagcca 
gggctgctaa 
agagtgggat 
tgcgtgtgtt 
atgcaatatg 
cagggaaatg 
cagctagtct 
cgaaccggta 
tattttggta 
taattatccc 
agctgttact 
agatagttct 
ctccagaggt 
tgctctgccc 
aggagtagag 
ttttcagaaa 
gactattcta 
tacaggagga 
agggtaaatg 
accaacagct 
tttgacctaa 
tctgttgtga 
tgccgggagg 
gatggcctgg 
tccttccctg 
tctgaattga 
atggtgtttc 
ctgcccatgt 
aaattggcca 
ttcctagtgt 
gctttttgct 
tatttgccct 
gaaagccatt 
tctagcattc 
actccagggc 
aactagaagg 
ctgccagcat 
acccaagtta 
agggaaccac 
agctctgcac 
agtgtgccac 
gccatacctc 
acatagacct 
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gatactcgga 
ggcaatatca 
ctggataagc 
ccagagtcta 
tgtctcttat 
gtgccccctg 
aggagcccac 
ggttggcagt 
actgtggccc 
actcgtcatt 
gagctctggt 
ccttcccacc 
cactcacctg 
atgcccagaa 
cgggcttggt 
caaggccagc 
tcttgaaaaa 
aaagggttgt 
tctcactaaa 
gtgctgtgtt 
aatatatagc 
atatcctgct 
ataaatacgt 
accagtgtcc 
agcactgatg 
acctgaaagc 
gtctagcctc 
ggccaccttg 
aagtccaggg 
atgagtctgt 
gggaagccgt 
cactccaggg 
ctcattggta 
tgctgctcat 
ggtagtaata 
gtacattatt 
catacactct 
tgtcatggca 
agaagctggt 
gagtctcctg 
tttggggtag 
tagtccaaac 
gtgctcttcc 
aagctttttt 
ggaagtcctg 
tgatgccctc 
caaagcactc 
cttgagaata 
ctattatctc 
aaaacacctg 
ctgcttctcc 
tgtaactcca 
attatacaga 
nnnnnnnnnn 
nnnnnnnnnn 
ctctctatta 
atccactact 
ttcaagatct 
aaaacagaaa 
caagttgtca 
gggctgtagc 
tggggtttga 
aatgacatat 
aggtctctct 



catggaaaca 
gagagagggc 
ttttataaat 
tcacaaacat 
ctatactttc 
taagaccctg 
tttcgttgag 
gcctgaagct 
tgagagtgag 
cgtccctgtg 
gatgactgag 
tgacctgagc 
ccatgctggc 
aagactaaat 
ggcacacgcc 
ctggtctaca 
ccaaaaaaat 
cactgatctg 
cgatgaatct 
ccactccagt 
aagaaatcag 
tccttaaatt 
cttactgttg 
taattcttac 
caaggcaacc 
agaccactct 
aaacaacatc 
cccagcttga 
cactccagga 
ctgcagggca 
gagtctgtct 
ggagccatga 
gactgtgagg 
ggtttccaac 
gtccagacca 
gctgatgacc 
gctttttaga 
ttacagtgag 
gttaaaagac 
ctgtctgtta 
agctttttca 
tcatgccttg 
tcaaaggata 
tagatgcttt 
ggtttgattc 
ttctggccta 
atacataaaa 
cctgtgctta 
agcactcaga 
tctcaaaagt 
agagggcctc 
ggtccatgag 
gagagagaga 
nnnnnnnnnn 
ntctagagtg 
tgctgcctgg 
tctgcctctc 
tagatttcca 
gaaatcaggt 
aaactttgct 
caacttggtc 
attgatttga 
tacaaggtgt 
gctcaagctc 



gtatgtttgg 
atcagtcggg 
gctggttaag 
acacagtttt 
agatggtaag 
ggggatgcag 
actaaggaag 
ctgagtgcca 
gaggtccggg 
gggcccttgg 
atgttctgtt 
ctagtgtctg 
tgctgcttgc 
ccagtatcta 
tttaatccca 
aagtgagttc 
aaaaaataaa 
caggcagctc 
gtttccctgg 
gagaacttca 
tatgactgct 
tttcattaag 
gcagtttctt 
tcagtggctt 
ctgtagcatg 
ggctgtttca 
ctcctgcctc 
gactgcttct 
ggagccgtga 
ctccaggagg 
gcagggcact 
gtctgtctgc 
ctacatctcc 
tgagacagag 
caagggacag 
agtttgtttg 
acttgctcat 
gaaataaaca 
agtattcaaa 
atttaccctg 
ggaaaaaagt 
tgttttattc 
tatttaacgt 
gtaagatggc 
tcagcagcta 
tgtgagcact 
ctaaattcaa 
tattataatc 
agattgagac 
aaaagggctg 
atttcagttc 
atccaacacc 
gagagagaga 
nnnnnnnnnn 
tttcaggttt 
aactttctat 
agtactggta 
aagaagccgt 
acacttagag 
cttaattcta 
gagcccttgc 
acccagtaga 
ggccttgttg 
tgcccagggt 



tcagagagag 
ttagacacga 
gttttgaatt 
ctcaaatctg 
accctgaggg 
tggacccgac 
ggaactgaca 
tctggaaggc 
agctccggga 
tgtagagcaa 
ggcttggagc 
gcacagagag 
aagagctaac 
aaggctgctt 
gcactcggga 
taggacagcc 
aataagtaaa 
atgctagcct 
aacattccta 
attcttagtc 
tacctcaaga 
actggtgata 
cctgggatgg 
tcattgagtg 
acttcatagc 
cttgcagaca 
agccttccaa 
tactgccatg 
gtctgtctgc 
agccgtgagt 
ccagggggag 
aaggcattcc 
agatgccccg 
ggaaggactt 
atagctatgg 
gatgagttag 
cctagccatg 
gtcctgaagg 
catctgcgga 
acaaggaggt 
ttagtcctac 
ctaaaagccc 
tttatatatc 
tcagtggtta 
acaccagctg 
gcatgtgcgt 
aaaactcttt 
tagtatgagc 
tagatggtca 
aaaaagtgtc 
ctaataccca 
tctggtcgtc 
gnnnnnnnnn 
nnnnnnnnnn 
tttttgtttt 
gtagaccagg 
ttgaaggcat 
agcctagaaa 
ctgtagaatg 
aagagagaag 
atgaagctgt 
ttcgtatatt 
gaggaattgt 
agaaagggag 



cgaaggacct 
gcattccaca 
tgcccaattt 
ctttgcagta 
cagaggaact 
tttgtgttct 
agcttccctt 
agaaaggatc 
aatgaggtcc 
tcatcctcac 
tctcatcaga 
cacttgaaaa 
tgccctctga 
taaagggttg 
ggcaggcgga 
agggctacag 
aaataaataa 
aggcttttgg 
tggtttctag 
ttgtattata 
gacatacaat 
tataatttgt 
caactctgta 
ttcttggcac 
acaggcctcc 
gaatcttact 
gttctaggtt 
tctcttcagg 
agggcactcc 
ctgtctgcag 
ccatgagtct 
aagagcagcc 
agtgctgtgg 
tgcccctttc 
ggttttctga 
tgggaaagaa 
cccaaggagc 
tgcctggcag 
ctgggaactg 
gacttgaaag 
agacaactct 
tgtcacactt 
aggccttatt 
agagcatgta 
ttattccagt 
agtgcacaga 
cattgtctca 
cagccacggt 
agagctagag 
tcagcagcta 
caccgagtga 
tgcataagct 
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nnnnnnnnnn 
tttttttttt 
ctggactcaa 
gtgtcaccac 
aggttaataa 
tcagcatgtg 
ctgtcaaaag 
gtgtttactc 
tgaatgttta 
caatttgggg 
cctcctcctg 



gatagacgag 
gtgagcagct 
tgtcaggatc 
tgcccgtgaa 
cagacccttt 
ctgcacagaa 
tctggcttca 
ctgttggtgc 
tgcttactgg 
cctcaagaag 
gaggacggga 
cagattgaga 
tggaaacccc 
tcactgcagc 
tttctgagtt 
agaaaccctg 
ataaataaat 
ctcgatttca 
tagtaatgaa 
attgaaaaat 
tccacttaca 
gaatggagaa 
ttggtttcct 
tcactgtcca 
ttgttagcac 
ctgtaagcca 
tataggaaaa 
ctcacacatg 
agggggagcc 
ggcactccag 
gtctgcaggg 
atgggcgtca 
ttgtgagcac 
cctaaggatg 
ctcatcctta 
gacccaagtc 
agccgttgac 
cagcttttca 
ggcagcattt 
gtttgttttg 
atagttattc 
tgtaaaatag 
ctgtgcatgg 
ctgctcttct 
tcctgggatc 
caaatgcagg 
tgtgacctag 
agcaacacac 
tctgggttac 
agagcacaca 
ctcaaccacc 
cctacactca 
nnnnnnnnnn 
nnnnnnnnnn 
gagacaaggt 
acttatagtg 
accccactac 
gtactgattt 
acatttgtga 
acttgaactg 
cccagcactg 
cctcatgggg 
gtgagctttg 
gctgtctaca 



71460 

71520 

71580 

71640 

71700 

71760 

71820 

71880 

71940 

72000 

72060 

72120 

72180 

72240 

72300 

72360 

72420 

72480 

72540 

72600 

72660 

72720 

72780 

72840 

72900 

72960 

73020 

73080 

73140 

73200 

73260 

73320 

73380 

73440 

73500 

73560 

73620 

73680 

73740 

73800 

73860 

73920 

73980 

74040 

74100 

74160 

74220 

74280 

74340 

74400 

74460 

74520 

74580 

74640 

74700 

74760 

74820 

74880 

74940 

75000 

75060 

75120 

75180 

75240 



88/127 



BNSDOCID: <WO 0229059A3_IA> 



WO 02/029059 



PCT/US01/31488 



gaggacatag 
ccaagtctgc 
aactgtaagc 
ctcttcataa 
cctttaatcc 
ggtctacaga 



aaaacaaaaa 
ggagtacaaa 
tcaaaacgaa 
tcttgctttg 
gcattctcca 
gcctgcctgc 
agacatgcct 
acaaatggtt 
agccagtgct 
tggttggttg 
tggaacttgc 
ggataaagtc 
tcattccagc 
tgatcaccct 
tctgagtgtc 
gagaatgagt 
gggggggggc 
acagcctgca 
ccacctccat 
gccttaggat 
tggagggtcg 
gagcatcctg 
aaagcaacct 
cacagaccat 
ataagtgggt 
taatatgctg 
gccccagggc 
aaagcaacgt 
aggggaggcc 
cagttcttga 
tgtattgtag 
gctcatgggg 
ccagccccac 
catatgctac 
tcaccaccat 
ggggaagccc 
catttccctc 
ctcctgaagg 
ctccagaggg 
atgagagctg 
catcttcctt 
tggattccac 
cctctttctg 
cctgagcaca 
tgaagcaact 
ttatgagttg 
ctactgactc 
aacagaggca 
gtagagagcc 
gatgccaagc 
tccttttgtc 
tcttcagggc 
aggttatcag 
ctaccctgtt 
taaatgacta 
agtggtaggg 
gcatggaaac 
cacccagaga 



tctcctggct 
ctgcacaatg 
cagccccaat 
caataaaagc 
cagcacttgg 
gtgagttcca 
caaacaagca 
gtcgtctcag 
agaaacgaat 
tttttggcgg 
ctgaggaaca 
ctgcttgtta 
ggtatctgca 
gtaagctgcc 
cttgttgttg 
gttggttggt 
tctgtaggct 
atgtgccacc 
ccttctttgt 
ccctcagcct 
agatcttgaa 
tctgataact 
aagccccaga 
catctcagtg 
ccctacagac 
ctgacatcat 
tctttctcga 
agccactgtc 
gagctgtcag 
aggctccatg 
gttctctctc 
gtgtgtggta 
tttcctctaa 
aaaatggaaa 
tgtggaaggg 
gcaggggtgc 
agtgtctggg 
gcagagccag 
gcagccattt 
aagcactgtg 
caccacacac 
tcacccttga 
gtggaccggg 
tctactcgag 
gcaacaaagc 
aatcagcacc 
tgacttgctg 
cgtttgcatt 
ttcctctgtg 
cacagcctat 
tcacgccagg 
ggggaccagc 
ctctcaaaca 
aggagctcct 
cggtgaatga 
ttcataactg 
ctctgagagc 
gtggaaagaa 
agctacagag 
ctgaggagga 
aaacacgata 
attgtgtgtg 
tctgagtggg 
aagggcagag 



gccttcagat 
ccatgcttcc 
taaatgtttg 
ctaactaaaa 
gaggcagagg 
gaacagccag 
aacaaatgcc 
ctagtatgtt 
gaatagataa 
tgctgaggat 
ccctggcgag 
tgtgtatgag 
gaggccagaa 
atgtaggtat 
gttttattgg 
tttcttgaga 
caaactcaga 
aactccagac 
gttttgagat 
cctgactgct 
aacccattct 
atttctcact 
aacattgata 
gccatctgtg 
cttgcagttg 
ggggactctc 
gtcaaacttg 
cccagaaagg 
cagcaagaac 
caccacgggt 
tgacttggat 
cgtgcctgct 
gttaggatcc 
ttaatcaaac 
tctcctgagg 
gactccagga 
agggagctgt 
gattaaacct 
cactggtcaa 
gccttcagat 
acacacacac 
agtaagccac 
caatcttccc 
ggttgcccgt 
tggatttgac 
ctcactttga 
gggcatctgc 
ccccgtccca 
gtctcactgg 
tgtttagaca 
cataatgggg 
tgagactcta 
caattctgga 
tgtaggtgct 
cagactctgt 
cccgtgaggc 
ccgaggaggc 
ataatgaccg 
agaagagtgg 
gccagctctg 
ggaggccact 
tggcccccac 
ggaaattcag 
agggccgtgg 



caagatgtag 
taccatgatg 
tctttataag 
cacattcctg 
caggaggatt 
ggctacacag 
agcatttggg 
tgaggccagc 
acatttgagt 
taaacccagg 
tgcctagtct 
tggtaacctg 
gaggatgttg 
tcagaattga 
gggcaggagg 
cagggtttct 
gatctgcctg 
aagcagccag 
ggtcacaaag 
gggggttaca 
cgtgaccttg 
cctcttcaag 
acttgcccaa 
ccatatccac 
agatcagagt 
acggtcctgt 
gttacccact 
ccacaggtcc 
acaggagccg 
tctggctatc 
caccacgctc 
ctcctgtccc 
actcttgcta 
cattcctgcc 
ccttaccact 
gttgggggct 
gttatcctta 
ggtctgattc 
ctaattcaga 
ctccctctgg 
acacacacac 
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tttcatctgg 
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gccctggaag 
acgctgtgcg 
ctgggtccca 
ctcctgtagt 
cttctgttta 
ctggggactt 
ctactccata 
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gctctgagag 
ccgtgaagat 
aaaaagccag 
aacacttggt 
cctggtacct 
acactcggcc 
aggatgctga 
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tgaagttctc 
ctgtacccct 
aggtcgagtg 
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ccgttgggct 
ctgccagctt 
tgctgtccgg 
agtacatcac 
gaatgagtgt 
gcaacacaaa 
tttaaaataa 
gttcaaggac 
acactgggga 
tcagcgtatt 
ttgactacta 
cacccctctc 
aaacccaccc 
gttctgctgt 
gaggaatttg 
gcttttttgt 
aatgcttggc 
tttttagttt 
gacaggattt 
cttagattca 
ctgtacctac 
gtaacatgtg 
gaataagtga 
acttaaagcc 
cccaccccta 
tattcttgtc 
tggatccagg 
acctcagagc 
aggagtagca 
ccctcaccca 
gcagagtgag 
tcttccgtcc 
atcaggagac 
tcaggcccca 
cggccagctc 
atagacttac 
ctatgttaaa 
tagccttgaa 
tgagccatca 
gggtatgaac 
nnnnnnnnnn 
nnnnntctta 
tgtgtcttcc 
gagggggtag 
taaggaagtt 
gtgagatggt 
tctgtgggta 
tagtggggcg 
caaggggctc 
actcagtagc 
acgccatccg 
gccgagagat 
aggccaggaa 
gggccccttc 
ctgacctgcc 
gtttcgggga 
ctggatgtgg 
gctggtcggg 
ggcctggaag 
gggagtaccc 
ttctccaggc 
tgggcacggg 
aaggccttgt 
tgctggaggt 



ggcactggat 
gttcacaggg 
gactggggag 
ctcctggttg 
gtgctctctg 
agggcttgcc 
aacttgtagg 
ctgggttcag 
ggcagagatg 
ccatctcagt 
gcctactctg 
tgtccccttc 
tcagaccagt 
gacgccctcc 
tcgttttctg 
tttgttccca 
cacccagctg 
tctgtttggt 
cgctatgtga 
gagatccacc 
caacctcaaa 
tgagggcctg 
ctggctagtg 
cacaatccag 
cttcctccgt 
tcgatgttaa 
gctgcctccc 
tgctggggac 
ccagggcctg 
caccctcccc 
cccttcccct 
ctgacctgcc 
ccaccttccc 
agacaatgct 
ctttggtagg 
agaacatctg 
gacacacaca 
gttgcagcta 
tgcctgactt 
cagggccaca 
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tttttagctt 
atttgcttct 
gactcaagaa 
acaggtgggc 
gcaagatttg 
aggatgacca 
gagccccaga 
ggagactagt 
cgaacttgat 
ttccatggct 
aggggcgtgg 
gccccaggca 
gaggtcccgg 
cagaccatgg 
ctgcactggg 
cggaacattt 
tgcggaactt 
actgaagttg 
agtgggagaa 
tgtggcagtt 
tcctagtagc 
tggaccggga 
ttgctgtgag 



ctcaggcggt 
accttgaaga 
ccctgggtta 
ggttggtcct 
actccacaac 
agacctgaac 
gctggggaga 
tccccagagc 
ggcaggtcct 
gaggggtcct 
tgtctgtttc 
cctctgcagg 
cttccctatt 
ctgtctttcc 
acctgttttc 
gggctttgga 
tacctcccgt 
ttttggtttt 
ctctagctgt 
tgcctctgcc 
ggaagtgtcc 
ggcttcaccc 
gcaagacagt 
gtctaatatg 
atagtggcag 
agtaaccaag 
tgcactgtct 
gttctggctg 
catcccggaa 
tgccactgca 
ccctccagct 
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atctaagctc 
ctagcacaaa 
atttggaacc 
gtctggtcct 
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gtttgttttt 
tctacctaat 
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agagctcggc 
ggccctcagg 
gtaggtgaga 
ctagtcaggt 
ggtgttggag 
gccagaaagc 
tcacccgagg 
acaagccagg 
cagcagggtg 
gcaccgggtg 
catcctgggg 
ggctgcagct 
tcgtttcgct 

gggg a ctgac 

gaaccttttg 
tggatctagg 
tcctcaaatg 
agtgtcagtg 
attctaggtt 
gaattaagga 



acagatcgta 
aggggtcctc 
gaatacaagg 
ctagtccctg 
ctgtgtgctg 
tgtggtctga 
tagcacagta 
tgggcacggg 
ggggctcatt 
gtttcagagg 
tgtctgtctg 
gaacttcctc 
cagcttgctg 
tgacaagcca 
agtgtcagcc 
tcaaagctgg 
ccctgttgct 
gattttttgt 
cctgggactc 
tcctaagtgc 
atggatagag 
ccaacagaga 
attgtctaag 
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ctcctctgga 
gcctagaaga 
cttctgctgt 
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cattgcacac 
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ctgtctcagg 
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tctgggagaa 
gtcaccgacc 
ggtgagtgta 
cctcccccat 
cctgagaaaa 
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ctcctccacg 
accagtaggg 
gggccatgag 
ggaggtgggg 
gaacctgaag 
gataaagtac 
ggtgcatgct 
ggccaatcag 
gcctgaggaa 
tctgtctgtc 
accaccacca 
gctggtccta 
tcccctctga 
tcttccttat 
gctcttacat 
tttcggtttg 
tgtttgtttg 
actatgtaga 
tgggattttt 
ttcagtacta 
agggggagtt 
gtcactaagc 
ttgtccttgt 
tcctgaaagg 
gtgaaggcca 
cccacctacc 
cgagcagggc 
aacaagagag 
tgcacaggga 
agtggaccct 
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aagacccagc 
ggcttttccg 
atgggagcag 
aatagttctc 
cttgtgtatg 
gagtactaca 
agagcaattc 
nnnnnnnnnn 
nnnnnnnnnn 
acctggggag 
ggtgatgttt 
tccgtttggg 
tggcgtgctc 
gttggccctg 
ccagaacaga 
tagatgactt 
atgtgttgta 
taaacccaag 
gctgcgccaa 
gcacgctgga 
atcccgattc 
agaaactgac 
gctggcgctt 
ccgaaatctg 
ctactctctg 
ggggcggggg 
ggtttgggtt 
attgacctta 
caattaatga 
cagaaaaggc 
ctttggaatg 
gtgggacttg 



gagacagctg 
ggctcgcccg 
aggcacagtg 
tctgaggtca 
agtgggaagg 
cctggcccac 
cagcatgcaa 
tataatccca 
cctgaactaa 
tgactctggg 
tgtctgtctc 
acccccaaag 
gtctgcctag 
ctagacccga 
gagactttct 
acgttaggca 
gaggactttt 
tttgttttga 
ccaggctggc 
agattttaat 
catcatgtgt 
agtaggtgag 
cttaagccac 
gtgacatgac 
agagggaaga 
aagcccaccc 
caccctactg 
tccagtccag 
caaacgactc 
caagagtcag 
tgacttgagg 
gttggttcct 
agcaggctct 
tggaggcaca 
agcccaggtc 
tgaccaaagc 
tagcccaggc 
atgccaggtg 
ttagttaaga 
nnnnnnnnnn 
nnnnnnnnnn 
gggaggtgta 
gggggggggg 
tagtactatg 
tgaggtaaag 
taccttctgg 

aggtgaaagt 

tctggaaccc 
aggcctctga 
caggagcggg 
tcaatgagca 
aagataatac 
tgccggaggg 
tgaaacctag 
ccttcacggc 
ttcgggaccg 
gagtaggggc 
tggaaaagag 
gtggatgggt 
gctctttgtc 
tcacccaccg 
tgaatcggga 
gagcagatgt 
agatatctag 
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gctgtgcatc 
cgctgcagat 
ctgtcaccat 
aactcaaggg 
ctgaaaacct 
ctcctgggca 
aaaaaaaaaa 
gtaagagaac 
caggattgga 
aataaatact 
catgcgtggg 
ctgaaccttg 
cactatgtgg 
ctacggaagc 
tgggccagcc 
tggttgtccc 
ctggtcaccg 
ttgcagtctc 
gttgccagga 
ccccagggag 
caaggactag 
gagagagaga 
aactttaaaa 
atctctttta 
aacagtttct 
caatctccaa 
cttgggtcac 
tacctgtact 
aaacgctgca 
gtggcacatg 
ggccagcctg 
agagaaagaa 
gttcaatgcc 
tggctgactg 
aggagcttcg 
atgatttcat 
ttatcctggt 
ttaaggtttc 
tctctggtgt 
cggggggtgg 
aaggtcctga 
aatctgatgc 
taaataaata 
atatctctgg 
gcaccaggcc 
gatcagaaaa 
caggcctgat 
gctggcctgg 
tcacaggtgt 
tggtactgag 
catgggctct 
caaacccagg 
ctggtgataa 
gtatgtgtgc 
cccctttaat 
ctggtctaca 
taaccaaaaa 
ggtctgcaga 
acattaaagg 
tgttttgttt 
ggagactctg 
ccacaaagaa 
gacacacact 
tgtggtgcat 



agctctgagc 
ggagttgata 
ttggcggtaa 
cccaacctct 
atcctctcag 
ctgcccaagg 
aaagccagga 
ctctggtcca 
ggcagtccta 
aatttttaaa 
tgtccctagg 
aaccttgcct 
gcttctctat 
tactgctgct 
tggccaccct 
ggcagcacta 
tgggcagcat 
agccctaccc 
acacttccac 
gtggggctgg 
actgtatcat 
gaacattgta 
gtcccacagt 
aaattcaaag 
tccttcaaga 
ccgtccaatg 
ttctccagcc 
ccactgctgc 
tgaccccttc 
ctgataatgc 
ggctacaaga 
aaaactggac 
aagttggcag 
aacaccttac 
ctgaggagag 
cagtctcttg 
ctgcagtgtg 
tctggctctt 
gtgtgtgtgt 
agggggctgg 
gttcaaatcc 
cctcttctgg 
ttaaaaaaaa 
atccccctgc 
aggctagaac 
cagacccagg 
aacctgcggt 
aattcacaga 
gtgccaccac 
gatagaatag 
ccttccccaa 
tcttgtgtga 
cttatagcct 
ccacacaaac 
cccagcactt 
aagtgagttc 
aaaaccactt 
ccaggctggc 
atgtcccacc 
tgttttttct 
aagtcagggc 
accctctctc 
tgtaatctaa 
tcaagagcag 



gaggagcctc 
tgctgtggaa 
gactctgaag 
ctcttcagtg 
acttggattg 
agtttggttc 
atgaaagcta 
gggctgctca 
aaaaatgctt 
aagctgatac 
ataaggagct 
gcaggtagtg 
ggtatccctg 
ctcccataag 
ggtcctcttc 
ccagctgtct 
aagcatctcc 
gtttatcctc 
tctcagtctg 
gaagaggaga 
cagagagaga 
tgagatctcc 
ctttacatat 
tctttttaca 
gggaaaatat 
tctgggatac 
aggccctttg 
tgctgctctt 
agtcctgggc 
tagcacttgg 
gactctgttt 
atgacagccg 
tgcctagttt 
ccagcaactt 
aacattgaac 
tgaatgtgat 
gtgtggagtt 
ttctttcctc 
gtgtgtgtgt 
agagatggct 
caacaaccac 
agtgtctgaa 
aagaagttgg 
ctctacctct 
attctctgat 
gggccacaga 
tttttgtttg 
gatccacctg 
acttggccta 
actct caeca 
aggaaaaata 
tggctcagca 
tgttatgctc 
acacaatcaa 
gggaggcaga 
caggacagcc 
taaatattat 
cttgaactca 
actgcctggc 
aaaaaatttt 
cagccaggtc 
aacaaacaaa 
geccttgaga 
cctgggtggg 



atagtcttct 
ccagacattg 
ggagaggeca 
ttctgtcctc 
agttggaggg 
tccccacaaa 
gggtatgeta 
tgccccctag 
gggtaatata 
ctggaaggat 
gggacttgtt 
agctgcctca 
cttctggagc 
gccccatcct 
cgccttctgc 
ettgetctgg 
acagggatcc 
atgeacaagg 
aaaggtgtgg 
tggtagecca 
gagagagaga 
attacagtca 
tcttaaaatt 
attaaaagtc 
cagggcacag 
aactcacgat 
tagcacacgc 
ggtggtcatc 
cttcaagaga 
gaatgacaag 
caacaaacaa 
gaacattatc 
gataagggtc 
gctgatagac 
cctattgtat 
tgctttgagt 
gtggaagctt 
ccggtttttg 
gtgtgtgtgt 
cagcggttaa 
atggtggctc 
aacagctaca 
catggatgat 
tagacactgt 
ctacaaatac 
aatgtcttag 
ttctgggggg 
cctctgcctc 
actgectgag 
gctcttctga 
aatgtaagaa 
tcaaagctac 
tcctttgacc 
gaaataaatg 
ggcaggtgga 
agagctacac 
ttttattttg 
cagagatcca 
taaagattta 
tttaaaaaga 
tactgagtga 
caaaagagaa 
ggctgagaag 
ctacagaaaa 



ccggtgcctt 
gcccaggcct 
ggtagtaagg 
tgacttttcc 
aggttttgac 
cctccagctg 
tgcaaatagt 
ataagggtca 
aagtgaataa 
gaggcagaga 
gggcacaggt 
gcaccgctgt 
tgaactccat 
tggccttcag 
ctctgggatg 
ttctgctttg 
gaattctgac 
aaaccaagac 
aagttttctc 
ctgcatagtc 
gagagagaga 
ggaaatcagg 
tcaatctctt 
tcaactgtgg 
tcacaatcaa 
cttctgggct 
gtcatcctct 
tcatggtact 
gaagactaga 
cagaaggatc 
acaaacaaac 
tgacattcat 
tagccactct 
tctgccttcc 
atgaataaat 
catttttctt 
tgagttggga 
ttttgtttgc 
gtgtgtgtgt 
gagcgccaac 
acaaccatcc 
gtgtgcttac 
gtagtgaaga 
gagtatggaa 
ctagagtatt 
taggtaaaaa 
egggagagge 
ctgagtgtca 
tttgagcatc 
cttccacatg 
ttaaaaaaaa 
ctcccgccac 
tccacgggca 
cagccaggcg 
attctgagtt 
agagaaaccc 
ttttgtttat 
actgettctg 
ttttttcttt 
accatccctc 
gctctagggc 
cagacccaac 
ttcaaggcta 
agaaagaggg 



caggttattt 
gggaccttct 
gagcatgtcc 
acaaagcccc 
tggctageca 
atcataaaaa 
gtggcttggg 
gcagaaaggt 
ataaaaaata 
gtagaaaaaa 
catgeaaage 
tgtgtctggc 
ctgtttgcat 
agtaagcagt 
gatgagtctg 
tgtggctggg 
caaggatatc 
aegtgagect 
ttctgtcagc 
tactatgtag 
gagagagaga 
agatctaaat 
taaaatatcc 
gctccactaa 
aagcaaaagt 
cctccaaggg 
aggctccaga 
ggcatctcca 
gcctggcaaa 
agaagttcaa 
aaaccaaaga 
aaggtcctga 
ggtaatacca 
agcaaaaggg 
tgctgtgcaa 
ggctccagtg 
gggtttcctg 
ctggtggggt 
tagaagttgg 
tgctcttcca 
gtaacaaaaa 
atataataaa 
ctggcattag 
gtgtaccacc 
attcctctat 
cacttgettt 
tggctggctg 
ggtaccagga 
agtactcaca 
tgccctgcag 
aaaaaaaaag 
agctgaccac 
tgctgtacac 
aggtggcaca 
cgaggccaac 
tgtttcgaaa 
cctggaactt 
cttcccaagc 
ctttttgttt 
ctagcactca 
agecaggget 
cagacctgag 
gecacaagtg 
agagagagaa 
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tggttaatga 
aagacgagtt 
agttcaatcg 
tcagccaaac 
agggcaatgt 
agatcctcac 
tatctacaga 
gtgagagtcc 
taatatctga 
gaatggggta 
agaactaaga 
tatgttcacg 
agaggccaga 
cagtccaatg 
taactgtgga 
ggcccatctg 
tggctgggag 
actcacctct 
ttcccctaat 
ggtcgctggg 
tgcagcagaa 
gtttctaaat 
gctatttgga 
taaattttaa 
tcacataatg 
agccatttgg 
attttttatg 
catcaattag 
catccctgtc 
cacatacatc 



nnnnnnnnnn 
tgcctggctc 
tttggtttga 
ctcacagaga 
acctggcccc 
tatctatcta 
ggtagtcaat 
tgttaggctt 
atcttccatg 
accaaccaag 
ccaggaccaa 
tactgctctt 
acaggtcttt 
ttgaactcac 
taccacacct 
tcacaggatt 
agcacaaaga 
aatcaagcct 
ccggaaacat 
caggagccct 
tgtgctcagc 
gagtctgaga 
tagatggggt 
cagtagacct 
tccccggggt 
acacatcatc 
gaaatggctg 
ctctgagtct 
ccaaagaagg 
ggaccctgtg 
gttgggcaga 
gtaacaggaa 
cagctaggfca 
gctccgtgca 



agatgactct 
gagggtggag 
gttcaggtac 
caacgcagcg 
cactttagct 
acaccttctg 
gataagggta 
ctggaaaggg 
cattcctgga 
taaaccagga 
ttagaggcat 
aatgttagcc 
agaaggtgtt 
tgtgtgcfcgg 
ggcatatctc 
cgcatgcgct 
acttgcagat 
tctctgtagc 
cctaaatctt 
gctctagaga 
cttttttgct 
ataaccccat 
ataattaaaa 
atgttattat 
aaattaacca 
agtggaggcc 
cacctatcta 
ctttgatcct 
ctcttttcta 
actgtccann 
nnnnnnnnnn 
tgttttgctg 
ttttgttttt 
tctcttgtct 
tttcatctat 
tctaaaattt 
aagtaaccct 
ggtagcaagc 
ggttctgggg 
tcattttccc 
atctagagga 
ctcccttccc 
tctctgcatc 
agctgtcctc 
ggctcatttt 
cacaaaaact 
gttgttgaaa 
cacaccctga 
tatcctctga 

tggagggaag 

cattttattt 
ggaagtactc 
cacttagtgg 
gagtctttat 
cctgatacca 
caactccaag 
aggaaaacag 
gaagagggtc 
aacacggcag 
cgaggctgga 

g a ggg9 ctac 

aaatttgggg 
gagtacacag 
gagtaaaacc 



ggaaaagtga 
aaccgccaaa 
attacttcca 
actattgact 
ttcctttgca 
ccagatccag 
caaggcgtgt 
ctccctgccc 
aatgtatgta 
tggcttcaca 
gcactaccat 
tgcatgtatg 
ggttggattt 
agatgaaact 
cagcctcaga 
ggagacctcc 
cccaagtggt 
cattggctga 
tgccctctcc 
gatggggggg 
gtatattgag 
ctgtttcaac 
aatttcaata 
ttttaaaaag 
tacgcaagtg 
atggcgcttc 
ctactgataa 
ctgtcataac 
catcttatac 
nnnnnnnnnn 
nnnnnnnnnn 
ttgttgtttt 
ttttagagaa 
ctgccttcca 
ctatctatct 
atctgtgtgt 
cagaagttgg 
atgtttaccc 
acacaaatgt 
agccccatcc 
gtgtccatgc 
ttccttcatt 
ccaagctagc 
ctacttcagc 
tcaaataaat 
ttggaaggag 
gaagagagaa 
gttcaattcc 
cctccagacc 
tcctagaaat 
ttcctttgtg 
actgtgtccc 
tagccttggt 
atctgcaaag 
ggttgtatta 
tggctttctc 
agttgcagaa 
agcgactctg 
agggagagcc 
ggggaggaag 
attcatctgc 
ttgagggggg 
taactcccag 
cacacagggt 



aactcaagag 
gcggacgagc 
aaacgccatt 
tctgcatttc 
gaaaggaaaa 
acactggtat 
gtgtttaaag 
tcaccatcac 
tgtattcatt 
tctcagagac 
acttggctca 
tgtgtgcacc 
cctggagatg 
tggttcatcc 
tttcccagtt 
tttaccgcct 
acaagagaag 
gcatactttg 
aaatgtgctc 
gggggagcaa 
tcttaaaaat 
acaaaatgca 
cttgatttaa 
ttacaagttt 
aggctcggtg 
acattgaatt 
caaaacaccc 
tcgtgtccac 
tgaatccaca 
nnnnnnnnnn 
nnnnnnnnnn 
tgtttgtgtg 
tcttactatg 
agtgctgaga 
atctatctat 
gtctgtgtgt 
ctttctctaa 
actgagccat 
gtactttgat 
tgactcccat 
ccaacaaaca 
tcttcgttcc 
cttgaacttg 
ttcccaaaca 
aaaaagaaaa 
actaaaaatg 
gaggaaaagt 
tgaccctatg 
cgctgtggca 
gaatcttact 
tacccgacac 
cagtgagctt 
gtggagaaag 
ggtgttgggg 
agtgtatgtg 
atttttcctt 
agacagggcc 
agaaatgaag 
cccgaggaga 
aggggggagg 
agtatggtgt 
cagcctgtag 
aattctctgg 
ggatttcagt 



aaagcccctc 
cagacagaga 
gccacatcag 
agagacttcc 
gtccctgcct 
gactcagcct 
tatgtgttta 
cgaaagcaca 
atgtagccct 
ccatttgcct 
tgatttactt 
atgtgcatgc 
aagtcccaaa 
acaagagcag 
aatgtttgct 
tgagcctcat 
aataaactgg 
cctcaaccta 
ctcccccgca 
cgggfcacagc 
tcatataaac 
acaacaaaat 
aaatgcttta 
aaaaaaaaga 
cactggtaca 
ttatactttc 
atgagagagt 
tccctgcctt 
cactcagttg 
nnnnnnnnnn 
nnnnnnnnnn 
tgtgtttaga 
tagctcaggc 
ttaaaggtat 
ctatctatct 
acattcccca 
tccttggatc 
ttatgacccc 
gtttacagga 
taagtgttct 
ctctgccaag 
ttcttttctt 
tgatgtagct 
ctgggattat 
tcaaaaagtt 
gatttttaaa 
tagcttagta 
gtagaaggag 
cgtgcatgca 
gaagcaggtc 
gcttccattc 
ctgtcttacc 
agaacacagg 
cataatcaaa 
catggtcctt 
gccagtagcc 
atggcctggc 
ctatttctga 
tggagacaga 
agtgagaccc 
agaggggaca 
ggctgggccc 
ctccactaaa 
ctcctttgca 



agatttgctt 
ctgccaacaa 
gatgcttcaa 
gtctctgtcc 
ctgatgtggt 
cggggagctc 
aaagtacaaa 
aaccttaggg 
gactgtcctg 
ctgcctccca 
aactttattt 
ctggtgcccc 
caactgtaag 
tatgtgctct 
ttcgcaccca 
tggccaattg 
tgtgctatga 
ccgcccttcc 
gtaatccagt 
ttaaggcagc 
tttgtgttct 
gtttcaaatt 
actttttaaa 
aagatagaaa 
cagttacagt 
tttatagaat 
tagaattaga 
agtcctacct 
tttacacaaa 
nnnnnnnnnn 
nnnnnnnnag 
tttggtttgg 
tgtccttgaa 
acaccacctt 
atctatcatc 
gagcctgtgt 
aaacttgaat 
atggcccagc 
caagcgctta 
ttcccccaac 
cctctcccct 
tctttttgaa 
caggctggct 
agacctatgc 
cctagaacag 
aaatgcttga 
ggtagaagtc 
aagagcaatg 
cacacacaag 
tgccaggccc 
tcaaagttgt 
ctgggtcact 
tccgagtagc 
tctccccccc 
ccaagtcttg 
tcttggtgag 
tgcaggcttt 
gtgagagggg 
agccggagag 
actgtcatct 
gagagtgatg 
cagcagtgta 
tccctgttcc 
cccccctcca 
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ccccccctcc 
gtaaaagagg 
gctccgtgac 
agatgatggg 
acaccacgga 
agctgctgga 
tcccaataaa 
gatctgccac 
aaagcctcat 
cctgatgata 
ttatgctgat 
atggggctgg 
cccacacttg 
tgtgtctctg 
tcaatggctg 
ttcagtgcct 
gtctgtcctt 
ttttttttaa 
gagggagtca 
tccagacctt 
acatcattgt 
acccccaggc 
tttcttcttc 
catgtctcgc 
gcaggggctc 
tggaaagcac 
catgccattg 
caccctcctg 
gacgggggcg 
tctcgtagag 
agatctcctt 
ctcctctggg 
ctagaagccc 
gggaggagga 
tcacaaagag 
gatgtttatg 
gatttatata 
gaacacagac 
aggaagcaga 
caatggccca 
atccgatgag 
actaaagtgt 
aagtggttgt 
ccagcctgga 
gggagggcaa 
tagtgagtta 
tcccaacaac 
gtgtttaaag 
ggccagagca 
atctgtacaa 
aagaaagagg 
acagagatga 

ggggggctac 

aaaatacata 
agtcccagca 
agctgtatgc 
agacagacag 
ttgctattca 
ccaactccac 
tgttgtgcac 
agaaaagaaa 
ccaggtcctt 
ctttcccaac 
ccttcctggc 



acccccagct 
ctggtgagga 
actttaaatg 
tcagggacct 
gacccctgtg 
gcgcaagcct 
cccctttccg 
cttatcacac 
tctctgctta 
acaacccttg 
agaaatgtac 
ctggggcttt 
gttgcacaag 
tgtgtgtaaa 
cacaattgtc 
ttcctctgtg 
gggttgtctg 
agatttattt 
gatctcgtta 
cggaagagca 
cttgcccacg 
accgggacac 
caagcaggtc 
tgcctctctt 
taagctgcct 
tgtgacccaa 
atgtgcacgc 
gtgcttagga 
ggccacagca 
gtaaggggcc 
tgtccccata 
gcagcctcaa 
tcttacgtta 
aaggaaaagg 
ccctcttttc 
tattttttta 
ggaagtgcag 
ttgtaaaggg 
aggtaaaaaa 
ggcggaactc 
ccagagtgtg 
gcactggcct 
tattctagta 
ctgcatagca 
catttggaat 
gttcagcggt 
cacatggtgg 
tcagctacaa 
agtagaggtc 
ttacagtgca 
gtggggcagg 
gcttcatgtg 
agtgagagta 
ngcnaaaaaa 
cttagaggac 
agtcctgatc 
agagaaacac 
gaccccatga 
agagctgttc 
acatgcacag 
ggctggcttc 
cccagtccag 
gcagtgtcca 
tccctcacca 



ctggtcacag 
catcggaaag 
cctcagataa 
cagcgcctct 
accccctcgt 
caaggcctgt 
cccaggatta 
acatacaaaa 
acccaagtga 
caatccaata 
tctgcatgtg 
gcacatcatt 
gcagaatgac 
acgcctctct 
ctttctcttt 
ggtttgtcct 
gctagaacac 
atatgfcaagt 
ggatggttgt 
gtcgggtgct 
actgctctcc 
aacatcttct 
tcccaggaaa 
ttcgggaacc 
gggcaaagga 
gcacattttg 
tgccacacag 
actaacggct 
gcgcaggctg 
cactgaaatg 
cctcagcccc 
gcccagcacc 
ggggatgaca 
gaggggagag 
tggctttttg 
gaacccgtat 
tgagttaagg 
ttttgtaaca 
aaaaaaaaat 
ctgcttgaaa 
gccatagctg 
ttccttgaca 
cttaggaggc 
acacccagtt 
gtaaataaat 
taagagcgct 
ctcacaacca 
tgtacttaca 
ctgagtttaa 
ctcatataca 
ggagggaaga 
gaaacacagg 
tcatgagttc 
aaaaaaaaaa 
tgaggcaggg 
ctatcccctc 
aaagaaaggg 
cctgagctca 
tatgatctct 
attagaaaaa 
ttccacgtca 
tgggggctga 
gcaaactcaa 
atgcgggggc 



ccagtcagag 
tctgtaccct 
aacagtgaga 
gaggctcaga 
gcagagggag 
gctagttatg 
gtggacacgc 
atcccttgag 
cacctatatg 
gaggggaact 
gggagcctgc 
gggactcaga 
aggatgttat 
ctggagccct 
ccaaggacct 
gctagccccc 
agacatcatt 
acactgtagc 
gagccaccat 
cttactcact 
agaatggtgg 
acacgtgtag 
tggcacttac 
ccccagaggg 
gcagggggta 
cagcagtaat 
aaaccagtga- 
ctaatgagaa 
gcactgcgtg 
tcacttaaat 
acgcttcttt 
cactttttag 

gg a gg ta g a g 

agggaaagag 
actgcactgt 
ttattaacag 
ggggcaatta 
tccaatcaaa 
tgtcccatta 
gaaggtgaga 
ggtcatgagg 
aaggatgcac 
tgaggcagga 
tcaaaataac 
aaaacatttt 
gactgctctt 
tccataagga 
tataataata 
ttcccagcaa 
taaaataaat 
agaagaaagg 
cctgtagtcc 
aaggtcaact 
acatagccag 
cagaaagaaa 
cccccccccc 
gccttcagat 
aagcctggga 
atatgaatgc 
gaggaggaag 
gtgtgagagg 
actgaggcag 
ctctacagcc 
attggctccc 



ttgggggtgg 
ccactagcaa 
gactctcctg 
caccaggata 
aatgccaatg 
agtctactgc 
cttgctcaag 
gatggttacc 
gcagatccct 
cgggtttctg 
cttgctcacc 
gatgttgact 
gcctggtgtg 
cctgtctgtc 
ctgtatgggt 
tgtcactgag 
gtcttttttt 
tgtcttcaga 
gtggttgctg 
gagccatctc 
gcaggaggat 
gtcttgtgca 
agagattgaa 
agcagcagaa 
gcatggagcc 
gtcaaattct 
• cacaaaggca 
atgagagctg 
ttggaggagg 
tagccaccac 
ctttttttct 
agctgtaaac 
atcaggaagg 
atcgagagag 
gagttattta 
cctgaaagga 
agagagcaga 
ggtgcttcag 
gaagctgaca 
agggagggac 
cccagggttg 
ctatagctag 
ggatcaccat 
aacaaaagga 
ttttaaaaaa 
ccgaaggttc 
gatctacgcc 
aataaattct 
ccacatgatg 
aaataaatct 
taagaagcta 
tggcactcag 
tgggtgagac 
gcatgatggt 
aggaattcaa 
ccagagacag 
ggctcagcaa 
cccaaggtag 
tggggcatgt 
aaaaacataa 
agggtctggc 
cggaggaggc 
tgtcctgatc 
aggctcctgg 



ggcagatctt 
agtgccagac 
gtggcaggca 
aagaataaaa 
tggcccagct 
tgctgccttg 
ccgagtccct 
atcctgggac 
gtgtccttct 
tcagcttcct 
ctgagacccc 
acatgaacgt 
tgagtgtgtg 
tgcctcttgt 
gtgtccttca 
aaagtcttct 
tttttttttt 
cactccagaa 
ggatttgaac 
accagccccg 
gtgacccccc 
ctggctttgc 
gagtttaata 
accagggctg 
ttagccaatt 
gccgttcagg 
cagccttctc 
aaaggagaga 
ctgacccact 
tcccaacact 
tctttttctt 
caccctggtc 
agggagggag 
catgcattca 
gccaacaata 
gagagacgga 
aagagatacg 
gtattttcca 
ctggatggag 
acagaccagg 
gaaggacccc 
gcgtggtggc 
gagtgtatgc 
agtgggggtg 
agaaaggggc 
tgagttcaaa 
ctcttctggt 
ggagtgaggg 
gctcacaacc 
ttaaaaaaag 
aataaaaggc 
gtggggttgg 
cttgtctcaa 
atacatttat 
gatcaggctg 
acagacagac 
ttaaaggcgc 
aaggcaagag 
gcctacacta 
gattgtttca 
ccctttgtag 
aataacggag 
cacagagaag 
gccccccccc 
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acacctgtgg 
ttggctcttt 
gggttgcccg 
cgcaaatggg 
agcttggaag 
actccagttt 
gaaacccgca 
aggctggggt 
agaggcagag 
accccagcaa 
tactgtgtgc 
tttcttctga 
gactctccaa 
cagttctaca 
tcattttcct 
tgccaggggc 
aagactgcct 
agtgctgagc 
ctagcatgtg 
ggttcatgct 
ccatcctcag 
acaaacaaca 
ccatctcaaa 
ctatcaagtc 
gatggctcag 
gcacccacat 
acacagatat 
aaattggaag 
gaagggggag 
tttggggttc 
gtgggggcca 
tattactatt 
ggatgtctgt 
tctggtagtc 
aaccttattt 
gaccctcctg 
tntccggatt 
ccgctggncc 
nnnnnnnnnn 
nnnnnnnnnn 
agctctaact 
caccagcaag 
tggaaacgga 
aggactctgc 
tgctttctga 
gcaatgaaac 
ttatttcctt 
aaaagaaatt 
tctctctctc 
atgaatgcac 
gcaaatggtg 
aatccctttg 
tcctcctcct 
tacccttaag 
tttgatgaag 
agaagggttt 
gcttttggct 
ggatgcccag 
taaaagcagg 
tgaccacagc 
taatcgcttc 
ttattccagg 
gtgccggcag 
ctggggggaa 



agtgctaggt 
ggtaatagac 
tggctccctg 
agagcatgga 
cagcaggtgt 
tttggcacaa 
gttgaggaac 
gtgaagaaga 
tccctgagtg 
acaaaatccc 
cactaaccat 
gactgtgata 
ccccagctga 
ctggttgaga 
gtccccaaag 
caccccatgt 
ggtgggaggt 
ggaaaatggc 
caaggaaggg 
ggtattctca 
ctacatagct 
acaaagcagc 
aagcaagtag 
atgtatatat 
cagttgagag 
ggcagctcac 
ccacgcacat 
taaaactctc 
ggcataggtg 
attatgcaat 
aaggtctcct 
agtgtgtggt 
ggtctgagga 
aggtttgcaa 
tttgcattga 
ttcctgcctc 
gctgcncacc 
ctcccccccc 
nnnnnnnnnn 
nnnnnnnnnn 
gctgttaaat 
gcccaggtcc 
cggaaagggg 
tccttccccc 
ccgcttagcg 
actaatgcag 
tatgtagaaa 
tctcttatga 
tctctctctc 
acaaagtagc 
gtgactggga 
cattcaaacc 
tacctgtctt 
gacaattata 
tcgatattaa 
gggttcgatg 
agggtagaaa 
agaaggtgga 
gatgagctgg 
tcctgccctc 
agaaagccag 
caaggactga 
ggcgggacct 
aggctaccgt 



gatttgctaa 
atgcctagca 
cagctggggt 
acagaggctg 
ctgggaacgg 
ggaacaagct 
agcgggtaag 
gcccatgcag 
gggctccctg 
atccataatt 
cagcctaacc 
gtggcgggga 
gccccttgtt 
acactcacgg 
cctagacaat 
ctccactagg 
gagataagga 
agaatgggct 
cctgggttcc 
acattcagga 
agcttgagac 
agaaggccaa 
catgtatctt 
gcatgtatgt 
cacttggatg 
aactgtctgt 
aaacaccaat 
taaggagaca 
tgtggtgtgt 
gtatacatgt 
cctcaatccc 
gtgatgtgtg 
tgctctctta 
agctagtgtc 
acatggaact 
ccccntgtcn 
tccccgccnc 
cccccccccc 
nnnnnnnnnn 
nnnnnnnnnn 
tcacactcct 
agcttgactt 
catggaagca 
catgcctgcc 
gaagacaagg 
tgggcttttc 
gtgcatcttt 
actaatagca 
tgtctcctgt 
ccccccccat 
agtctggcca 
cgggctgctt 
cccatccttt 
ctcttccatc 
agagatgctt 
tcaaagtttt 
agagatggag 
aggatggggg 
gcgctaggag 
ccctccccca 
attggaatgt 
cccaaaccga 
agctgtgcag 
tttatctctt 



tgttgggcaa 
ggagggcgga 
gtctggccag 
tggatgctaa 
gcctgtggcc 
ggctgagccc 
ggtgccatgg 
ggaagccaca 
ggtctcccct 
tgaggtttat 
ctcccccact 
cgggttggga 
ctgcagctca 
ctctctcagc 
ggctggtcca 
ctggcaatgt 
aagggatggt 
gggaggtaac 
atccccagca 
ggtacagtca 
cagcctgggc 
ctggcaagag 
ctttcatttt 
ttgtatgata 
ctctttcaaa 
aattccagtc 
gcacataaaa- 
aaagggactg 
ttaacatgca 
gtgggtgcaa 
tctctgcctt 
tgggtgtgtt 
ccatgttcgt 
tttatctgcc 
tcctgagttg 
nnntcacang 



ccnagcctcc 
cccccccccc 



nnnnnnnnnn 
nnnncagaca 
tcacatcccc 
agagcaaagc 
gaaccagaca 
gtgcaactgg 
ggagcctgtg 
attaagccag 
atatgttgtt 
ctgatacata 
gcatgtgtgt 
attatttctt 
tgcaattctt 
gctgtggcca 
gcagacaatt 
agcaaacacg 
tatttacata 
aaaatcttaa 
atacttattc 
agggagggtc 
tggaaccagt 
gtctctctgt 
gttgctcacc 
tcatggcatc 
aaacagctcc 
ggcagatggc 



catttgccca 
gcttggaggg 
ctgaagaagg 
gcaatatggg 
caggcagatt 
aagaggcaag 
gagcccatgt 
catcccctcg 
tacctaacca 
agagacctca 
gtcttctcta 
gtgtgtgtga 
gtaacacagt 
tccttagaga 
tttgtaagct 
tctctgtcac 
ctcccctggg 
tctgttgcta 
ctacagaacc 
ggaagagcag 
tatgtgagac 
gagtattacg 
tttttacatt 
tgaaagtagg 
gaacctgggt 
tcaggggatc 
taaataattt 
aggggaagtg 
gaatacactt 
gatgtaagct 
attttcattt 
aagccctcac 
gtgggttctg 
gagccacctt 
cctggacagc 
aggacacacn 
tgcctccccg 
cttccccccc 
nnnnnnnnnn 
cacacactca 
acgctaggac 
atcctccccc 
acagcagcct 
ggaggcaaag 
attatgattt 
atttattcaa 
ggaggagcag 
gtggtagcta 
gtgtgtgtgt 
ctgtgggatc 
gccttttctc 
gccctttcac 
atcctcaata 
ggtgttcttt 
aagtcaaata 
ctagaggatg 
tgatgttgct 
cctcctcaag 
cagaagtgtc 
gaaccgccag 
ctccactgct 
tgccaatcag 
gttatggcgc 
ttctctcttt 



cgtggggttc 
gggagtcctg 
agccatggca 
aggcagtcta 
tccagtgagc 
tggtgataat 
gctcatgaag 
agttccaggc 
gtctcccggc 
aaggctgagc 
gctgcccctc 
agccctctcc 
aacacaggct 
gcctgttttc 
tatctgagga 
tgtagtacag 
gttcccacac 
gagtacttgc 
caggcgtggt 
aagttcaagg 
tttgtctcca 
taaagtaaat 
ctataaaggc 
gggctggata 
tcaattccta 
tggcaccctc 
ttaaaaaaag 
ggaggggcat 
ctataaaagc 
gtgcatatgt 
aaattataat 
ggcaatcaga 
tggatggaac 
gctggccttc 
aagtccccaa 
gcttantggg 
cccctcgccc 
ccccnnnnnn 
nnnnnnnnnn 
aattaaatat 
tctaaggagg 
ctccacacaa 
agccaagccc 
ccccagccgg 
ctgctgattt 
tctaaagatt 
agatgtgata 
tgcctaggcc 
gtgtgtgtgt 
tccagactca 
ttgccagccc 
ctggagtcct 
actagccaat 
ccttgagtct 
gctccctttt 
ggtgtagagg 
ttaaaaggta 
ctaatgaatc 
tgcctttgac 
cattaggagc 
cagaaaacct 
gaggccaaag 
gcagaaaaag 
gatgctttgg 
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gccttacctg 
gttctcgaag 
cccccgctgt 
tcggaaattt 
gtagagtggg 
gaaagaggtg 
ctggcttccc 
tattaggacc 
cctggcttct 
ccctaaaggt 
ggacgctcct 
tactttcttc 
ctgagtgaga 
aacaggaaag 
ttacctttcc 
gcttcaggct 
aacaagggga 
agccagacat 
atctgccatg 
atgctggcct 
gctccttgga 
tcttagctgc 
agaccctcca 
tccacaattg 
gcaaacccca 
tgagccctgc 
gatccatgct 
ctcatcctgg 
gattaaaatc 
acaaagccct 
ttctaagtgg 
cccccccccc 
ataattctcc 
gaagcacgct 
ctctcaagtc 
atttacactt 
ctgaggactc 
ccctggcctc 
ggtacaccgg 
aaaggcttcc 
aaggtaaatg 
ccttgttacc 
ctgctgagct 
gtctccccac 
ttcatttgca 
tataagaaaa 
ggccaaacct 
ggcccggcac 
gatgaatgaa 
tgtgtgaggg 
tcctggggcg 
tgtctgtaac 
acgtggtcca 
gcctctgaag 
aaaacaccca 
caaagtgttt 
ctcagagtga 
agattcacgg 
cctctctctc 
ccatgtagga 
tgaacgagtg 
aaggcaagag 
ggttctatct 
cccaaccaga 



ttactgcctg 
ttagaaggaa 
gtttctaaag 
ctttttaatc 
agttccgtgg 
ttgtgggcct 
actctcaaga 
agagggactg 
ccaggagccc 
atggggatga 
gtaccccaga 
tgttgttgtt 
aaaaagagga 
ggcaggcaac 
tgacccccag 
aagtaaacag 
ggaggcagtg 
ttcaggcacc 
cctgccatct 
ggaggaaatt 
aacccacgtg 
ctcaagcact 
gggagtcatt 
aggagacgcc 
gccactgagg 
ctcccagggg 
tttatgacat 
ggaccaggga 
attttttttt 
cctgtgctga 
agtgtgatgt 
aaacttactg 
ccatctccct 
gaagaggcca 
tacacggagc 
catgccgcag 
catcaagcac 
cttacagatc 
gggctgcagc 
taggaaatca 
gatgttattt 
atgcagggct 
gagggagggt 
ccccgccccc 
gcagcttaca 
agaggaatga 
ggccgagaga 
aacagggccc 
tggatcatct 
agatgtcaac 
tcagcttcag 
ccgcagaggg 
gacccctctt 
gtatgtcacc 
aacgaaagaa 
ttcagcatga 
gatgcggagg 
ctggcagaaa 
ttttcaaggt 
cctggggcct 
ataggaaccc 
caggaggcat 
gggaggcagg 
gagaattcaa 



cacttgactt 
aaaaaaaaag 
agctgttttc 
atagcaggag 
ccacagagag 
gtgcacacat 
tgaggtgtgt 
tgtgtgtgga 
acatgccaac 
gggaccaagt 
ctcagccgcc 
tttggaagtg 
gagggaggaa 
aagacaatga 
ggcttagcca 
ggaagagttg 
gccaggcagc 
tctccttccc 
gagagaggcc 
ggtctttagg 
acagagctcg 
gtaaggttta 
gcctaccctt 
agacctggca 
aacttgcata 
acaactgggc 
agaaaggcct 
aggcgagcat 
tctgctccat 
aattagaccc 
agtggcagga 
tctcttaacc 
cctctggtgg 
gactcgggat 
tgatttattt 
cttcctgcgt 
gctgccfctgc 
atgacctcct 
ctctctatgg 
gacgctgcta 
aaatgttgcg 
gtggacgggt 
ggggacacac 
gccctcactc 
gggcttgttg 
aagagaaagc 
atccatgacg 
gccacaagag 
gtccttagag 
agaggttccc 
tcggaacagc 
aagggcgggg 
gggttgatct 
gctggctcag 
aaactttgtt 
caactcactg 
gaagtggatg 
atggctgaga 
ggtttttgga 
gtgcagatgg 
aggcactaaa 
ttgagggtaa 
agtcccaatg 
gatggcagag 



gacctaggca 
ccccaaacca 
ttcccaagct 
tcccaattag 
cagaggcaat 
gtgtgtcaac 
gcaccccagg 
ggggtgttgc 
aaacaggctg 
gctttgcaag 
acccagggcc 
ctgatgtcaa 
aagggggggg 
ccacaaggtc 
atatagctga 
gacatgggtc 
catgcccacc 
tgggtgccta 
actgggactt 
gacactgaag 
catgacaact 
ggagagcccc 
ctgaactcta 
ggggagcaag 
caagaaactg 
aacagatcct 
cagtctcagg- 
cttctgctcc 
gaactcatac 
cgaaaaatag 
gcgcaggatg 
tctcgagtcc 
gctggtggaa 
gccatgtaag 
acctcccgtg 
ggcacggcag 
caggatgaca 
gccccgtgag 
gggaggctga 
gtaattaagg 
tcatttaaag 
ggcaattagg 
cttccggtaa 
cccacccccc 
cccttaccca 
cagaaaagtt 
gtccgcgcgc 
cttcttgagt 
cacaagccag 
taaagaatga 
cggaccgtta 
tcacccagca 
gctcctgaaa 
ccagcttatc 
tcaagagcag 
cccgtgacaa 
acaacagctg 
gccttgggct 
aatgtccttc 
ccctgtgttc 
cacacaatgt 
aggagtgtaa 
ccagcaaata 
ggacaggctg 



aaaatagcag 
caacacaacc 
ctacagcgtg 
cgtgttgggt 
attcagcata 
gcacatgtgg 
ccttttgatt 
tcacagtgca 
cacactcttg 
acagcaggca 
agctctgatc 
tgcagaattc 
ggacgggacg 
cctgtaacta 
gacccagtct 
tccattctct 
gatgccatcc 
gaggtgctgt 
ggtagagagg 
gcagtttcct 
tgccggctct 
agatgtaagt 
acatggtcca 
cctttgtttc 
cctctgggcc 
tccaggtggc 
atttcacaca 
ccccaaacaa 
agttttcaga 
gttagctgac 
ggctgccagg 
ctgggtttct 
agcgtaatga 
tacacagcat 
aaagagacaa 
caccccctcc 
gcagcccact 
gtctgtcacc 
ggaaatgaat 
agcgaagcat 
agtgtcctgg 
ctggcacggg 
ttgctgctgg 
acttctttcc 
gaaaatcacg 
catattaggt 
atataaccct 
tgccacctgc 
gcctgattct 
tgcttctatt 
ccttagctct 
ttgccactcc 
aacagtgttg 
tccccggtgc 
agtgtggtgc 
ccagtacttg 
tatccaggtc 
gcatccctcc 
ctgtgggttg 
ctggtgctgc 
ggttgtatct 
ggactgtgta 
caatggactc 
tctgagtttc 



cgagatacag 
cggaagtgtg 
gtggctctaa 
aatctttcaa 
aagccctaga 
cttgtggagg 
ctcaaagctt 
gaaacccaaa 
ctagtacatc 
cagagttctg 
tggcttgacc 
agcagagtgg 
ggccgaggcc 
cactaactgc 
tggtgctgtg 
ctcctcatcc 
ttctgggagg 
gtctgcatcc 
ttctccacac 
ctgttcagtg 
caactcccat 
atggatggga 
gcttttccat 
tgacccattt 
tctcctggac 
tgcagtgaca 
ctgtatttcc 
gtgtgggaat 
taccgaggag 
aattacttgt 
gctgcagtct 
tgccgggatg 
atcaacgctt 
cgccagccac 
caatcatcat 
ctctccgctg 
ctcagcctct 
cgaaaaccac 
tccgtaggta 
aggtgcgtga 
tgcttcagtt 
tagagctcac 
gcagctctgg 
tgacagctct 
ttggaagaaa 
tcggatctgc 
gtggcaacag 
caggagacag 
ccaatattga 
tccatgctaa 
gctgttctcc 
ccccaccctc 
gctcaagttt 
tttcaagatc 
caactctgat 
gctgttgtgg 
caaacagagt 
tcccctcctg 
tgtgcctttt 
tgttgagatg 
gactagaagc 
aagagatgag 
tcctggtcga 
ctatggctgc 
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accgataaat 
gactagaaat 
tcacactgtg 
gggacacatt 
gagaaatggc 
ctagcagcca 
ctaatacata 
cacacacaca 
ggctcagcaa 
catggtggct 
acagatacaa 
gaaagaaaga 
aaagaaagaa 
acttccaggg 
gaagctgtag 
gtagacaacc 
caacgcctca 
taggacactc 
agacctatct 
catccaactc 
atctgctctg 
gaggactttg 
ctctaccctg 
ctgtgcccac 
agactgagaa 
aaatgtctta 
tttgtgtgtt 
taactaagga 
caggtgtgtg 
tgcatattag 
tttgagacaa 
accagaagaa 
atttgaactc 
agtccttgag 
gcttcagcct 
cttccttccc 
cttcccttcc 
cttcctttcc 
taatcctggc 
nnnnnnnnnn 
nnnnnnnnnn 
cgacaacacc 
ccaattttta 
atagctcctt 
gtcctagaac 
atgatagact 
aaagtcatcc 
ctcaaaaaga 
gggaggcaga 
ccaggacagc 
agtgaacccc 
tgtttaccca 
tgtttgaatg 
ggcagtattg 
gagtttaaag 
tggcctgctc 
ctgtttcctc 
tatactacac 
tctagagatg 
cctccatctg 
atgagacctt 
tggtggtggc 
agaaaggctt 
aataatgatg 



ggtcataagc 
ctagctcaag 
acatgatgct 
catggaccgc 
ctggtagtta 
tgttgggcag 
catacataca 
cacacacaca 
ttaagagcac 
cacaaccatc 
tgtactcata 
aagaaagtgt 
agaaagaaag 
ctataggcaa 
acaacctgtg 
tgtggcctgg 
ccctctgtgc 
tacatcaaga 
ttacatgtga 
actatgtgac 
agagtcccct 
gatctgcaaa 
aaaccctttt 
aagaagtccc 
cattgtgtct 
ctgcctttat 
tgtttgtttt 
tgactttaaa 
ccaccgtacc 
gtaagcatgc 
tgtctttttt 
ggcattggat 
aggacctctg 
acaatgtctt 
cctaaatact 
tttcccttcc 
cttcccttcc 
ctctctctct 
tgtcttggaa 
nnnnnnnnnn 
nnnnnnnnnn 
ggcgccgctg 
aaaccagtgt 
actttgtttt 
tcaatgtgta 
gatggtgtaa 
ttggctatgt 
aataaatgta 
ggcaggcaga 
cagggctaca 
aacagtactg 
ggtgtaccca 
agagaaaagt 
gctggagagg 
cttcctccac 
ttggctccgg 
ccccacccct 
tggtgatctt 
tcctttcatg 
aactttagga 
ggagcctgca 
agtagtagaa 
tgggtgcaca 
acgatgaaaa 



agagtagagg 
gttgtgtgtg 
gtggtctgca 
aggttacaca 
agtatgcttg 
cttacaacta 
tacatacata 
cacacacact 
tgactgctct 
tgtaatggga 
tacattaaat 
aaacgaggaa 
aaggaaagga 
cctgtggcct 
gcctggggaa 
ggaagctgta 
ccaatttcct 
tgatcttgtc 
ggtcactcca 
agagtcatct 
gcctgcccgt 
gccacatctt 
ctctgaggtg 
ggggagaagg 
cacctaaaat 
tccttttcct 
cttctgagat 
cttctgatcc 
ccagctttat 
taccaacttg 
tttttaatta 
cctattagag 
gaagagcagt 
gctatatggc 
gggattacat 
cttttccctt 
cttcccctcc 
ctctccccct 
cttgctctgt 
nnnnnnnnnn 
nnnnnnaaga 
cctccactgc 
ttcaagccgg 
gctttgacgt 
aattaggctg 
aactccactt 
tgtgagtttg 
ttgccgggca 
ttttctgagt 
cagagaaacc 
ccggacagtc 
caaggtgtgt 
ttcccatcag 
tgatggggag 
gaagaccact 
cctcagttta 
ctccctgtct 
gactgtattt 
gctgggacct 
aacttcttgg 
ctttgttaag 
ctctgatgaa 
tggctataac 
taattctctg 



aaaaccacag 
gcagggttgg 
gatgtgttgg 
tgctatttaa 
ctgatcttcc 
actatgactt 
catacataca 
ttaaagaaaa 
tacagaggtg 
tctgatgccc 
aaataaataa 
aattcctaat 
attctgaggg 
ggggaagctg 
gctgtagaca 
gacaacctgt 
gttctctaag 
tcaagatgtt 
caggttctag 
agagatttgt 
gggaactccc 
ccaaaaccat 
gcttttagag 
gacccaaggg 
cggtggtcgc 
ccgctccatc 
gtagcccagg 
tttcttccct 
ttgagactat 
gctatattcc 
tatgagtaca 
atggttgtga 
cagtgctttt 
acatattggc 
gtgagccatg 
ccctttccct 
cctcccttcc 
ctttcttttc 
agaccaggct 
nnnnnnnnnn 
agaagaagaa 
catccacctg 
tacactgaag 
tttgtgacat 
gccttgaact 
aggaggcaga 
aggaccaacc 
tggtggcaca 
tcgaagccag 
ctgtctccaa 
tggtgtcttt 
gagcagctct 
ctcgggtgtg 
gtgtagcctt 
gggctgatct 
tctaaaattt 
gtattccctg 
cagtttacct 
ggctcagaaa 
cttaaaggtg 
caccctgggt 
cagttagtta 
tcagtggcga 
tgactgttct 



acagaaattc 
ttcctgggtg 
gctgcatcca 
aaactccaag 
agaagacctg 
ctgagctcca 
tacatacata 
aaattctggg 
ctgagttcaa 
tcttctggfcg 
gaaagaaaga 
taaaaaagaa 
agaatctgcc 
tagacaacct 
acctgtggcc 
ggcctgtggc 
gacacatgcc 
taacaaaatt 
acatattttt 
gtccaggaca 
cagtggtcct 
tttcctcttt 
aggcaggtct . 
ccagtgctga 
aaggaccaag 
ttactcctca 
ctggccttca 
ccacttccag 
gattcaggct 
cagcctttct 
ctgtatctgt 
gccaccatgt 
aaccgctgag 
ctcaaactca 
gtgtttggct 
tttccctttc 
cttcccttcc 
tttcagagag 
ggcttgnnnn 
nnnnnnnnnn 
gaagaagaag 
agacaggact 
tagtagtccc 
ggtgtgatgt 
tgcttctgcc 
ggtgggagga 
ttggctacat 
cgcctttaat 
cctggtctac 
aaacaaaaaa 
cctaagtctc 
atacccagag 
tgaatactcg 
ccggtggaga 
tgttaccaac 
acatgttacc 
ccctctagtg 
cttgttctct 
tttctaaggc 
tatttctgac 
ggtggtggtg 
ttcaaggccc 
gagacgtgct 
ccccacttcc 



atgccattga 
ttcaaccttt 
tagctaccct 
ggaagggcta 
agctctgttc 
aagctctctt 
cgtacacaca 
ttggagaggt 
ctctcaacca 
tgcgctgaag 
aagaaagaaa 
agaaagaaag 
ccttttccta 
gtggcctggg 
tggggaagct 
agcatcatgt 
atcaaatgca 
acatctgcaa 
gaggagccac 
gactggctgt 
taagggccct 
tggagagcta 
. cagcagggca 
actatcgctg 
caggctctat 
tttttgtttg 
gctcactatg 
agtcctgggg 
ccatacttca 
ttctttcttc 
tttcagacac 
ggttgttggg 
ccatctcgcc 
gaatccttcc 
tctagccttt 
ctttccttcc 
cttcccttcc 
tttctctgtg 
nnnnnnnnnn 
nnnnnnnnnn 
acaacaacga 
caaatccaga 
acttgggatt 
agtcttggct 
tcctgctggg 
tcagaaattc 
gatatcctat 
cccagcactt 
agagtgagtt 
caaacaaaaa 
ctttcaactc 
gtgatacggt 
gtccccagtt 
tgggctttgg 
agaattggat 
tgatcaaaaa 
gtgctggctg 
ctctgctgac 
actgagcctt 
ttagtatgca 
gtggtggtgg 
atctagaaaa 
tcccatgaat 
ctctctctca 
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ccctagctct 
cggaacgtta 
tggctagtca 
tcagcaagaa 
gctaagtggt 
ctacactgtt 
aactggaatt 
gggatatttg 
gtagaggcca 
ctctcgatca 
ttggccttga 
agaggcctcg 
gcttgttgaa 
tctgccaagt 
cttggagctg 
taagtggcat 
atcagctgac 
ggattctgac 
tccctgcatc 
ctccccagca 
atctcctcag 
ttgatgaaga 
ccacccatgg 
gtaagaagga 
ctggggcgga 
tttgaagcaa 
ctatctcagg 
gctcttagaa 
ggagggagtg 
gccaagtcat 
cccctaggga 
ggcctgagtc 
ggtttctggt 
tctctaaaag 
ttcagcgact 
ctagacctcc 
ctcgagctag 
gaaaaaaaaa 
gtggctggct 
ttagctcagg 
tgtagccctt 
acagagcgcg 
gcactgcaaa 
gctggtctaa 
ttggttagtt 
ccaatcagag 
tgggtgtggt 
ctcgtgcgtc 
ggaagagaca 
agcaaccccc 
aaccgcggta 
ggggagggga 
ccgagtagtg 
ttggagaagt 
cgccctttgg 
gaggggtgcg 
cctggcaggg 
ttatgtaagg 
aaaggttagg 
ctggaaactc 
gtcacctttg 
ctgactgcct 
cctctctccc 
cgttcccacc 



tatctaccga 
gtcacttccc 
ctgaggaaaa 
agaagctgac 
ttcaagaagt 
ttgtagatga 
tgtttggcta 
gggacacttc 
gggtggtact 
aaacatcaac 
ggtgggtggg 
agtgaggagg 
acttttcttt 
cagtcccggg 
gggcaacaca 
ctgagaagtt 
ctaggaagga 
ccactaggct 
atccgcagga 
gtggcctccc 
aagtctgaat 
aaagatcatg 
cggggttgag 
ggatgtggtt 
aggcaggaag 
gcccagcagt 
cttccctctc 
atgactcaaa 
aagaggtggt 
tgggttcatt 
aggagccgcc 
agagaagcgt 
tgatactact 
tgagggcgaa 
aagacgagac 
agtgaagatt 
ctctccagtg 
aagtcctatt 
tccatcttta 
cctcctgtga 
taaggacttc 
ctgcattctg 
aatacacgcc 
gcaagtcgcc 
ggtaggcgtg 
cccagacgcc 
cacgttgccc 
acgacgccgc 
gagcggtcgg 
gcacccagca 
accgggtgat 
gagggggcgg 
tgtctggacg 
aactggggag 
tttgaggatg 
gaggtgcatt 
aaggggtcgc 
ggggagggga 
gtttcaccgc 
tacccgcacc 
cacctccccc 
gggaagccgg 
cacctccrtgg 
tccctctggc 



atccctgcac 
ttcgcctact 
cggcagctgc 
agcaaatgtg 
ttcttcttaa 
agaaaacacc 
attcagtggt 
tggtcgtcat 
gaccttccta 
agtgctaagg 
aggaggttag 
gaacagggct 
cttccaagct 
acttcgcact 
tttacctggg 
aaaagtgggg 
agagagcaac 
tatcttccac 
gcccctggag 
cactaactga 
aagtaaggga 
agaagcatac 
ggggaagggc 
agggctgatt 
tctccttaaa 
caccttagtg 
atttctttgt 
agtcaagaac 
cccagtcttg 
tcggtttttg 
tcttcattgg 
ttcggagggc 
ttggtaaaga 
tcctttgtta 
attcactaga 
gggcctgtgc 
aagactgggg 
tgtggaaaaa 
gttctactca 
atcagttttg 
cccgcgaccg 
ggaagctgag 
tatacttcct 
taggccgagg 
gcttagagaa 
tgaatgggcg 
agcaatgggc 
cagctgatcg 
ccgtgcggac 
tcaggtgggt 
ggggaaagta 
cggggacagg 
tttaaagaga 
gaatatgaga 
cccgagctga 
gcctctaccg 
tgggcggtcc 
aaggagtagg 
gggttgggcg 
ccccctccca 
ctagtatgtc 
ctggctgggt 
ccaactcttg 
cctagtttca 



aagcacccat 
tcagtattgt 
ctgtgggcca 
ccaactgtgg 
ccccctacaa 
attccgaagc 
tctcaaccag 
aattgggatc 
ctatgaacag 
ttgagaaatc 
agagtggaat 
cctatgctag 
tggctagagc 
aagtttgtcc 
tcttcccagg 
trgagtgggta 
tgaagcagca 
agcctttctt 
cacccacatt 
cagtggtgac 
cagatgttgg 
attttacccg 
agaaaaaagg 
cgtcccctgg 
gacacaatat 
gagcccatac 
ctcagatgct 
cccaggccac 
tccctttaaa 
cccatccccc 
tctccacctt 
gggactgaat 
tcttccccta 
agaacgtgcc 
aaagatttca 
gggtgacatt 
ccgtgcgagt 
aaaaaagact 
ctcctgttgc 
aggctaaaag 
agtcagagat 
tgtcaccgta 
tctgctcttt 
gttagccaca 
gctcctccag 
ggagtaagca 
ggtgattggc 
gagactggag 
aggtcgcagt 
gtgatctggg 
gggtcctgac 
ggcgctcttg 
gagtcccgga 
ggccagaggg 
ccatttagcc 
gcgctgactg 
ggcccctcct 
gggcggcggt 
gaggttgggg 
gcctaactgg 
cggtagagag 
ggggcgcctg 
gcccctcccc 
gtctccaact 



ggggtttaca 
gtttccagaa 
ccagcccatg 
gtctgctgga 
gaaaccacaa 



tcactgccag 
ggtggatttg 
atgtgctact 
ggcagccatg 
tcaggctgaa 
aaaatcagaa 
ggataatgga 
cctgctcaat 
tggagtgacc 
agtgggttaa 
aaggcaggag 
aagagctggt 
gtttaggctt 
cagccggccg 
aggaaatatc 
ggagaggcgt 
ctatggttgg 
agatggagaa 
ggtcgaacaa 
tctgaacgtt 
ttaatttaac 
ctcacttaga 
aagtttctct 
taagcaattc 
gcctttcaga 
tgaaatcact 
gggtgttaat 
atttttaaaa 
ccttgagaag 
ctaaacccac 
tgtccctata 
gacagtggtc 
tcgggtgttc 
ttcgcgtgct 
aagttccaag 
cagtttaaaa 
agaacttcat 
aaactgtagt 
ccttttcagc 
gcaagggggt 
gaggtgctgg 
cctgggtggt 
ccggtgtgtg 
gattttgctc 
gacccggtca 
ggccacaccc 
ggagaggagc 
caggagtcgt 
ccgggggcgt 
tagggaggat 
ggtcagggcc 
ctcgttccct 
gcggaggcct 
ggggcggaca 
ctgtcttgga 
gcccctagcc 
ggttagtcat 
acggcctccg 
catttggcct 



gaatctgggg 
gtacccattt 
ccaagtgagg 
tttctactgt 
attttattat 
tcagcactgg 
gactccccag 
ggcacctagg 
tacataaatt 
accctgtcat 
gggccaccac 
gaatagggca 
ttcccccaac 
ctgactccag 
aagtcaaaga 
gaccccatca 
ccaggagact 
gggctcagtt 
ccaggacagg 
tcccattcca 
cactcttggg 
ggttccatta 
ggacagacac 
tcagctgtta 
gaactcagga 
acagaagcgg 
aagcttagat 
ttgggggtgg 
agcagctttt 
ctctgattgg 
tccctaagta 
cttagaaccg 
agacgcttcc 
ccgtgggctc 
gagggataga 
ccccgaagac 
cctatacccc 
tgctgcatcg 
ccaccttcgc 
aaggaggggc 
atgccaactc 
tgaccggaat 
ttgacgtaaa 
cattggccag 
ggcctccttg 
cgcccccgag 
tcattcgcag 
ctgggcgctg 
ctctgtccac 
tcccgggggg 
tgcccttctg 
ctggactctc 
ggcagaaggt 
ctaaccccga 
ctggacgagc 
agttcaagtc 
cccggggatg 
tatgcaaccc 
ggaggagtgc 
cagagagaag 
cgggcttggc 
cgctgggctc 
gttaggctaa 
gtcaccctgg 
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ctgttagagt 
gggtcagtgc 
tcagcagcca 
gagactgatc 
ggaatttgaa 
aggcaaagtc 
gaaccaccct 
ccatgctgta 
cttggaactt 
cagatgtacc 
cagccaccct 
tcctgagagc 
aaccctcctt 
atttctgaag 
atgccgaaga 

aggggcacaa 

gaaggtactg 
acccagcttt 
ccctcacctg 
aaagctgcca 
gcagcagctt 

tggggtgatg 
gcccagtgaa 
ctccccaaga 
tgtttactga 
cacacacaca 
cacacacttt 
actccgttct 
tcgagacagg 
tggcctcgaa 
tgccaccacc 
tgggtttagg 
ccatagataa 
tcttcttctt 
caccccctga 
gtcccaggca 
ccaaggcttt 
tgccatttat 
gcaagaaatt 
cctgattcat 
tcctaacaga 
ggagttctag 
ctaatccagc 
cgggctgggg 
cgcttgcttt 
atggctctgc 
caacagaggc 
acattggtca 
gcctcagtga 
tgaagggcct 
aggagcatga 
gtaacactgt 
aacatgcacc 
ggtctgcaat 
ctgctataca 
caaaataaaa 
tccctgatca 
tggcagtacc 
gtatcctaaa 
tggaacgcag 
tagggcttgc 
gtcttacaga 
gtatgagttt 
accatccaat 



aggctagaag 
cctgggccca 
gtgaaccctg 
acctaggaga 
agttgcccac 
tccttggccc 
ctcgggctcc 
gggccccggc 
ggaacaaagt 
cggaatgtgc 
acccatacct 
ccctgctgac 
gctgctgaat 
tacatttttt 
tgctgggctc 
agcaagtgca 
aaagtcagct 
cctgtttttt 
gctgagccgc 
gttgaagaac 
tgcctatcat 
gctttatggg 
agatttaggt 
ttagttgatg 
gacttccttg 
cacacacaca 
cctttccatg 
aagtagtcat 
gtttctctgt 
ctcagaaatc 
gcccggctgt 
ggtacttacc 
gtaggcagga 
ctcgtggatg 
ctactcaccc 
agctcgtggg 
ccctccacct 
tttctgcctt 
gtcaggggag 
tctcacgtat 
agatgggttc 
cctgttcccc 
cccactgcag 
aggctgagtc 
ctttcctcct 
tgtcctcatc 
aggcacaaga 
gagaggcacg 
aagacacgtg 
ccaggaagta 
gccacctctg 
gggaaaacat 
aagccctggt 
cccggcactc 
atgagttctg 
taaagcattg 
taatacaaca 
aaccattgtg 
ataaggattt 
ctgtttgttt 
ctaagtaaat 
agaaaaattg 
gtgactgttt 
tgagtggaaa 



ctgtcatggt 
ccccgccccg 
tggcctgaac 
aggaagatcc 
cctagtgtaa 
ttctatctgg 
cagccctcta 
ttttactgct 
tcagacgtgg 
aagcggaatg 
ccctcccctc 
cttccaactc 
tccctaagaa 
tttttaactt 
cttagcaggt 
tttagaagcc 
agagccaggt 
tttttcctcc 
agtagttctt 
tgttgccctc 
ccggaaggtg 
agggaaaccc 
taaaggcact 
tctgtgtggc 
taggcctgcc 
cacacacaca 
aggtccaaaa 
ttgtgtattt 
atagccctgg 
tgcctgcctc 
atttacattt 
cttacacctg 
gggcattaaa 
gaaacgaaac 
agctcagggg 
aggctagtcc 
aaagcaaaac 
tgctttccag 
atgggttgta 
gggcttggtc 
agggggtaca 
atttgtgaga 
tctaagctga 
ggctcacaca 
cccttccctg 
caacatggag 
cagtggagga 
tagaagcctg 
cttccagcca 
tggtcccatc 
aaaggaaata 
atctatgact 
tctgtcttct 
tggaggtggt 
agccagcctg 
gttagtaatt 
agcaaatgaa 
agagatgcct 
ggttataaaa 
ggaagtccat 
tatattcatc 
agtggaaata 
ttaaaatatg 
tataaatact 



gccagagagt 
cagccaaggg 
agagctatcc 
gacaaagttt 
tctttccact 
cagtggccat 
gcctgccacg 
gattcatgcg 

cctggcatct 
ctgcctttgg 
tagtgcccct 
caagtcattt 
gggacttggg 
tgccaagagt 
tcttgcttct 
ttggatggcc 
ccttcctttt 
cagtggcaag 
tgcccctggc 
acagaactgg 
tggtcctctg 
gtctataaat 
agtgggaaga 
tttcttatct 
catacacaca 
gtaaatgtac 
actttttttg 
ctgtcctgga 
tgcctcccaa 
ctttatttat 
tggatttttc 
agtccaccag 
agctcttcac 
gattaggatg 
tctactggct 
tgtagctctt 
gatagtgaga 
tgatatgagt 
tctgattgtg 
ggaggctgtt 
aactgaaagt 
gggataggat 
ttgcgacaaa 
gccacagtgt 
cctcagaggt 
cctggcctgg 
gagaacacca 
tctcctctca 
tctaccctgc 
cacagcaaat 
ggggttgtag 
gcattgcata 
ggcaaaggag 
ggctatatga 
caaagaaagc 
agccagaaga 
ttggacctgg 
tgttcatctc 
tccttttctg 
tgatggtggt 
taaatattga 
aacacatact 
gaatactaaa 



tgatggagca 
cacctgcttg 
tgggcagaga 
atacttccca 
ctctgaaaat 
gtccttggac 
cccccagccc 
ttggaactgt 
agacagcctg 
ctagtcctga 
tcagctgtcc 
cccatttcta 
gagttgatca 
ttctacaccc 
tgccagctcc 
tattcaagaa 
tctgggtcgc 
aggaacctgt 
ctttatgtcc 
ttcgtggagg 
ggtgggaagg 
gggagcccfct 
tggggaatag 
aatagaagga 
tcatcatcac 
catacacaca 
tcaggaaggg 
tttatttgtt 
actcactttg 
gtgctgggat 
ttttagtctg 
cacctgtata 
tggtgactca 
atgaactgtt 
gaaggaaagg 
tctcaccatg 
ggttgggttc 
ctctgctcaa 
cccttctgct 
gttcaccttt 
tgttgtattt 
cataggggag 
gtgtaaggga 
gattgccctfc 
gtccctccag 
gagaaagggc 
aaccacaagg 
ggaaagagag 
ggacctgcct 
agtttctata 
tcaaaaagag 
ctcagttggt 
aaactgaaca 
cctacattca 
gactgtctca 
agatgtggct 
aggctcctgt 
tagtttgctg 
agggttgtaa 
ctgtcatgaa 
gttctgtgca 
atactaaaaa 
tgtaatatat 
aagattatga 



gctggtcaga 
gcacaaactc 
gaagtggaca 
agaggctttt 
agaaatccca 
tgactgtgca 
cctccctgag 
gggggcgggg 
gaattcatac 
ggaagctgcc 
tccctcagac 
accctacaca 
cagagctcat 
tgccctttga 
tagtctgtaa 
cccctcatta 
tggccctgtc 
gcctcccaca 
tgacccagct 
aagaggagaa 
tctggacagc 
acccccactg 
gtgactccac 
aaagtctgtc 
catgccaaca 
cacacacaca 
ggacattgaa 
tgattgactt 
tagactaggc 
taaaggcgtg 
gcccagattt 
atggggaatc 
gagcctgggc 
gtccttcccc 
ctatggttaa 
catgggtggt 
tagcaaccac 
tactgtgcag 
gcctctagct 
ggcccagtct 
gacaggagga 
actagatcat 
ctgtagcaga 
ccctcgacct 
cactgggtac 
agcctggaag 
gcctatccgg 
cagccagcca 
tcctgggaga 
aacagcctca 
attcaaatgt 
aggtttgctt 
ggttggccca 
aggtaatcct 
aaaaataaaa 
gaaaccgttt 
gccttgtgtg 
tcttagaaat 
tagagaaaaa 
aatgtatagc 
gccatccaaa 
gattataaaa 
ttttttaaaa 
aaagtatgag 



109860 
109920 
109980 
110040 
110100 
110160 
110220 
110280 
110340 
110400 
110460 
110520 
110580 
110640 
110700 
110760 
110820 
110880 
110940 
111000 
111060 
111120 
111180 
111240 
111300 
111360 
111420 
111480 
111540 
111600 
111660 
111720 
111780 
111840 
111900 
111960 
112020 
112080 
112140 
112200 
112260 
112320 
112380 
112440 
112500 
112560 
112620 
112680 
112740 
112800 
112860 
112920 
112980 
113040 
113100 
113160 
113220 
113280 
113340 
113400 
113460 
113520 
113580 
113640 
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tttgtgactg 
aaagtggatt 
attttcccct 
ttaaataaaa 
tgcctgtgag 
gccgtctctt 
ctttgtgcac 
ctggagctgc 
ctagaagagc 
catgtttaga 
aagaaaatgt 
tgtgtgtgta 
ctccaccaca 
tacacacagg 
agaataagaa 
nnnnnnnnnn 
nnnnnnnnnn 
aacatagctt 
ttaattrtta 
gttaatgcaa 
taaaagtcat 
atgcacacat 
taaataattt 
tgctgctttt 
ttagaaaagt 
cgtcagcaca 
caactcactt 
cacctgaagg 
aagccttagc 
tatttctcga 
caaaaaaatc 
ggttaggggc 
actctgacta 
ggttgtatag 
aaagatttat 
gatcccatta 
ttggaagagc 
taaaagttta 
tgtgtttgaa 
ttgattcttt 
cagctgagct 
agaaggaatt 
gtatgatatg 
cagatcacta 
taatcaaaat 
cctgtaacct 
aggacaacag 
gaacaaaacc 
actctgggac 
agcttccacc 
ctagcggaga 
gggtgcatga 
gaagccatgg 
gtaatgaatg 
gaaacttgga 
ttaaaagaaa 
aagcatcaac 
tcgggctggt 
ttcaaatccc 
ttctggagtg 
aagtattaaa 
ttttgttcct 
tgctaaatag 
tggcctaggc 



tttttaaaat atgaatgcat acttgtaata 
caaaatgtta agaatggttg tttttgtatg 
tgttttttct gtctttctaa tttttaaata 
tacaaaagac aaataaatgt tttaaaattt 
tggcaggaat ccaacattgt cttctgaaag 
cactccctct gtaattttta aaaaatatat 
cagagtgtgg gtgcacattc tgcagaggcc 
acgctgtttg gatcttctga tgtggatgct 
agcaaatgct cctagccact aaagccatct 
ttttggagaa tttgcttgta ggaggatggg 
ttgcttggga tacctattgc ttcttgagtg 
tgtgtgtgtg tacactggag ctaggaatca 
ctaaagtcaa ttctatgaac ttcattaatt 
aagcattcct ctgactttct gactgtcagc 
agaagggaaa tgctcaaaac ctgtcactct 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnncccc tattttacca 
aaagaattct agagaaaaaa aaaaaagtcc 
atgggaaaaa gtaatatcaa agaaatccat 
acaggacaga ttatcttggc caaattaaac 
ctatttatac gagatgaaat gtgaattagc 
acaaacagag tctttttatt cttaagccta 
taagctctaa aatttaaaaa atagtgaaac 
gccattcttt ctgataaaag tcccaaaggg 
aaggagcaca tcctatgagc tggtgacctg 
cagcaatgtc ctggaccatt tgtgaggagc 
ggactggtac ggggtgaggc tggcgatggg- 
ctgcagcagg ccagttccgg caggttttcc 
tgcagtggaa cctatgcaaa cacacaggaa 
agtgaatgta caggcaccat ggcacgcatc 
caacttttgt gagacaggct cccttcattg 
acaagctccc aaggagtatc tgctctccta 
ctacacctgg cttttaggtg ggttctgagg 
caaattttta ttcaccaagc cagacaaccg 
ttttatgagt acagtgtggc tgtcttcaga 
cagatggttg tgagccacca tgtggttgct 
agtcagtgct cttaaccgct gagccatctc 
aaattaattc tcattatgca cgtgccacag 
gtatatatgg ggatgtagtt atgtgacggc 
cctttcactg tgggtcctcg gagcgtcaat 
atcttgtggg ccaccaaact tatttttaaa 
catgtaattg aaatagaaaa gcagagttga 
gtaagaaaaa ccaagaaaaa acctctacaa 
tgttcaagga tggtgtccca gaaaagggtc 
gatcccatcc agacctgcta cagagaagct 
caagtccccg ggcagcacag cagagctgtg 
gatatctgag aaggagtccc agtgagcgtg 
aacaacttca ccatgatgaa catcatgaag 
acagtgtcac acactacagc ttccacctgg 
tgggacacaa tgtcacacac tacagcttcc 
ggctgcaagg gtggagggca ggtacaaggg 
tgtgaaactt acaaagaacc aataagttag 
ccaacaaccc cagcactctg gaggcaggca 
ctatgtctaa agagatgata aagtcccatg 
cagaaaagcc aaatgtaaca aatgcatggt 
tttcacaaaa aggcacagaa tgtaaggtga 
taagaggagt caggaacagt gtattttaac 
gagatggctc agcggttaag agcaccgact 
aacaaccaca tggtggctca caaccatcca 
tctgaaggca gctacaagtg tacttacata 
aaatatgcaa taattggaat tattctttgc 
gttgtttgtt ttccgagcct gtttcatgta 
tggaagatga ccttgaactt ctgattcccc 
tactctgcct gactcaggct cagtgaattt 



tattttttaa aaaaacactg 113700 
gtgggatagt acaattgtga 113760 
ttgtgcattg ctttcatatg 113820 
ttactctttt atgagtgttt 113880 
cagctagcgc taacttctga 11394 0 
ttgtatgtta tattatgtgt 114000 
agaagagggc atcagattcc 114060 
cagaatcgca ctcaggtcct 114120 
ctccctctag tcctcattgt 114180 
ctacaccaag tgccaggtga 11424 0 
tgtgtgtgca tgcttgtgtg 114300 
tatccagggg ccttttcaag 114360 
gtctgaatcc acctactctc 114420 
cagctaagga ggtgtggctt 114480 
ntggggnnnn nnnnnnnnnn 114540 
nnnnnnnnnn nnnnnnnnnn 114 600 
agcattgcag gataaataag 114660 
actaatgttc aatgttttaa 114720 
gtctacggca tattaatgtg 114780 
cacagtaaca cgttttgaac 114840 
gtgcgcgcgc aggtgcacac 114 900 
ccaaataact ctttaagcag 114960 
cccttcaggc tatgacagaa 115020 
tgtcataatc tgtatccttc 115080 
agttttacac ccaagtcaca 115140 
cccacgctgg tcctgagcaa 115200 
caccactttg gactgtgtcc 115260 
aaatggcttt gaggcaccat 115320 
aagggcagtc attcccgttt 115380 
tggaggtcag atgataatct 115440 
tcttcccacc acacttgcca 115500 
agggcactgc catcagagac 115560 
atttgaacgc agatcatcat 115620 
ttttttaact ttttttaaaa 115680 
cacaccagaa gaagacatca 115740 
gggaattgaa ctcaggacct 115800 
tccagccctt atttaacttt 115860 
ggaagtgtaa cataagcaca 115920 
cagaggacaa ctctgtgaag 11598 0 
agggtcagca ggcattttac 116040 
aagctagaag ttggttaaag 116100 
agagggagga gaggaggtga 116160 
gtaaatctag tcgtgttcca 116220 
ctaaaacaaa gcccaggaag 116280 
ggggggaagg agggagaggc 11634 0 
cccaatgatc tccgtgatac 116400 
tagcagaagc cagaggcctc 116460 
atgtgtggac aaaagggtgc 116520 
gacaaaaagt cacacactac 116580 
acagtcagat tatttttctt 116640 
aacaggggtg agtgggatcg 116700 
gaaagagaga tgaagacata 116760 
ggagggtctc catgccaaag 116820 
agagttacaa agtcagagga 116880 
aggtggggaa aggtggggat 116940 
aagcgagcga ctaccagata 117 000 
tacctttata aaaaaaatac 117 060 
gctcttccaa tggtcctgag 117120 
taacaaaatc tgatacggtc 117180 
taataagtaa ataaataaat 117240 
aattctacta tgaatggagt 117 300 
atccaaactg gcctggaacc 117360 
tgtctccacc tcccaagtgc 117420 
tcaaacactg cttcaacctt 117480 
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gacccctaac 
ctggggcttg 
ctgctgctgc 
gtgcaaaccc 
tggctcttag 
tttttgtctg 
gctttcagtt 
agtaaagtca 
tcatcaacag 
gaaaacctaa 
gtttacaaag 
tggtggaaaa 
gcagggagag 
aggaggggga 
tgtatttgcc 
aattgcctac 
tggctctgaa 
caaaggcagc 
acaggcagga 
cctgctgggc 
tttggcaaaa 
aatacattca 
acagagagaa 
agcccttcct 
cagtctatgt 
tgaaggaaag 
acctatcttc 
gcacctcacg 
tctccttcca 
cctgctgagc 
agcctcattc 
gactcaattt 
gacccgagtt 
gggagcagag 
gttcagtaag 
ccgctgcagt 
ttgaagctgg 
acagtaaaca 
tttaatccca 
tctacaaagt 
aaaaaaaaaa 
aaattttcat 
aagtctccaa 
agcagtcgct 
aagaggacca 
tgggatgcgt 
ggaacaagag 
catctttaac 
agttcatgta 
ctctaattat 
agtaaccttt 
gaattccaga 
agctgcccac 
aaaaccattc 
tgcaaatcaa 
tggtcccaag 
agctatcatt 
caaacccacc 
agcctgccat 
ctctgccagg 
ggtgaggccc 
caatatttct 
tatgaafcaat 
acagcacaga 



ttccatctct 
ctggctggtg 
tgctgctgct 
agtcaagaga 
aggagcagac 
cgcatagtaa 
ttaatgaggt 
ctgggctcaa 
gacggtgtgt 
aagcagttca 
ggttctaccc 
tatgagttga 
atgagaacag 
ccgagcacgc 
cttcaccctt 
cactatttct 
ggctgaagcc 
taaaaaaaaa 
agtctccctt 
ctcctgaggt 
ttactcttaa 
gcacaaagtt 
aaagtgagac 
ttacagctca 
gaagcatttt 
tagaaatgag 
cagcgaagtt 
aaggtgcttg 
ccagagtcct 
cctttcccca 
tcagaaccat 
ctataaaaag 
ttgttcccac 
gtaggcagat 
ggagcctgta 
ggcatctgct 
aggtcaaggc 
tatattttac 
gcactcggga 
gagttccagg 
aaaaaaaaaa 
tttttataaa 
cttgataatc 
aagtccgggt 
gaagttcaag 
gagacacacc 
ttaagtgcgt: 
atctttggct 
cttcatcatc 
atagaacata 
atctcttgca 
gcattgtaaa 
aagacctcaa 
tacaaagcag 
gctatcccag 
ccatgaggga 
cttacagtct 
aacttcccaa 
ggtagtgcac 
agttccatgc 
tgtctcaact 
cagttcctac 
aagtatatta 
tacaggacac 



tgggtcccat 
gtgtcgctgc 
gctgctgccc 
tttcaacacc 
tgcaccacgc 
agactggttc 
tctccctgaa 
tctgtgatac 
tcgtggcagg 
attgtctctc 
tttcccccaa 
aagagtctca 
agttcagaag 
ttctaaagca 
tatacaaatg 
aaaataacga 
agcctcgttt 
aaaatgtccc 
gccactgagg 
cccaactcac 
gtagctcaag 
gagaagatta 
acctatttac 
gccttcccat 
taaaatcagc 
cacacagtga 
cttgccttct 
cagctatgtg 
agggatggca 
gcctctcctt 
ttctgctgcg 
ataactgagg 
atggggcgtg 
cccaggcctc 
accacagaag 
catagccaca 
cagcataggt 
tacaattaaa 
ggcagaggca 
acagccaggg 
aaaaaaccaa 
aattacagtt 
tgaggctggg 
agtcccttgc 
gtgctcctcc 
agtaacagca 
tactcccgta 
gagaagagaa 
atgcagttaa 
agatgtggta 
aatctggtac 
tagcagcaat 
atcaatcagt 
taacgtaggc 
gaagccagat 
actagattac 
attatggcaa 
gcaagcattt 
acctgtaatc 
cagcctggtc 
cccaaaccac 
tgacagaaaa 
tacagtacct 
acacacagct 



cttacccatt 
agaggctggg 
atatcctgga 
agcagtaaat 
tgcacaccct 
catttccctt 
gccttgccca 
taccactatt 
atccattaga 
aagtgctttg 
acaaagcaac 
ggtaccgtga 
cagccgtgtc 
gcaaataaga 
tttaccaagt 
gtttatgggg 
aggctgctca 
agagctcctg 
ctctcccttc 
cgggcatgct 
gaatgagacg 
aatagaattg 
ataggagagg 
tatcactcaa 
aagagaaagg 
cctgctgcat 
ttaaaaagaa 
gagggcagag 
caaaggcacc 
ttctacagtc 
catgtctgac 
agccggcttg 
cgtgctagag 
aatagccagg 
acactgtgcc 
gctacccagg 
accataggta 
aaaacaaggc 
ggcggatttc 
ctacacagag 
aaacccaaca 
ataattttta 
ggtgtagctc 
ctgtatccca 
tcttcaggta 
actactgcct 
actgtctgca 
aacctggtat 
tttgtaaaca 
attaggaaaa 
aagagacagt 
tgcccttcaa 
ctatagctaa 
tccgtttata 
cgtaattctt 
agcaggctcc 
gtgagctctg 
tcttaacaaa 
tcagcatgtg 
tagacagttg 
tgaaaacaag 
tggacacaat 
agtttaagtc 
gcaagcgtga 



cccagactcc 
gaagggactg 
gatcaaaaca 
accttaggaa 
gtgtctgcaa 
tcaactgttt 
tttctccttc 
tccatagcaa 
agagaaagca 
gcttcaaagg 
cttattttgc 
agtgctagtg 
aaggcaacag 
cgcgccaccc 
tgaagaatgt 
tttgatttta 
cctggagtgc 
aaacactaaa 
tccaactgta 
acatgcccat 
taattgtgtt 
attatgcttt 
ggcccaggct 
ctcagacacc 
tgagttgctc 
aagacacagt 
tgtggtattt 
accatcctga 
gggcttgaca 
tccaaattac 
tgaactggat 
gttggtagag 
aggtgaggac 
cagcccaacc 
gtgtaaccac 
aggctgagcc 
gacccccatc 
cgggtgtggt 
tgagttcgag 
aaaccctgtc 
catgtaccta 
gttgaacata 
atttagtaga 
gtgcttgggg 
atgaggagtc 
ggagtgctca 
caatgatgga 
tcctcagctc 
accaatcctg 
gctactaatc 
cccaaafccaa 
ttaacgcatt 
ggaaaaatct 
ataacctgtt 
aggctctgct 
gccctcggtg 
ggcagagaaa 
cacaaagaat 
ggaggcagag 
caagatcaac 
taagacgata 
taggctgggt 
tgagcaagat 
aggatttaaa 



cattctgtgg 
cttgctgctg 
ggccaattca 
aacccacctt 
agggctcact 
agaattgaca 
aagaacctgc 
caaatcgatg 
gggtggtaag 
aaaaggagac 
aactgacagc 
aactgtcaca 
ggagataaaa 
tgtcggagtg 
taacattgta 
cttctgtgat 
ttgtagatga 
actggtgcgc 
agttctaact 
ggaattcatt 
gctggagaca 
agttcatcct 
actgacaatg 
cagcccagtt 
agtagtacac 
ttaaaaggtg 
gctggtatgt 
gggaatcggg 
gatgccttta 
ctgaggtagg 
atggtcactc 
aaaacataaa 
ggactaggca 
tgagctctaa 
ggcaagcggc 
acaagactca 
tcaaagttga 
ggcgcatacc 
gccagcctgg 
tcgaaaaacc 
ttctaacata 
atacaatgat 
gtacctgcct 
aacaggcaga 
tgagccagcc 
ctctggacca 
gacagtacgt 
gttctggcta 
gcagcaataa 
cacttaatag 
atgatggcaa 
gtaacgcagc 
ttctaaagcc 
ttgggccacc 
ggctacacac 
acctgctcat 
aattcacaaa 
aaataaatag 
gcaggcagat 
aacgctatat 
tggatcaatg 
ataattccaa 
attatcgcca 
ggcacccatc 



117540 
117600 
117660 
117720 
117780 
117840 
117900 
117960 
118020 
118080 
118140 
118200 
118260 
118320 
118380 
118440 
118500 
118560 
118620 
118680 
118740 
118800 
118860 
118920 
118980 
119040 
119100 
119160 
119220 
119280 
119340 
119400 
119460 
119520 
119580 
119640 
119700 
119760 
119820 
119880 
119940 
120000 
120060 
120120 
120180 
120240 
120300 
120360 
120420 
120480 
120540 
120600 
120660 
120720 
120780 
120840 
120900 
120960 
121020 
121080 
121140 
121200 
121260 
121320 
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cctacagtat 
catttttgct 
aattcttact 
gcgctggact 
ctttgctgtg 
acaatactcc 
aaaaaagcac 
ctcacagaaa 
aacaagtcca 
gacagacctc 
taaagacaca 
gctggggtac 
cattcccagt 
gtgtacccag 
ttgggttcaa 
aaccacagag 
gggagggggg 
gtaaacaagt 
aattttttta 
ccatgtgggg 
taaagaaacc 
taggagccca 
aattaaaaca 
gttaagagca 
agctaacgcc 
gcccgcaaac 
gattctttca 
ctctgcagac 
ggaattaaag 
aaagcgtttt 
acataattaa 
gattcactca 
atgaagactt 
tctttgtgtc 
cttcctacct 
tggactacaa 
tataagtgtg 
caatattagg 
cttgaagcaa 
acatacacac 
gcggtgagga 
gacaagccag 
atgccaggga 
agaagctgta 
gatatgggtc 
agcacacacc 
tggtccacca 
gtggctgcaa 
agaactggga 
acagaaggag 
aaactggtgc 
aaaaagacta 
gaagcaagcc 
tgttaaaaaa 
aggaggcaga 
agtgggttcc 
agagaggaag 
cagcgtfcatfc 
aagatgaaaa 
gctcattctt 
gactagagtg 
aactggagtt 
aggcctgaag 
ttacaaacag 



atgcagcatt 
caatatggca 
agcctgggct 
agcttccaga 
gtgaaaggtg 
tggcttggta 
agagctcaca 
tcaagaagaa 
cttcagaggc 
actcagaaca 
gaagatgggc 
gctgtaacaa 
taacccagca 
gaggtaattc 
attaggatca 
ttggttaaat 
ggagggaagg 
ttcaggctca 
aagaggacag 
tacaggagca 
ttaaactgtg 
cccattgcac 
aacattaatg 
cagcgcccat 
ctcttttggc 
attcaaagaa 
tttttttgag 
caggctggcc 
gtatttaaca 
gttgtttttt 
aaatgccaat 
cacaacttat 
ctggatacat 
agacattagt 
attgcagtgg 
ggcagcaagt 
tgtgggcact 
catcaagagc 
aagaaactca 
acacacacac 
actgtaaaaa 
ggcctctgcc 
ccactctaac 
gagagttgat 
agccatgctc 
ggacctggca 
tagtggctgc 
agaggacaaa 
agtacccgct 
agactggtta 
ccatcactac 
cagagtacgc 
agactggaga 
aaaaaaaaga 
ggcaggtgag 
aggaaagcca 
tggatccaag 
aaatgacagc 
aataaacaca 
gttatccaaa 
agacactgct 
tgagggaggg 
cccgaaggtt 
tttaaacaca 



gactgtctag 
gcagtaataa 
gtgtttgctc 
ggatgtaaat 
ctgtttctga 
acaatgccaa 
tctgagccaa 
aaacaagcaa 
caaaaaccaa 
caagctatag 
tgctggcatg 
gcagaggaga 
atcacttctg 
actgcggctt 
agcacacttt 
aaacaacaga 
gggagaagct 
gtcggggtat 
aaaaaagaca 
gcatttgcat 
ttatctataa 
gccatagttt 
cttgttaaag 
atggaggctc 
cttcaagagc 
aaatacaaac 
acagggtttc 
tccaacccag 
cacacattat 
gttaagtttc 
gtgaaataaa 
atacttaaat 
tagaaacgta 
gtgagtgtct 
ttctcgctaa 
cccagcaatc 
agccttgtta 
tcfctaactgc 
cagatggtca 
tcacacacca 
taaggtgtca 
ctcatcagct 
atggagctgg 
ttgatggatt 
tagtgagtgg 
agtcaataaa 
tcgtggttgg 
agtctctgga 
gggacagaag 
tcggaggaaa 
aacagcagta 
accctggcta 
aaaggaaaca 
aagccagaca 
gcaggaggat 
ggcaacaaag 
agcaagagag 
agaaaagatg 
agttctaaat 
caaactaaag 
ttaaaaaagc 
agtgggcccg 
gaactgagaa 
ataccagcaa 



ttttatttcg 
aafcgtatatt 
actaactcca 
ctaactttgg 
agccacgaca 
aaaataccaa 
aaaaaccgac 
acaaacagca 
aaggcaaagg 
ccatgatact 
actgggaaga 
atgtctaact 
gcaaaaaaag 
ctggtggatc 
tatagtcaga 
acagccacac 
agaatcccag 
aaggtaagac 
gcgtagagaa 
ttggttactt 
agggcagcag 
tgattacaaa 
aaacagggct 
tcagccatct 
actgcatgca 
tgctataaaa 
tctgtgtagc 
agatctgcct 
acatatcttc 
aattaaaaaa 
taatatatac 
acaattttca 
ccctgaaaat 
aaacttgcat 
acctggaacc 
ctcctgtctc 
catggctgct 
taagccatct 
gagtttacac 
aggcttagtg 
ggaaagctct 
cttagtaact 
gatgggctcg 
ctaaagtagg 
ccccacaccc 
acaaaaacaa 
gagctgtccc 
actgatcaac 
cagactctga 
tctgaaacat 
attgaattgg 
tcatcaaccc 
aaaagttatt 
tgggggtgca 
ccatgagttc 
aaaccctgtc 
agagagcaga 
ggcccgacca 
atcattttaa 
caggggtggc 
aatagatgca 
ggaagcctgt 
catctcaaga 
taaattgtca 



cacactttga 
tgcattttga 
gtggacttct 
ttctcggcct 
gtcccatggt 
aaaaaccaaa 
actccctatt 
cataacaggc 
ggctcttaaa 
gctcttcatc 
gaccagtgtg 
gtgtctgcca 
cccactctgc 
ttttttcctt 
gcacggagac 
cagaactgct 
cattcaggat 
tttatttcac 
ctagctatct 
ggcgaatagc 
tgggtgcaca 
gtcacgcagc 
gcagagatga 
gcttctccaa 
catggtacgc 
cccatatatg 
cctggctgtt 
gcctctgcct 
cttcttcttc 
ctacatagtt 
atatataaca 
caataatgaa 
cgcaaatgac 
aaaggctctc 
tggggctcct 
acctttcttg 
gggactagaa 
ttctaccctg 
acacacacac 
accactgtga 
gcacaaaatg 
atggttgcct 
agaggccctg 
gggaatcgat 
acccatgagt 
aaacaaacaa 
gcacttatgg 
tctagagtct 
aggtgatcag 
aactcgacgc 
gcacaacatt 
aggcgattct 
taacaaaact 
cgtttttaac 
gaggccagcc 
tcaaaaatca 
gagagtgggg 
atgacatccc 
taatggctgt 
cgggatttac 
cgctaaccat 
gcctataaac 
caaagcacag 
gctttatgac 



atcatccatc 
tgcatggtgg 
gacgtaagag 
cccctgaagc 
ggtttgagta 
accaaaacca 
ttttgaagaa 
taacaacaac 
catttacaag 
catcaattct 
ctcactcatt 
atcacaggca 
ccgcacactg 
cttgcagtgc 
aaaaggatct 
ggggggcggg 
gcagaggttg 
aaaaataaat 
tttatgtaaa 
cctggtagga 
atgcagtggg 
tctactcaaa 
tggctcagtg 
ttccagggga 
ttacctacat 
accatcttaa 
ccagaacttg 
ccacagtgct 
ttcttttttt 
ttataggcaa 
ttctgtaata 
aagctttgaa 
ggttttcatt 
ttctctatca 
gttccttggc 
gaaccaatgt 
ctctggtctt 
attagaattt 
acacacacac 
aaagggaagt 
gtgtcctctg 
acagcaaacc 
ttattaaagg 
ttcctttatc 
atgtgtggac 
ataaacgttc 
caggaagaca 
gcttgttatg 
gacagagatc 
atactggtcc 
cagaaaacag 
ggcactattg 
tcccagagca 
cctagcactc 
tggtctgtac 
ctgactgggg 
gttrtggatgt 
agaaatggca 
gtgtctgctg 
ggccagccag 
taatcagcgt 
ccaacctgcg 
gcacaatctc 
agataggctg 
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acaggcatac 
cgacacagct 
cacagcatgg 
tggaaaaggg 
agggcccagc 
gtacaactat 
gaagcacaaa 
agttgagaca 
ctctttacaa 
gccagaggcg 
ggtccaggga 
ctacaggagc 
tcagggggct 
gagttcagtt 
taaacatttc 
aaaaaatctc 
agcaaagcaa 
aaaccccagg 
accgaaacag 
accattcact 
cgatgaacag 
gatagctgcc 
gatccaaggt 
ggaggagggg 
tgctggcctc 
ctcacccagt 
acaaagaaag 
gatgtaggct 
tgctatcacc 
tcctcagcta 
cacatctgta 
acttaaaaat 
nnnnnnnnnn 
nnnnnnnnnn 
tgtcatcttg 
attttattaa 
ctgtgccagg 
gtaacatggc 
gcttacttzat 
agcgatggga 
ctaacctcag 
agcagagccc 
ggcgggaaga 
ttgtgtgtgt 
tgcatgtgtg 
catacaatgg 
gaacacattt 
caggaaatgg 
ctttcctggc 
caggcaccag 
aaataaaaac 
gtggatattt 
caaaaactgg 
ggtccaaagt 
tcttctggag 
aaaaaaaata 
agggaaagcc 
aggatcacag 
agaccccggt 
gccacaaaac 
tggtaatttt 
aactcactct 
tggattttga 
ccaaagaaaa 



cacaaagatc 
gagcctagtg 
ccagagtgac 
aagaaggcaa 
atccctctcc 
aaatactcct 
cataacactg 
tcaaacacac 
agtctccaca 
gcggcacaat 
gcactagtga 
ctagacaccc 
gatgagatgg 
cccagaaccc 
tatcctccaa 
tcgtggacag 
atccaaatat 
tcgctttagc 
aaaaggcatt 
cttaataaaa 
gaacacaaac 
agactgggct 
gctgcctccc 
tttgacacag 
aaaactcttc 
tttaatgcca 
atcacacaga 
ggtctcaaac 
catggtggca 
ccatgggtgt 
accccaccac 
aaataaaaac 
nnnnnnnnnn 
nnnnnnnnnn 
aagggttgaa 
aaaacaaaac 
gaacccactg 
tgcttacgca 
aatatcaaac 
aaaataacct 
ctgcagactt 
tgtgcctgtg 
gatggaaagt 
gtgtgtgtgt 
tgtgtgtgta 
ctatgcttat 
ctaaaatttg 
gtaggagagg 
accaactagg 
gcaggaaagc 
ggagcagaag 
tgtgagttcc 
gggctggtga 
tcaaatccca 
tgtctgaaga 
cctttaaaaa 
acactcgatg 
cgggacggag 
ctcaaaacaa 
accttcttct 
ttgtttgctt 
gtagaccagg 
tttgggggtg 
aacacccccc 



gggaagaaga 
acaggccgtg 
agggtcacag 
agagcaccac 
cacctgacag 
tcctagtgta 
taagttacca 
gtttgatcct 
tttataaacc 
ggtggctgag 
attcaagtaa 
tgtaaggtca 
ctcagtggct 
gtatcaggca 
gcacagacac 
acagcaaaaa 
gcgaaaagga 
atccaaaaaa 
tgattattaa 
atgttaccaa 
tggctgggca 
ggcatgagaa 
ccccccccct 
ggtctcactc 
tgcctcagtt 
tttaacagta 
caaacatcac 
ttgcacaaga 
tttcggaagc 
gaggccagcc 
tcaggaggct 
ataaataagg 
nnnnnnnnnn 
nnggaaaaca 
atcactgagg 
tccttctgga 
ccagcagaca 
gacaccacac 
ataagtacca 
gcacatggtc 
gtgggtactg 
ccagggaacg 
ggagctaggg 
gtgtgtgtat 
tgtatgtatg 
gtgtctttta 
gaatacttgt 
caggagagat 
tcactcacaa 
ggccccacat 
ctgggcctgg 
gggtctacat 
gatggctcag 
gcaaccacat 
cagcaacagt 
aacaaaacaa 
gcgcaagcgt 
gccagcctca 
aacaaaacaa 
tcccaatcct 
ttcaaggcag 
ctggcctcta 
tgtgggtgat 
ccccaaagta 



aacgggattc 
ataagctcaa 
caatacagca 
gagagaaaag 
agaaaaccag 
gacagagttt 
gaagtttaga 
aggggaatta 
aaatacactt 
gcagcaggat 
acattaaaca 
gtactactaa 
aagggcatgc 
gttcgcccac 
acataaacat 
atactgaacg 
taatacaatg 
ttcaatcatc 
taaatacagg 
acaggagaaa 
cagtggcaca 
ccaagggtaa 
gtgtcactta 
tgtggcacag 
ccagagtgag 
caaggttaaa 
acaggtgctg 
aaagaatact 
agaggcagga 
aaggcagcac 
gaagcaggaa 
ttggagagag 
nnnnnnnnnn 
cccattagac 
aatggctaat 
ggagtactgt 
cccaacttca 
acacttcctg 
tgtgagcgct 
agtaatgggc 
agaacggacg 
gaggacgctg 
cccgatgacg 
gtatgtgtat 
tgtgtgtatc 
ccacaagtaa 
tggggaagct 
ggcttagtgg 
ccaggatctg 
ccctgcaggt 
aagtccttta 
agtaaaaact 
tgggtaagag 
ggtggctcac 
gtacttacat 
aaactcccac 
gtgatcacag 
gctacacagg 
aaaaagcgtt 
tgaatgctat 
ggtttctctg 
actcacagag 
accacaggga 
tgaacatcaa 



ttgcacaaca 
aacatgcaca 
acagagacag 
ttaacccgcg 
gaggacgcag 
accaaagggt 
cacgttgaga 
aattattggt 
gtatcagaat 
caagtctgga 
ttaaataaga 
gataccaaaa 
actgctcttg 
cfcgtaatgta 
aattaaaaat 
aaatctaatc 
caaatccaga 
ataattcacc 
gaaagcattt 
gaagtaaaat 
tcctgaaacc 
gaggcaccca 
aatccgtaat 
gctagcctga 
gggaaacaat 
cgctgcctga 
tttgagacta 
"tttgccatca 
agatcagtag 
gtgagccagc 
gttcaagcca 
ggnnnnnnnn 
nnnnnnnnnn 
tgctgaagag 
gacaaaactg 
gctcccttca 
aggataatca 
tatgctgggc 
acagggcctg 
aatccactcc 
tgactaagtc 
ctcaatgggc 
tgaatatggc 
gtggtgtgta 
caaaatggtt 
aaaattttaa 
atctgagccc 
ttaacacaca 
acagcctctc 
aaaacatttg 
cctccagcac 
tgtcgccaag 
cacccgactg 
aaccatccgc 
atattaataa 
taaaataatt 
tattctgtag 
ccggtctggg 
cacattatca 
cgaatttggt 
tgtagccctg 
atgtaattct 
cacttgggcc 
ctgtatataa 



ttttacaaat 
ctgcaaacac 
taagaaacat 
attccatatg 
ctgaaacact 
atcgtaatct 
atttaataga 
aacagaaaag 
tgttgtctga 
gaagcccctg 
aagacggatt 
ccaaactgac 
cagaggacct 
ccagagaccc 
aaatcttaaa 
aggtagactc 
cttatcccag 
gtattagcta 
gacaacattg 
ctcctcaacc 
ccagcaccca 
aacacttaat 
taaatttctg 
atggagtcaa 
catgagccac 
aaagaacaag 
ggtctctagt 
agatcaaggt 
tacaaggtca 
tgtggaagca 
cacaagagcc 
nnnnnnnnnn 
nnnnnnnnnn 
actgagaaat 
cttaaggaga 
gcattaacta 
agcttgctac 
atcacctcta 
gccaatgaac 
ccagacactt 
aattcacaga 
agagcttcag 
taatgccata 
tgtatgtgtg 
ataaaaatcc 
gtaatcttag 
cagcatgaat 
catggtggcc 
ctggcctcct 
tactgaaact 
cgggaagcag 
taaaacaaaa 
ctcttccaaa 
aacaagatcc 
ataaatcttt 
ggaaaagtcc 
gtaaggcaag 
ctacagtgtg 
tatactcaag 
tacttttttt 
gctatcctgg 
ttttcaaagc 
ccaaaccaaa 
aactaacagt 
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tcatataagc 

tctttcagta 

atgtaaacga 

ggtcagggaa 

cgtcctcacc 

tgcccaagcc 

actttaaaag 

aagaccttgt 

acaatggcaa 

agggaaggac 

actagtctat 

ctctgcatta 

tgttcatttg 

aaccgcatgg 

ggaacticggg 

cccaattcac 

caggctagct 

gtttccaccc 

ttcacaccaa 

atatgcactt 

tttactaggc 

accagctctg 

tctgtgcaat 

ggtaagtcct 

tggcataagg 

tccccaaggt 

acccactggt 

agcgaataaa 

taaagtaact 

taacggttcc 

acagtatcta 

taggtgattc 

ggccgtgcta 

gcttaaaatg 

ccaagtaaac 

ggcttcgcac 

cctcccttat 

aatctgcccg 

ttaccccgta 

gctgcccagc 

ggcctatctg 

acaccgtagg 

tgggaaattc 

tcttattgca 

ccaccgctgc 

aagcaacaca 

gtgtacatcc 

aaatcacacg 

ctttttccat 

ctctgtctct 

tctccggaac 

cctcccaagc 

tatagttatt 

gtgtgcagtg 

cctactgtgg 

tgagttatct 

tcatataaac 

cccctcacca 

tactcactga 

tctttaggga 

agccattcag 

agacaacctg 

ctcacaggct 

atgactaaga 



taagtagcct gcaggtacat ggttatggaa acagctttat catacagact 129060 

gatgccattt gagaaaaaaa aaaaaacaaa acattctttg gattttcaac 129120 

aaatatgtta aatcttaaaa aaccttaaag cagacgccac tgctctttgg 129180 

gcacggctgc aggcccccag agcagcacat tccctgaggg aagccttgtg 129240 

agcaccaaga acagccaact ataataagct cataaaaaat cctgaaaggc 12 9300 

tgaaaatctg atatgaaaac agagggcaag acagataaaa ggcaaactat 129360 

tcaattccag ctatctgtgt ggaaggactt ctcgggacgg actatctgct 12 9420 

gggtgattta aacagcgtgg agggcagaaa aagaaaggaa agtgcaggaa 129480 

aagctcctcc tccgtggcct ctacacgcat ccaagcgata tggagggagg 12 9540 

actatacaca ggtaccaatc cccacgaact agaaaacaca gcttttacta 129 600 

tttttttttt accttagcta gtgtctttct tatgtttggc ataatttctg 129660 

aataaaccta gtgtataaga aggcaaatga gtaaaggtaa agtcagttaa 12 9720 

ttttgactgt gtggtgtgca tgccaggcgc attcgtggag gtcagaggac 129780 

agcaggtctt ctccttccac caccccaggg gtcgcagggg tcccagggat 12 9840 

tggtcagatt tgcttggcaa gggcttcact cactgaggcc ccccaagggt 12 9900 

ctgtttactc taatgtatat atttttgaga cagggtcttg ctatgtagtc 12 9960 

ttgaactcac tgtggacact agctggcctg gaggtctgag tgaacctcct 130020 

aagtgctggg acaacagaca gcaaccatca aggcaaaatg aagtcttccc 130080 

agtggcttcc tatgaccttc tcctgacaac cccaaacagc aagtgcccgg 130140 

gcgtttctct ttttttactt caagctttga ctccactttc ctaaaaggtg 130200 

tgaagcacac ttcaggagac accctgtagc tctggagtga ccggaaacac 130260 

atgcaaaaac aaaaaacagc gatggggatg gggcaaaggt ggcttgtgct 130320 

ggcattccct gacgcattac ctcacaccag atacatacag aaaagacaaa 130380 

cactcagtgc gcgcatccac cacacccaaa gcaagcattc aacagctcag 130440 

ccagcattat cacagcgcgg acaaacacaa agccaacctt ctctttgagt 130500 

acatagtgtt ctgtccaagg atggcctctg cagactcctg gaaacacgtc 130560 

tctcttgaaa atctgcaata tttgcctaaa aaacacaact ctattatttc 130620 

taacaaaagg aagacacccc aaggacgagg" gaagattatc ttttcctact 130680 

tccaagggag aaatgacatt gccgccacac acagcccccg actctttggt 130740 

cccttatgag gaagtattca cttctggtca ccccgtgttt cacttttagt 130800 

atgaattaca ggagccactc aataccttta tcataaagtg ggctttgtgt 130860 

tgaccaagcg taggctatgc aaatgctctg agcacactta ggggaggctg 130920 

gccctgctac atgcaggtaa tgctacgcca gtcttacgcc acaggaagaa 130980 

tggtcgaact caatctgcta gaggtgtgag tttagacctc gggagacggg 131040 

ttccgcagat gtcggagcat cagacagagc tgtcccctcc tgatgcaaaa 131100 

ggcaagtttt taatgagcct ccatggtaat agtgggttcc tctctctcct 131160 

caatatgctt gacatctttt agttttttga gacagcaaat aacgtagccc 131220 

catacttact gggcagccac agctggccct gaactcctcc tgcgtctcct 131280 

acaagtgctg gggttacaga catatgccgc catgttcagt gtaagtgacc 131340 

agggcaaagt cttccttatt tacaaagcag cagccaagcc ctgcagccca 1314 00 

atttcctcag cacaccccca agggtctcac cgatggcagt cagtccatga 131460 

cttctcacaa tccacactac tcaccgataa gatcatgcgg tatttgaaat 131520 

ccggtcacac ttctcacagc ggtacaaccc attctgctgg tcaatcactt 131580 

gtcctgggtt gggcaggcct ggtacataca gttctctttg cggagaaaca 131640 

cacagtgctg aaatagtccg cctgcaggcg aaaggaagac cgccatcagc 131700 

ggtctggaac aggcaactca aagctgctct tccttgcaag tggtgagcgc 131760 

tagctcccac gctcacgagc gatatgcaga atcactaact ctggtttcag 131820 

tgctacacgc agaacccaaa gaagtaacaa accggcactc acgcacctca 131880 

tttggagacg gcggctcact agctagcctt gaactcagag tttggtctgc 131940 

atagagtgcc tggctaccac acctgggcac tttgttttcg agacagggtt 132000 

tcactctgta gaccaggctg gcctcaaact cagaaatcca cctgcctctg 132060 

gctaggatta aaggcatgtg ccatcagcgc ctgacccaat tctttttatt 132120 

atttgttaca tatgtgtatc agtgtgtatg acgtccatat gtatatgact 132180 

tgcacatgtg tgcaggtcag aggacaactc tcaggagtca gttctctcct 132240 

cgtttgggga actcaggttc caagatagca ggaaaagtgc ctttaacagc 132300 

cgacactgac atgtaatgat tcagtgtgca cacagtggtt ttgcatgtag 132360 

cacagcgtgc atgtgcaggc taggaaacaa cctgtgggag ttggtgatct 132420 

tgtgagtgag ggagaggaac tcaggctgtc aggctcggtg gcagtgcctt 132480 

gttcccttgc tggccaagca ttatttataa gatggtgatg tctaccctta 132540 

tcagtaagtt tttccccaag acaagccaga aaaacctttt aggcccaatg 132600 

tcagtctcta ctgccactcc tcaacctgtc tgtggggcag gacaagccac 132660 

aagacggaag gtgagccaat aaaatcttac tgacagtaac agccagccag 132720 

tgacaggcaa ttcttggact gaatgcgttt aagagaatgc agaattaccc 13278 0 

tcttctaaat ggaaaaatgt ctggttaagt aacccagcaa ggagctaagt 13284 0 
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cacgcaagcg gtggatacct gctttgcctc tgaaccctgc acaggttttg gtttatgttc 132900 
aatcatgtca agtacctaca aaatccagat ctagcctgaa ttcaaagata ggctacctac 132960 
cctgcecccg gacccccacc cggggtctca ctgtgtagtc ctggctgtcg tagcgctctc 133020 
tatacagacc agctgttatc aaattcagag acccacttgc ctcccaagtg ttgggattaa 133080 
aggcatttgc cactatgcct ggctctcact ggttgtacca gaggagcaaa acaatgtggc 133140 
catttaaaga gacgaagcta gaaaccagtt tagacagctt tggggctgtg ggtgtagctc 133200 
agtggaagag cttgcttagc atgcacaagc tgttggtttt aaccctcagc gtgacagaac 133260 
cgaatacata agaaggctta gaggaggggg tggtggtgga gagatgggtt agaggttaag 133320 
agctggttgc ttaatttccc agtccccaca tggtggctca caacatccat aactgcagtt 133380 
ccaggggatc tgatgtcctc ttctgacctc cttggggaca tgcgactcat ttggctcaca 133440 
tgcaggatgc ctcgggccac tatgctggct caggggtgaa ggtgcttgct gccaagctgg 133500 
gtggatttga gtttggtccc tgggacccac aaggaagagt ttgacttcta tacactgagg 133560 
tggaacatgc atgctctacc cgcaaattaa aaacttaaaa tttaaagagg aagctgtaga 13362 0 
gaaatagctt ttaggaggat gcctaaggaa cttctctgcg ttttcaggtg agattcagac 133680 
tcaaagccca atttaaaagt ttgagtgctg tcacgtgttc tgtatgccca gttctggctg 133740 
tccgttgtct gtctttagtt taagagagca actgggtgag aagtaactga gagtctagcc 133800 
gatgtttaat tctcaagatg tcctgtgata gcattataag ttgctgtgga tgacagtgat 133860 
ggatcgagca cacagtgaaa tgagacagtg aggaagaaaa cctatattgt atacgaggac 133920 
aataatggag cagtggcgag cgatattttg ggaatacaat agaaataaat gattcaaata 133980 
tccaagagaa cgcaggttag accaaggtag gaagaggatc actgggctgg agagatggct 134 04 0 
cagtggttaa gagcactgac tgctcttctg aaggtcatga gttcaaatcc cagcaaccac 134100 
gtggtggctc acaaccatcc gtaataatat ctgatgccct cttctggagt gfcctgaagac 134160 
ggctacagtg tacttacata taataaataa ataaatcttt aattaaaaaa aaaaaaaaaa 134220 
aggaagagga ccactgagca cacattgaaa tggaagatgc aactgaaatg caataccaga 134280 
gccagtctgt ggcatggcag caggaattac agtctgttca aaacccagca cggacctgag 134340 
gctacattat gtccctctac actggccgtg actgaaaagc agcaacgtgg taccctggag 134400 
caggcctgag gtcccaggta aggacacagc cctgacctgg cactaggaaa caggtgtaaa 134460 
aactcaagcc cctgccgttc tctgggtgtc tggagcgagg ggtgcgaggt accttgtctc 134520 
cctggcccag gttctcagat ttagcctcat gcaaagtttt ccagttggtg ttgccccctc 134580 
cggcccctcc actcctgtgg tcagagatgg aaacaccatc taaggcttgt ccttctgagt 134 640 
caaacctagg ggagaaagaa acaaacgtac tgctgacagg aacttcacat ccttctcaaa 134700 
tgtgctgtga atgcaccagc cccgagctgc gccccctcgg ctctcaccta ggtctcagca 134760 
cacctagtcc actcaagaaa tgcaatgccg ttgctccttc catctcgtgt ccttcaatgc 134820 
ccctcctcct ccacccgcaa ggtcagacaa actgccaaag taccacagac cctttcccta 134880 
aactgggctc acattgacag cagccatgaa caaaggcagc aacagcaagc cacactggca 134940 
cccctccccc caccctcggc ctcaagcctt caccccaatc accaaaatga accagatgaa 135000 
aagggacatt tttgtttcat gagtctgcca aaaatgtttt catgggagaa aatcttttaa 135060 
gcccaatcag ttatagaact caatcagaag gcattatctg ctgtgttaga gatttgcatt 135120 
ctcagtgaga atttgtttat ggaaatatca aggttaaatt catttatatg aaattattat 135180 
taaaattgac tatacttctt gaccatctat tggctaatct gaaggaaaag aggccaggga 13524 0 
gaaggtagca aggacaggct agtggcagag gacagtgagc ctgagatgaa aacacctcaa 135300 
ctacacgatt ggtggtaact aaagctgccc ccacacagga ctgactccct aaggagactc 135360 
ccaacagagt agcggtggcc caaggcaagc agtcacatcc gtcagaggac aaggtccata 13542 0 
aatccgtctc accactacta gccgctcaag tttaacttca cacaggcatc ggcaaccaaa 135480 
gacggctcat caggagtgaa cagagccagg agacaggcag cctctcttat agtaaaaggt 135540 
ccactcttag agccaagatc taaagatcgc taatccttgc ctgggtaggg ggggtatact 135600 
gaacttgtag atcaatgaca gtattttgtt cgggggacag acatgcctgt ttgtgatttc 135660 
tgtgatcaca gttactgaca gtgaaatatg aacaccttaa tgtagacaca caattaacct 135720 
ggggttgtgc tcagagtctg ggagaaaaac caaccaacaa accagaccaa tgccatggct 135780 
gtctctggtc agacttctgt cctacaatgc aacgctagct atcagcacag ttacacagcc 13584 0 
agacccacac cactgaggat gccttaagtg acagacaccc cagatggatg ctctaagtag 135900 
taaatattgt tattggttta catcagactc aaaacaaaaa tgaggagcgt gaaatatgag 135960 
ctgtttttct tgagtctctt aaattctacg acacagctgg aaaccacaca tgcccaccct 136020 
ggtactacag ctattcaatt tccttaagct tgggctaggt aaagtatttt ctttgaccac 136080 
ctcaatcctt ctacacaaca ccctcaggat gaagtgctct tccaaggcag agtttaatgc 13614 0 
cttaaaaagc tagatatggt gtggaggttc taacaagctc tcctcatctc ctgagtgctt 136200 
tgggaatgac aatgtacagg caacaaagag tccattgttc tcactgtact ttaccgagtc 136260 
cagtgacaac acaacagaag atgtgaccag ctccagagac tcagcagagt aagaaagtac 136320 
attctacccc cacatcagct gcccacagtg caaactgaag atgctctcag cactgttctt 136380 
cagcgcgcag ctgttaaagc cttgccgtac ctggcgggat caaggtcagc acttagattc 136440 
taattgcttt ctctggctct ctatgttcat caggggatcg tgggtttgtt ggtttaagag 136500 
ggctttcagg ccaccatctc agaagacata atgcctcgaa gaagtatagg aaccactcca 136560 
ctcaatagca aaggcttggc aaaacagctt ggcagcagtc cacatgctga cagtgtccag 136620 
ctctggttta ccagagcccg agcacagata acccctgagc tgagttgcac aaaacctacc 136680 
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agccacgaag cttataggcc tctgggatgt caggattcac aatgacagtg ctggatgaga 136740 
ggaccgagag gctccgtcca ccgaagtcag agactcgggc tcctttgatg gccatcacgg 136800 
gctgccgaga gccgtcaaac ttgtcagcct gcaaatcaaa cgcacgcaca catcactgac 136860 
aagaagaatg tacaggatgg ttttatggag aagagtcagt caatgtctgt ggactctaag 136920 
acagacctcc cactcgggaa ggagcattgc actacaagaa gctgcaataa ccgatcatct 136980 
cacacagcga aggtcttcaa atacttactg gatagcacaa ctccctgagc taaagcccca 137 040 
ccctcaggac tcggctcagt gcaaggactc acatcttctc cccacagagt' tgtggtcacc 137100 
accttccctg acatgtccat caaatagata tttctcttag caacttctct gttgttcgac 137160 
ttcactgtga ttttaatcga atcttcatag ctcttgcaga ttccaatgat gtctgaaaca 137220 
aagaacactt tgtaagagct cccatcaagg ctcctcttta aaacacggca gggacgaaag 137280 
gcaagcacgc tagtcacgca tcaccccaaa cactggagac aaaaatcacc actctttgcc 137 340 
ctcaaccctg gacgcaccta ctagtgcgtc tttagccttg ctctctaggt caccgatccc 137400 
tgtgaaatca aactgaactg tgggtaagtg atggccatct tcacagggaa ggacagaagt 137460 
ctcattattg aaggtcatct catagtcatt tttaacagcg gagaactgtt tgttagcgat 137520 
cttcagggcg ccctttgaga agtaatacac ctgcaaaaga ggccaagtca gggcaagcta 137580 
ctcagtccat caaagcccgc cccacaatgc agccttccaa actgtgctca tggctctcac 137 640 
agttcagcct gtcagctctg actcaacacc gtcttcttac cttgttcact tcaataaggg 137700 
gaaagaactt gtccacttgc tcattgaaag cagtagctct gatttcaccc tgcagaagca 137760 
agggagagca cattagaact gcgctgctag gccctcgctc tcaacagcga gagcaggcca 137 820 
ttttgacagt tagacaccat cctcggagca acagtccaag tgtaaagaga ccctcacagg 137 880 
gctgcagaca ttgcagaatc ccacatatac agttacattt tcaagctatc agctctggta 137940 
aaacaaaagc agctcagcca cccgagccta cagttgtaag ttaacacata aaacgaggga 138000 
gcctcgagat gatctggaag acaaaggtgt ttcctzgtgcc aacctggcca ccctagttta 138060 
atcccggaac cctcgggatg gagaagacaa tcgccccacc aagttattct ccagcctttg 138120 
cacacgtgtg catgcataca cacgaaaata attcttacat cttatcaaaa catttttaga 138180 
aggaatagct taatattccg ataatgaaac ctaatattct gggtcccgag tgatgggctg 138240 
gaagcccaca ggaaggcgtg ctgaactcca atgattcagc actcttcctc acgcccgcaa 138300 
cctaagtgac ggcttacttc acagtcagga gctggtgcca aaaatgttcc aacaagtcat 138360 
gctttcaagc tgactgcact gttacttttc tgtcaggcaa tatgttacag atggataaga 138420 
aacttaacta aatggctaaa accttaccca gacaagctgc aggaaattag tttctgatac 138480 
tgaatctgcc atcacagtta agatatctgc tgtgctacac cgtgacaaga ggaagaagag 138540 
ggggagagag ccccaaagct ttgcccttcc cggtatcatg gcttatttta cgtgtgtggg 138600 
tgttttacca gcatgtatgt gtgagcatca ctgagtgtct gatgcctgag gtgaccagga 138660 
gaggtactgg atcactgaca cgggagtcac agatggttat gtgctactct gttggtgctg 138720 
agaattgagc ccaagtcctc tcaaagaaca gcgagtgctc ttaaatgctg agcagtctct 138780 
ctggcccctg cattttgctt taagtaaagc ctagtgagta taaatgatca gaagccctcc 138840 
ccgcagacta gttaaagctg agtagctgct gctccttctg ctggaggcaa acccgccctg 138900 
ctcggcaggt atcaccccag ggcttcaact gtgcccctag caatgtaatt agagcgctgc 138960 
agtctctgca gacggagact atttacagtc caccaatctg tattgctaca gacgcgtccc 139020 
tttaggagca ctttatctca tgtctgaccc tgtgccccag atagcatcaa aggtctccaa 139080 
cagaaggaaa gccacaatga ccgccatgtt cctcggagct gagccccacc cgnnnnnnnn 139140 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 139200 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nntttttatt aattatgtat agatatagat 139260 
atctctatat atgaatgagt atcctgtagt tatcttcaga cacaccagaa gagggcatca 139320 
aatcccattt acaggtggtg gtgagccatc atgtaattgc tgggaattaa actcaggacc 139380 
tctagaagag caatcagtgc tcttaacccc tgagccccct cccttttttt tttttttttt 139440 
ggcagcttat ttttaattgt tttaaatcac gtatatgttt ctgtatgtag ctatgttcac 139500 
ataagcgcag gcactcatag aggtcagaga tgtaagatgc ctttggagct ggacttaaac 139560 
atggttgtga gccatctgat atgagcacca agtggacttg ggtcctctgg accgccttgc 139620 
tgccttccac ctagaattac agtatatgca attagctgct cagccacccc ctcagtcctc 139680 
cttagtactt tcaacatagt gtaaatcgga gagcactcct tggtgcaaag attctgtgtg 139740 
ttctatcatt ctctacaccc aggcttcatt gggacgtgtg atgccagcga tgattcattg 139800 
gcgtgaagta tgagttagag tcagacaatc agtgctctct gctctgagtg ggtctccctg 139860 
cccctgtgtc tctgagacag gggaggattc tcctgttgga tcacatagtg catagagccc 139920 
tgtgtgcaca gcgctctcaa ccatttgttg ccatttcaaa tacagtgact tcagagtctt 139980 
ccttgaaata acagatttct ccattttgtt tgttcccttc tttcgtcaca aagacctgag 140040 
gatgagatgt ttrtgaagga actttctagt agttactcgg tggaaaagga caatgatgct 140100 
cccctcttct acagagaaga aggaaacagg aaattccaag aaaaggagta cacagatgct 140160 
gcagtgctgt acfcctaaggt aacgtgcgtc caaccagcag ttgaacaagg cggaagatag 140220 
aggcaaagtg cagactcata accagtccat tgttttgctt tggttagttg ggttggttgg 140280 
tttgtttgtt tgtttgtttc ttcttcttta ttagtcggat tggatttggg aaaaatgtag 140340 
tagaattcta ttgttgatta tattcacaaa caaaagactg tttatatgtc ctgaattttg 14 0400 
gcaagattct tcccatattc cttcagaccc atagcagcag caagcttagt ggccctttgc 14 0460 
caggtattct cactgagctg tacagcattt gtctgaagaa ctgatgaaca ttttagctta 14 0520 
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tgctcctatg 
aatgcctctg 
caggtgaatc 
gatatgtagt 
aaaaagaaag 
agaaagaaga 
tgttaacata 
tttaaattat 
tgctggccac 
cagatgtgag 
ctcttcgcct 
gtcgttcttc 
gaactgcatg 
tagagtcaca 
cagtgttctg 
ctgagtcatc 
caccacgctg 
ctgagtcatc 
ctgtgttatt 
cagagatctg 
attaaaagaa 
gggaagaagg 
acgtgagctc 
gctccacgct 
ggttcagagt 
actttccgaa 
tgtaatggtg 
cccatggaaa 
ctcaggctca 
ctgcagggaa 
catacacata 
catcagtcta 
atggagaggg 
tgcaatttta 
gtaaaagtaa 
tcataaccag 
tgtgagaatg 
ccactctttc 
caagtaattt 
agatctaata 
aatattttta 
gggtcaagcc 
tgaaagaatg 
tttgttagta 
aagagttgca 
cagccccata 
aacagcctag 
tgactttggc 
atccttgggt 
caagtgggcc 
tgagttcagt 
tctgacctca 
cactcgggag 
atttacagtt 
tcagtggtag 
agagagagag 
aaggaaggaa 
gaaagaaaga 
gaaagacaaa 
gagggcatag 
cctagcaccc 
actactgtgt 
agtaatctgt 
ctagcctaga 



ttagtaaata 
gctgagtttg 
tctgagttca 
aagactctgt 
aaagaaagaa 
aaatgagtgg 
ctactattaa 
tattttatat 
cacacataca 
acacgcctct 
ctacaatcaa 
ctggccagtt 
tgcacttggt 
ggttatgtat 
gatttaactg 
tctccagccc 
gatttaactg 
tctccagccc 
ctaactcttg 
tatgggctta 
tagagtgagc 
tcaggagttg 
tatgagaccc 
accacttgcc 
attgtgcaca 
aaatattaaa 
taagaaagta 
gttgcttaat 
cgagtgctta 
tccaaaaccc 
attttaaaaa 
ggctacacaa 
agatgcactt 
tagagagctg 
taaatgttac 
tgaagtaccg 
tgtttttctt 
ccttcacgat 
tccatagcta 
ttggcataat 
tgtacagcca 
tctgagtgac 
tgggaatgct 
ttatacattt 
aaaatatatg 
atctaccttg 
tatggtggcg 
tccgaaagcc 
acgtcactca 
aatgagatgg 
cttcagattc 
gtacacatgt 
gcagaggcag 
atataaagaa 
agcgcttgcc 
agagagggag 
ggaaggaagg 
aagagagaaa 
gcaaagcaaa 
atcagtgtta 
taccgcaaaa 
actttcacta 
tcttggtttt 
acacacagag 



gctaggtttt 
gtggcacttg 
aggcagcctg 
ctcaataaag 
agaaagaaag 
atttgccata 
caacaacaaa 
gtatggtgtg 
taggcagtga 
cctaggccta 
gctctcccaa 
gagcgcgcac 
acccagggag 
gggttctagg 
ctgagtcatc 
caccaagctg 
ctgagtcatc 
caccacgctg 
cttccttgga 
aaacaagaat 
ctaggagttt 
aaggctagtc 
tgtctaggag 
taacacgcat 
ccctgtatct 
gctacatcat 
tttggcttca 
gaaaacaggt 
ctgcccttgc 
tcttttggcc 
taaaataaat 
gaccttgtct 
cctattccat 
ctattttctt 
tttggaaaat 
ttaacatttc 
ttacaaaatt 
ttattgtacc 
actctgtgtt 
atttggcatt 
gtgttttgtt 
gctgaattgt 
catttcttaa 
atttaggggg 
gtgtggtttc 
ttaaatatac 
ttaacctcgg 
ttcctggaat 
tgtgcacaca 
ctccatgtat 
tgcaagataa 
catggtacgt 
gaggatctct 
actctgtttt 
tagcaagcgc 
agggaaaggg 
aaggaaggaa 
gaaagaaagg 
actaaataaa 
taatgcttac 
tatttgctct 
gcaacataat 
atttttcttt 
aactctcttt 



tagctagcca 
cctttagttc 
gtctacataa 
taaacagcca 
aaagaaagaa 
gcctagtcaa 
tactttacac 
ggaagctgtt 
aggttctttt 
tgtaagcagc 
taaacgtgtg 
aagagtatgg 
cccagaagag 
aatcaaaccc 
tctccagccc 
gatttaactg 
tctccagccc 
gatttggaag 
accctgagaa 
atgtccaact 
ggcacactcc 
agtcttagct 
aggagaaaga 
gacgcccaag 
ttaaaagtca 
ccactctaca 
gttttatggc 
aaaaataaga 
agagaccatg 
tctgcaggca 
cttttaaaat 
caagaagaaa 
gaagctgcta 
tcttttaaag 
atagagaagt 
tgcttctatc 
gggataatgc 
cattttctca 
ctgttacaga 
atgatgctat 
tgggatgaat 
tctcaggaaa 
gcaacactgg 
ataggaaata 
ttttcctgtc 
tgtatataag 
gttacttgag 
tccagctcca 
tacacatttg 
aaaggcagtt 
caggagagga 
gtgtagtatg 
gagttttagg 
gaaaaacaaa 
aaggccctgg 
agagggagag 
gaaagaaaga 
gaaagaaaga 
atacatacaa 
ctagcatgcg 
gtcttgctaa 
gagaatgttc 
gccaaattga 
atgaactcaa 



ccttgctagc 
cagcactcag 
caagttccag 
gaatgactta 
agaaagaaag 
actagatttt 
taaccatgac 
gcagagtggc 
gccaagacaa 
accagttctg 
cagaaggatc 
gttttcacct 
ggcatcatat 
atgtcctctg 
caccatgatg 
ctgagtcatc 
caccacgctg 
cagaactgag 
aattccttct 
cttgatctct 
tatgttccaa 
ttgtgacaag 
aaggggtaaa 
gtttccgttc 
atagtcaacc 
aacttgtaaa 
ctttgcatcc 
caggagggac 
gcaggcagtg 
accacattaa 
gagctctaga 
gaaatgaagg 
ttttggtgat 
taatgcttgt 
ataaagagta 
cggccagcca 
tgcacttact 
cagtattaaa 
gaagaaatgt 
aataagcatc 
tagcacaaga 
ggactagtta 
ttattattac 
tgctcatata 
ccctagttta 
gcatatgtag 
aactgctctt 
agggatgcag 
gtttttaatc 
atgcaaaggt 
ccaacccctg 
cacatgcaca 
ccagcctggt 
acaggggttg 
gttcagtcct 
ggaggagagg 
aagaaagaaa 
aaggggaagg 
tatattaaat 
taaaactctt 
attatattgc 
actatcccac 
tgggtgaaca 
gtttcttatc 



ttacaactag 
gaggcagaag 
cctggccagg 
aatattgcta 
aaagaaagaa 
cttggttata 
aagctatatt 
agttggctac 
gttaaccaat 
ggctcagggt 
ctgttgcagc 
acatttatgt 
cccctggaac 



gaagaacagc 
gatttaactg 
tctccagccc 
gatttaactg 
gctttgaaca 
attggctttt 
gattttatat 
gcttcaggaa 
tttgaggcta 
gtgaggtgtg 
ccagccctgg 
cttggaatta 
agcccttctt 
taaatgttac 
tgcaaagatg 
ggtctcccta 
cacatgcaca 
acgagtttga 
tttgctgtgg 
tatggtactt 
tgttttgact 
aaaaaaaaag 
gagtttttcc 
gttttgtagc 
ttttcagctc 
acttaattta 
tttctgtata 
gtaaagtgtg 
acatccattc 
atactattat 
gttcaatttc 
gtctcacttc 
gtgaataaaa 
gcagagaaca 
tgcctctggc 
ttaggaactc 
ctggatgaca 
cgagttgtcc 
gaagtcccag 
ctacaaaacg 
gggatttagc 
caactctgga 
gaaaggaagg 
gggaaagaaa 
aagaaagaaa 
tttaagactt 
ggcttctaaa 
tagttgtcag 
tcctctgtca 
agtatttcag 
cttttatgat 
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ctccagaggt ttgtttttgt gggtttatta gtgttttttg tttggttggt tgtttgtttg 
tttggttggt tggttggttt tgctttactt tttcttattc atttttttat ttctttattt 
ttttgttttt aatttaatgg actggttcca tgtggccaag gatagcctca actttgtagc 
agaaactggc tttgaacttc tggtcttcct tcatctacct cccaagtgat gggattaagg 
cacgtgccac cacatctaac aatatctggg tttctttatt ggagtttgaa agggattcct 
ccagcattac tttgactctt catagtttct tccagagtta ttacactttc atttgttaca 
ttaagagttt gatccagggc tggagagatg gctcagtggc taagagcacc aactgctctt 
ccagaggtrc tgagttcaat tcccagcaac cacatggtgg ctcacaatcg tctgtaatgg 
gatctgatgc cctcttctgg tgtgtctgaa gacagctaca gtgtaatcat ataaataaaa 
taaataattc tttaaaaaaa aagagtttga tccatttaca ctggacttct tgaggcagca 
ggatcataaa ttcaaggtga gcctgggtga actggcagaa gtggcagaag ctgtgtctca 
cactgctgta ttcatttcct cattgatctt cagaggtttg ctaacgggaa gtaagtggaa 
cagaaggttc agtattcttt ttttcccaat tctattcagt ctttagtagt agatccctca 
ttatctgaga tgcagagtcc cctttattcc tgtgaccatc tcgttgtttt tcagggagtg 
tctcattcaa ggcctaacac tgaggacatt tcactgtgct atgccaatcg ctctgcagcg 
ctcttccatc tgggtcagta tgaagtgagt attgaagaac ctggtgtcct gcctgtggct 
gcagtggaaa atgagctcct ctctgttctt ctgcacacat tgaaatcaac tagcttgcaa 
acactgacat ccacccagac ccattctctc ttctgactca tgtcacctct cataggtgac 
cacaaacaat atgtagttga caagaagtag ctatgtcatt gtccacagtg catggatttg 
ttccaatagg ccagcacttc tgtgtccata tcagctagat gtgctgctga tagtatttta 
gattccaaaa tgtgtccaga tattacctcc ttcgtttgtt tcttctaaat aagccaggca 
caaagacttg aaagatggct tgatggctct tccagaggct gggtttgatt cccagcccca 
acatagcagc tcacaatagg ctataacgtc atttccaagg ggtctgactt cctgttctgg 
cctctacagg cagaaagcac agacatacat gcaggcaaaa cacataaaca taattgaaag 
aagatattaa ataatagccc acgcttgagc ttattcctct gatgatacag ctctcctgaa 
gtcatcatgg gcagtgtaaa agtaaaggtg ccccgccctg cccaggggca tcagtgaggt 
agctgactgt gagtggcttt cttctcatcc cctaactgct gcaacatcat caactgtgga 
gcattatatc cgtggatttt aagttaggaa atgacaaaga " ttagatctat ggccaggcac 
tagtgcacgc ctttaatccc agcactcagg aagcagaagt aggtggatct ctgtcagttt 
gagtttacag agagtgtcta ggcagccagg gctatataga gaaattctat ctggaaagaa 
taaacaaatc agatctgtat ttcaggaaga tgcctatgag ctgtttgaca tgtgtgatag 
aggtccttaa ggacaggaaa gtattccaca tgtgtgcatt ttacagaaat tggttataca 
ctggagttaa ggactgttgg aatagttgaa tgttcacact cagtgttctt caaatcaaat 
agaagaacaa gaatctatct gggcatggtg atgcacaact gtattcctaa catgtagaag 
actgaggcat gctatgtgtt tgtggctaac ctgggctaca tagtcaatat tggacagtca 
gagctgctac taaaacctaa aaaacaaaaa tctaataatc tgggatttta tattttcctt 
tttttaaaaa aagagtgggc agaatgtctc tgattttgtt cagatggcca cagaacctag 
aaaaactgct gctgctgctg ctgctgcata gcacacagct aatatttgac tatacgtata 
taaattttgt tgtatacttt agctgtgctg tcaactttgg aaaaaaagta tcccagttta 
tcattttaaa ttggcactgt acagaaatta acagccatat tagtctagac acattaaact 
tcatttttcc atttatacag aagcaatgta ctgtattaaa tattcagtct tatctacagg 
ggtttgatta cagaaactat caaagtattc tctaaatgat gaaaaaagat tcaagaatct 
gactgtagat ccaaaggaca agtggagaaa aacttaggaa gaattttccc tttatcccct 
ccctatattg atcatctctt ttacttctaa taatagtggc catttattga acatacccag 
gagttccttt catcacttta gatatataat ttatctcatc ctaaaatgac ctgttgatga 
gtcatctctc tttcagatga gaaacaaaga cattgaaaaa tctaacttgc ctgcataaga 
tcacacctag cctcttactc actccatgaa tattcctttt ttttttttcc tttgagacag 
aatctcacta tgtagttctg gttgtcctag aactcaatat atagaccagg ctagcctcaa 
actcacagag atctgatagc ctctgcctgc cgagtgctag ggttaaatgg atgtgtcacc 
aagcccagca aaatttacct tcttaatttt ctaagacgtt tctcctctta aaaaatggaa 
ctattgagcc agtcatggtg agacaggctt aatctttaat ctcagcactt aggaggcaaa 
gacaggccta tgggttcggg gcagcctgat ctatagagag agttctatgg gttaaagttt 
agggttaaag ttttgagaca aaactttgtg tcaaaaacaa acaaacaaag ccagactgct 
taataagaca aatcagacat aatattataa acaagtatta gtgtcactca attaaaaagt 
cactcaggag gtgagatcaa tccccagcat aatgagggga ggaggggaga gaaggaaatg 
aatgggaggg gaggaggaag gaagagagaa aaaggaaaga tctcaaagca gaacacagga 
tgtaatttaa ggcctaagct ctcgactgaa gttgtccact tttaatgacc cttttcatgc 
tcatggtttt ctgtcttcgg tacatagtga agagtggaaa ccaagggtca tcctggaatt 
tccttttgtt ttcaggcatg tcttaaagac atagtggaag caggtatgca tgggtatcct 
gaaagactgc agcccaagat gatggtgcgt aagacagaat gcctggtgaa cctggggaga 
ctccaggagg caagacagac catcagtgat ctcgaaagca gcctcactgc caagccaacc 
ctggtgcttt cctcttacca gattctgcaa aggaatgtcc agcatctrgaa aataaagatc 
caagaaaagg agactctccc agaacccatc cctgcagctc tcaccaatgc cttcgaggat 
atagccctgg gggaagagaa cacacagatt tctggggcct ccctctctgt cagcttatgc 
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acacaccctt 
ctggtgaagg 
cattgccttg 
tgtctgaagc 
agccaggaat 
gggctgctcc 
agatttgaag 
acctgtttac 
agtgaagaga 
tatggaagta 
gaacacagat 
agcgtgcagg 
gcagatttga 
gcccaggcaa 
attggtattt 
aataaatcag 
agctttgagc 
tctaagacag 
tctcttaaaa 
caagaggcaa 
caggctacat 
caatagcaac 
gcttttttca 
catgtatgtg 
gccactgagc 
atttattcat 
ctaccatgtg 
tcctgaacca 
tattatatgt 
tagagatgat 
gcagtcagtg 
tggtttccat 
gaacttggtg 
aatgatggtg 
catactacac 
tggagagtga 
aagaaaagtc 
tttctccccc 
gccggaggat 
aggacgacat 
atgttttgcc 
gggcatcaga 
ggaatcaaac 
tttattacat 
cccccccccc 
tctagttaat 
cagatztaata 
tacactgtcc 
tgccaattta 
agcagtagga 
agttagtctc 
cctctcaggg 
gtgagtggaa 
atgggtgaga 
ctctgacaat 
ttattggcag 
ggacagaggt 
cccaggaatc 
tttcaccaag 
gagaaggaga 
tcaagacact 
tctgccttca 
tggctcggct 
ggtttggttt 



tgaaaggccg 
aagatgcttt 
agaacaagtg 
acactttggc 
gtatgcagca 
tcacactcgg 
atgttgatag 
ctgaaagcaa 
agagcaagat 
attataatgc 
tcatctgtgc 
cccaaacctt 
ctgtttgggg 
taacatccat 
caagagctaa 
tgacctaaaa 
acatttctgt 
ctttagctat 
acttgagcca 
aggcaggcag 
agtaagatcc 
aaattaattt 
gacacaccag 
gttgctggga 
catctctcca 
tttgtgtgtg 
tgtcttggag 
tctccctagc 
aagtacactg 
tgtgagtcac 
ctcttaactg 
tttatccaaa 
tgtgagcatg 
gcctgcctga 
actagctgag 
tcatctccac 
cttttaaggt 
tccctcattg 
ctttgtgaac 
agatataccc 
tgcatgtatg 
tctctcagaa 
caaggtcttc 
attttcctca 
ccccgagcag 
taagctgaaa 
ctctgttagc 
agattttgcc 
tacagtgttt 
tttgaaagct 
tgtggtacac 
tgtccagaag 
ttctgcagca 
gggccttctc 
cttagcctgt 
taaatcagag 
cattggctga 
tcaggtgctt 
gttatgtatg 
agccctgctg 
gatagacttt 
ttggctcttt 
agtaaaggtg 
ttgttttttt 



ccatctagtt 
tgtaagtgtc 
ggataccaga 
cacagtacct 
ggcatgggac 
ggtcttctgc 
agttgtaagg 
gaatctggtc 
aggtgaaccc 
tatcttcagc 
catcagtgtc 
aaagtcccct 
agcagccatg 
atgtcacaca 
tgtttaagga 
ggaaaacaca 
gaacccagag 
acagcacgtt 
gagccaggtg 
atccctgaat 
tgtttcaaat 
tttagatgta 
aagagggcat 
attgaactca 
gtccattaat 
gtatacatac 
atcaaactca 
catttctagt 
tagctgcctt 
catgtggttg 
ctgagctatc 
gatgccttta 
aagaccagag 
gattccagga 
ctgtgtgctc 
aggcaagcac 
ggtggtggtg 
tgatggcaca 
tggaggtcag 
tgtctcaaac 
tctatgcaca 
ctggaatttc 
tggaagagca 
attacatttc 
caagtattct 
gggagggagg 
ctaactacac 
atcctttttg 
ataggtattc 
gacatttatg 
attttggggt 
ctgctgcaca 
tcccggtcca 
caagggcctt 
gcatagtttt 
gttatccact 
aggataacgt 
gctgactggg 
gcagagggta 
ataagggaag 
gtccttaatt 
gagccactgc 
tttgcctcta 
gagacaaggt 



gccacaaaag 
cttatcccag 
gttaccagtg 
tgtggcagct 
ctctaccata 
catgttgccc 
atgctttgtg 
aaggcatttg 
ccaattcctg 
cttttgcccc 
tccgcactgt 
aagctgaaag 
ctgcgacaca 
ggtaagtcag 
gaaaaacact 
gccgtcttat 
cttgggaggt 
tgaggtcagc 
gtggtggcac 
ccagcctcgt 
aaataaatat 
tttatttatt 
tggatcccat 
acacctctgg 
taaaaattta 
tataatacag 
ggttcttagg 
attcttttct" 
caaataccgg 
ctgggatttg 
tccagcccca 
agggcctggg 
ttcagatccc 
caacggagac 
aagagagcag 
acacctgagc 
ttgttgtttg 
ttcctttaat 
cctgttctac 
agacaaaaat 
ttatgtctgg 
agacacttat 
gcaagtagtc 
caatgctatc 
tcattgctgg 
tagatgttgc 
tatagaagct 
ngtgtatatg 
tttgtgacgt 
tggcctatgg 
ctatctgcgg 
ctgggctgga 
gctgggagtg 
cttagtgtta 
ttattcgaga 
cttagggaag 
actgagatgc 
tttcttaggg 
taggggtttc 
tcccccattt 
agcagggccc 
tacaaaatat 
agcctgaagg 
ttctcagcat 



acattctccc 
gagaaatgcc 
gagacctcta 
gcagctatgc 
gcacagagtg 
tgagaatgac 
acgaggttgg 
attacacaag 
gatgcaatgt 
atactgaaaa 
gcagacaact 
cagtgacccc 
tgctacagct 
aaatggtttt 
ataaaggaag 
atatcattat 
ggagatagga 
ctgaactaca 
atgcatttaa 
ctatatagtg 
aacaaaaaca 
ttatgtatga 
tacagatggt 
aagagcagtc 
aaactagagt 
gttcataagt 
catgggagca 
tttgtcttga 
aaaggggaat 
aactcaggcc 
tttttagtat 
aagatggctc 
tagcacccag 
agacaggggc 
ccctggctta 
acacagacat 
ggggtctttt 
ctcacatctg 
atagcaagcc 
tatttatttt 
tgcccatgaa 
caagtactgc 
tttttttttt 
ccaaa'agtcc 
gccatctccc 
ccaaacttag 
tattctttag 
tctacagatc 
ggatctttta 
tcctgttaaa 
ttccgcatct 
aggatgaagt 
aatgctgggg 
cagctctagg 
tgagcttgta 
gagataggaa 
tcattagcac 
ggttgagagg 
aggctcccca 
tgagccactt 
aacaagtgtc 
ccaaatctgg 
cctgagtttt 
agccctgggt 



aggagaactg 
acgacctcat 
ctgtcaccga 
caagtattgc 
ttctcttggg 
tcttttagcc 
tagcacagac 
tcagggagag 
caatggaaag 
gcatagccca 
caaagctgac 
agggctgtgt 
gcagtgtaat 
tacttacatt 
cctggcatca 
gctattgaga 
tgattaggag 
tgagaacttg 
ttctagtact 
agatccccac 
gcaataataa 
gtacaccatt 
tgtgagccac 
ggtgctctta 
atttttaaac 
caattctctt 
agtatttact 
aagatttatt 
caggtcttgt 
ctccagaaga 
tcttattaag 
agtgggcatt 
gcagatgctg 
cctagctaac 
ctgtgcagga 
gcacaaagga 
gttttgtttt 
ggacaaagag 
catttcagcc 
atatatttga 
agccagaaga 
ctgagtgcta 
ttaatatttt 
cccataccct 
catctccttt 
gatttattga 
actttcacat 
ttaattcagc 
cccatcttaa 
tcacatttca 
cacacttttc 
ggagtccaga 
tcaggaggag 
caaaggcctt 
tgcatacact 
tacccaaggt 
ggggaggcat 
gtagcagtga 
gacaaagaag 
cagaaggcta 
tgttttcttt 
gccaggaagc 
cacttzgattt 
attctggaac 
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tcactctgta 
gcttagatta 
gatggctcag 
gcaaccacat 
ctgaagacag 
aagaacacta 
ttcattctag 
gaaaggccac 
ttctacctgt 
gatggttgtt 
gccaccacat 
agttacccat 
gcacatattt 
agcccagttt 
aaaaacaaac 
ttgggcgtgt 
ttgggaggca 
gaaaccctgt 
gttaatatga 
ggagacatfcc 
cnnnnnnnnn 
nnnnnnnnnn 
aaccgtcagg 
attaaagcac 
gagaggcaca 
ctgggagaag 
agcagctatt 
tctgcagggg 
ttgtgtctat 
gtgtgtgtgt 
gtcagtgctc 
ggccaggctc 
taggaaataa 
aattggattc 
tatgcagagc 
gactgggctt 
tgcccgtcct 
gctactgaaa 
cagggccctc 
ctcctgaggc 
ccatccaacc 
agaaacctct 
gctaatgtct 
tgtgttgcat 
tgattcttgc 
gcccagggac 
tttttttctt 
tagtttataa 
tttatatatt 
taagtggaca 
aaatgtcatc 
ccgggggctg 
gtggcatagg 
tctgagattg 
tccaaaacca 
agcttcttca 
cactgtgctg 
atacccttcc 
cggtggctga 
aaaaaaagaa 
agtgctgggn 
ggccaggagt 
acggatgggg 
gtcaaagctg 



gatcaggctc 
aagttgtgcg 
tgggtaagag 
ggtggctcac 
ctacgggtgt 
ttcacaaggt 
gggaaaaatc 
agcacgtgta 
ttctgaggca 
ctatttctgc 
aagcatggca 
caagccatct 
aatcccagca 
acaaatcaag 
atccttctgt 
caaaaatcac 
gaggcaggag 
ctcaaaaaaa 
aat'agcagaa 
catactgaaa 
nnnnnnnnnn 
nnnnnnnnnn 
ctcagaggca 
caaatcttgt 
catagcttcc 
cttgaagctc 
cagcagctgc 
aaagacaagg 
gtgggtgagg 
gtgtgacaga 
cctttggctc 
gcccacccca 
aaaacagggt 
agcatttctt 
acaagagact 
gcctgcaggc 
ggggtcatcc 
caccctcatc 
agtaggtcac 
cagagctcag 
cattggacta 
tccccagctc 
gcagcttcta 
ataaaaacga 
tgcttctctt 
tattgttgct 
gctagaagtt 
acatgataag 
tatatatata 
taggtataaa 
cactggcgat 
cctgagacgt 
gttggacagg 
ctggttgccc 
gcagggtcct 
aggggcgcat 
gagtggatac 
cgtctacggc 
aggcttcagc 
agaaagaaag 
gacggagccc 
ggggccccaa 
gtcctgcgag 
aacttgtgct 



aacttgaatt 
ccaacactgc 
cacccgactg 
aaccacctgt 
acttacatat 
acaacaacag 
tttttaaact 
tccaggttag 
gtctcttgtt 
ctctcatgtt 
ccagcactga 
tgctagcccc 
ctcaggaggc 
tcccagaata 
gttccagtca 
aagtagcaag 
gatctctgtg 
ccaagcaaaa 
ctcaagatag 
tatatgcaga 
nnnnnnnnnn 
nnnnnnnnnn 
ggatggcagt 
actaaaaaca 
gggcccttgc 
aagaactggc 
ggctcttggg 
agatggcact 
gaagtgccgt 
gacagaaagg 
ctcctgccaa 
ccaaagccca 
tcttgtccaa 
actcctcagt 
gtgcttgcca 
tgccttctgg 
aggaggagca 
tgctaaggcc 
tggccaaaga 
agcctccctg 
acactgtaga 
cgcctagctt 
cagtaggggt 
ggtgatgttc 
ccttcccgac 
ggctggggtt 
accgatacag 
acatacagtt 
tatatattat 
actgcacctt 
ctctcctttg 
caggaatgag 
gcaggttaac 
cagctggctc 
tcagcccttc 
gtcccgacac 
ggtccagagg 
tgtacatttt 
agccttttcc 
aaaagaagaa 
tgggtcctca 
gggccgtgcc 
cacacttgcc 
gatagtcggg 



cagagatctg 
ccacctaaaa 
ctcttccgaa 
gatgagatct 
aataataaat 
tgtactagga 
ttagtgtgta 
gagtcaactt 
tctgccctac 
gccctatgca 
gccaagggca 
agaaaatgta 
agaggcaggc 
gccaaggcta 
caatgactgc 
gtgtaatggg 
agtttagcac 
atagaattac 
ctaatgaaac 
gcctgatgct 
nnnnnnnnnn 
nnnnnnnnnn 
gatgtgtttt 
gccctctgct 
cctctgctct 
cgctgttgcc 
caagcttgga 
gagctgggta 
gtgtgtgtgt 
gagagtgcat 
aaagcatgct 
ggatctgcca 
gagaattttt 
atcctctctg 
gaaacccttg 
gctgacccct 
ggaccactgt 
actagatttt 
gcctagatgt 
gcctgcacaa 
cgctgccttg 
gcctccagca 
gacggggtgc 
taagtatcta 
tcttcccact 
ctgaacaaaa 
tccttaattg 
tggtaaggag 
gtgggtgtgt 
ctgtgtgact 
gtgattggtt 
aggcattacc 
taagtgatct 
tgggagagcc 
tctccaaatg 
cactaatgac 
cgctcggtca 
gggcttataa 
tgaaacccag 
aaaaattctg 
gcggagagtg 
tgaggatgcc 
cttcctcttg 
ggcatagtcc 



cctgcttcta 
aaaatatgag 
ggtccgaagt 
gatgccctct 
aaatcttaaa 
aactttaaaa 
agaaagagag 
ttcagaagcg 
actttgtaca 
tgctgagctt 
ggcatctggc 
tttttgacag 
agatctctgt 
catagagaaa 
tgtaacaata 
caggccttta 
agccagggct 
aagttaacca 
aaaggattct 
tgcctagcaa 
nnnnnnnnnn 
ngcctgggca 
ggggcacaag 
ctcgatctgc 
cccggtcagt 
tttgtgctga 
agtagctcgt 
gccaggatgt 
gtgtgtgtgt 
gttcttgtgt 
gtgttctcag 
ccccattaag 
attattattt 
gtagggaatt 
ggccaacagc 
gagtctggcc 
gatacagggt 
tttcccactg 
taaaactgca 
agtgctaaga 
tctgccagct 
ccctgacatc 
tccggggtgc 
agaatgttgg 
gacacctttg 
ttgccgcagt 
agcaaatata 
tgggttgggg 
gggcagagtg 
ctcattgcga 
cttggacccc 
cccatggata 
cagacaagag 
ttgttctgag 
accaggcttc 
tcactttgcg 
ggacagccga 
tccaccagga 
cagatcttcc 
ttccttgctg 
actgccagcc 
tgctgcaccc 
ggtcgtgcgg 
tgcagttcgg 



catcccgagt 
gggctggtga 
tcaaatccca 
tctggtgcat 
aaaaaaaaaa 
tagcccatca 
aggggctgta 
gagtctcccc 
tgaacttcaa 
actgatgcca 
tcacatggag 
gtgtggtggt 
gtctgaagcc 
ccctgttttg 
atatgaggat 
gtctcagcac 
gttacacaga 
gggtattggt 
attttataaa 
ctgaactaca 
nnnnnnnnnn 
agagtaagca 
gccccctttc 
agagatacga 
tcctggactc 
cgccagaacc 
tccttctcct 
gggaagtaaa 
gtgtgtgtgt 
gttcctgtga 
agtcaagctt 
aatgcggggt 
tttctctctt 
caggtctgtc 
cctattccct 
ctctgacctc 
tcctgagatg 
ccagactact 
gctaaagcct 
aggacattgt 
tgaaaacccc 
cacttctctc 
cagacaggcg 
tcccctgaag 
cccccagcaa 
tttttgtttt 
ggttcccgta 
gacctgtgca 
aggatatata 
gtacagttct 
agcaggtctg 
gggactgagg 
ccaagagtgc 
tcctgctcct 
cgcagagccc 
tgcctttgac 
gtgagacgtg 
agggtccaga 
acttaggaaa 
gacatgtggc 
ccagtatcca 
cactgggggc 
tgggcatgat 
taagctcttt 
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cccagagctc 
cttcttgtaa 
ggccccatgg 
agacttgatc 
cagcttcaga 
gccacggaac 
attcccattg 
ctcagtgagg 
cttgtggtga 
gccccggaag 
gagattgacc 
gtagagatac 
acacttatag 
cgttcgcagc 
ggtgttggga 
gctgaagtgg 
gtcctctggg 
gggcgcaggg 
gagcagcaac 
ggggctgccc 
aatgactctg 
cagggcattt 
tctctaattc 
ttgtgaacac 
tagctgcctt 
tatttaaata 
aataagcgcc 
gtttcctcat 
tgaggggctg 
tgccagatcg 
attgccaggc 
tgtagcttcc 
attttcggat 
ctcattatgg 
ccagcaggcg 
ggccaggcag 
gttcagctgg 
ttcttgctgc 
cccacttggg 
gacttctgac 
gctgttggga 
ggcttcagct 
aattgggttc 
taggacccct 
cggcccttag 
tgtctagatg 
ccagcctctg 
ctgccctgtg 
tccctgcttg 
cggtcccctg 
tttatgctgg 
cttttcttcc 
accattcgac 
ttcccctgtg 
ggaagtccct 
gctctattta 
agtttataaa 
actggggttt 
tgcagaacca 
caatggacca 
ctggaagcat 
tccacatgcc 
tgaggggctc 
ctgagcaagc 



accttagaga 
cctgacctgg 
gacggatggt 
tggtgaggag 
tcctggcctt 
cttcgcagcc 
aggcgaagga 
ctgttgttga 
acccactgta 
atgccttggc 
aggtccacaa 
tgcaggctgt 
aggtagaggg 
tgtcggttgt 
gcaatgaaag 
ccctgctgga 
atgccctccg 
tagcacacac 
agcagcaatt 
agaggaggtg 
ttggctcaca 
ggggccaagg 
catccgaaca 
aggcgagtcg 
agcaaggaat 
attcctcaaa 
tctttcacac 
gagttaaaca 
tgagtgacac 
aggctgcatg 
tgtggctgta 
ctctaattgg 
gtgtccagcc 
acgattgtca 
cgtcaggaag 
ccacgaccta 
gactcttggg 
tagctgtgct 
gtcccaggcc 
gccccccccc 
gttatgggct 
agaatgtttc 
agttgtgatt 
ttctgggccg 
acagcatgca 
ttactctgtc 
tcctagggat 
ccaccagcac 
gcttgcaccc 
ggaaagcctg 
atgagacaca 
ccggcctgca 
atttgtacag 
ttctcccacc 
• tggggagatt 
ggaaaaaaaa 
cataccgtgg 
tccctctaaa 
ggctcagttc 
caaggacctt 
actaatagga 
agggatgtta 
agccactctg 
tgtcctagtc 



tctggttccg 
agccaggcgg 
gctccttgcg 
acactggtcc 
gccgcagctc 
attcccacag 
actccaaggc 
agagaaagag 
gctggttctc 
ccaggctcca 
agatgtcatc 
gcaggccacc 
cgtgaagctt 
ctccaaggtc 
tgatgttgtt 
ggaaggtgat 
ggatggcagc 
agtctcgagg 
ccacacagca 
gagatagatg 
gaggttctgt 
cccacattat 
cttcttcaga 
gcctccggct 
gactctaaca 
ataaagaaag 
tgtggtacaa 
cagttcacag 
cagggctgac 
ctcctctccc 
aaaccgggcc 
gcattaatta 
tgtggtcggg 
ttacctgcct 
atggggacag 
tgtcgcctca 
agagccaggg 
cagagcaagt 
cagggactgc 
cattcccaac 
cccagagagt 
tagcacataa 
tcagagcaat 
ggccatgccc 
ggctataacc 
gatcttcctc 
cctgtgctgc 
tcacactggc 
tctcttccag 
gactctaggt 
gcagagcctt 
gctgcaggca 
ctcatcggag 
tactgtggct 
acccccctta 
aatcaaactg 
caagtggagg 
agggcaatgg 
ccctgttatc 
gtccttgtag 
tgtaggctca 
accttccaag 
catccatgtc 
agccctggcc 



gttcctgtgg 
atggccatgt 
ggcagccctg 
tgtgcagttc 
gggggtcgcg 
ggaacgtgcc 
caccaggggg 
ggtggttagc 
atgcagcagc 
tagcttgtta 
ttggaggtac 
aaagatgcct 
caccaggcct 
tagctcctcc 
ggagtagatc 
gcgattgttc 
aaagttgtgt 
acaaccacca 
ccctggtggg 
gggaacagag 
cctctgtatt 
ctcctcacct 
ctgtcagccc 
ctgggtccgg 
aagcaaatct 
cattgagtcc 
aaacttcaaa 
ggctgtgtgt 
gcacgaggct 
ccattcacac 
ttgcctcttg 
ggcattcgtt 
taataggcct 
ttttccaggg 
gctccaggcc 
gttggcctct 
cccctggggg 
ggaaggagca 
ccaggccccg 
agaaacagca 
ggcagctgct 
aaatcatcgc 
tatgacctca 
tgcaggccct 
atctctgcct 
ctcagcatcc 
agagggaggc 
tgcacagtcc 
gaaggccatt 
tcaaggacat 
cttcactggg 
ggtaggggtt 
ttrtacagagg 
gctctgtctc 
cataggggcc 
aaccaaacct 
aggcaggcgg 
ggggacaaag 
tcaaggctat 
gacagtcact 
ggacagctgg 
cctccatctc 
tttgaaagcc 
acttctggac 



ctggtgcagt 
gggtggcctt 
tcagaggtgg 
cggaagtcct 
caggggacag 
cggcagccac 
gccagacagt 
ctgtggaggt 
aaccggtcca 
ccatggagaa 
tcgatatggt 
gcgggcaggg 
tggaaggtct 
agatgcacaa 
cagagggtga 
tgcaggaaga 
gcctggcagc 
cccagaggta 
gagagacaga 
ggtggaatgg 
gtatgcaggg 
ctttagctct 
caccgcaggg 
ctctgccact 
ggagtcctga 
atggtaccaa 
cctacaactc 
acagacaagg 
tcccttgggg 
ccccccctcc 
actgtccaga 
aatggatcct 
atgctcatta 
tcacagctgc 
tacgggcgcc 
tgccccttct 
aatatgagct 
gggaccttct 
gcagagtagg 
tcgtaagttg 
tctcgtccct 
atataattta 
gcagcagggg 
ctgcagtgct 
caatttcctg 
tgggtgtggc 
acagtcggag 
acagacccac 
cttgccagaa 
tcatgatgct 
gggtcctgtg 
gcagtgggct 
gcttttgttg 
tgtgcacccc 
tccaaggaat 
caggtgtgga 
cagaacggca 
ggactctagc 
catcactagg 
tcctgcagtg 
tctctgtcct 
ttctaatggg 
agtggtatta 
tccttgcctg 



tcttgcctgc 
tgtccctgga 
taagcgtgtg 
ccaccctcag 
cagagctaga 
agtcccaagc 
caccctgcag 
catggaaagc 
ggttcaccag 
acaagtgact 
tgtcctgcaa 
cgctcagtcc 
cgggtgccag 
agccctcgaa 
ccatggcggg 
tgcgctcact 
tgacagtcat 
gctctccagc 
acagcagtga 
gggactggca 
gtcccttgga 
gtccctaaag 
tggaacacac 
cgctcactgt 
atgatcactt 
agcatgcctc 
ctccatggct 
cacatttctg 
ttcacagtac 
tgtgctggag 
gcatttcctc 
taaaataatt 
tggaagccgc 
cccaagtggc 
caccctgaag 
tttccagctt 
gagctgaatc 
gaccaggctt 
ttgccacctt 
acagctccca 
gtaatcaccc 
gtttttgcat 
tggcacagca 
ttctgcccac 
ctccaaaacg 
ctccagcctg 
ggaggggagc 
agctccaacc 
cctttcccaa 
tgccccacat 
aaaatgaaag 
tatcactaat 
taccctcaac 
aaaagaatct 
acaggctaca 
cttagtaacc 
tgaaggtagg 
caagacctga 
aggcctaagg 
aagtgctctt 
ttagtatttt 
gggggggtgt 
ctccaggacc 
agtcagtagg 
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tgccaatcct aggattgtca ccagcaggtt tcttcctagg gaggcaagca ctgtatcacc 
atggcgcctt ctatgccccc tctatgaggc ccttgggagc cccgccccac tgattgcctg 
attaatgtac caacaatgag gatggagcct ttgccatgca ttttaacatt gcaaattagc 
aggaatccaa gtctctgtgg aggggccctg cacctcttct gccagactca tcaagcgcct 
cttgggcagg gctgccttct acttgagggg gcggaaggga gaagacccag ttccactctc 
cttcccctcc aggaggtgcc cttcatcgtg ttctgcttcg ttactctcaa gcctccggcc 
tcccacgcac gtgagctccc aaggggctct acagcctccg tcattccttc ttccattcat 
acttgccccc tagtctagga gagccatgga agacagtgtg ggaagggctt gacaatgagc 
atcatgcccc atttgcatat gcggtggcaa taccctggtg ggtaccagga gagtataggg 
gaaattaaga gaggggccta aggaaagcct ctgctatccc tgggctacca gtcagcattg 
cttggtcact gatcccctct gtaacaccag cccttctgca acctgccaga gtttttgacc 
tttgaactag ggctgagaag ggtctgctct gttcagctgc cttggctggg aggggaatct 
gctcagacct cagcacacac tcaacagaag gcatgcaagc aagggagcta gcagtggcct 
tgggtcagct ggcaagcccc aaactcttcc tgccaagctg agcatgaaaa gccacctcac 
catggtccca tgggaccaga cctggtagga taggtggcaa ggctaaggca gcggaatagc 
atgtgcaaag gcactggggt gggaaagggc ctgtgcttct caccccctct aatggtgcag 
agcctccaag gaatactgta acctcagctc agctgggctc gggtggccag agagcttggc 
accagaacca gcatcaacag ggcctgtctg ctaaacccag acctcacaag ccagtttagt 
aggggccctg tagcaccctg gccaccagaa ctaacgagga agatctgacg ctgggaatat 
gtctttaatg aaaagccctt ccggaagcca catttgcaca gaagaaaatg aggtgcccag 
agcatcagtg ggctggttgc agctggagaa cacagcaggg ggacaggtcc taccaagcta 
ccctgccttc aggctggggc tctagccagc tccctgatgc ctggagtagg taaagcagcc 
tcgaaatggg ctgggtcagc tttttcaggc tccaaagggt caggacagct gctgcagctt 
agcacccaag ggggctgccc ctctacccct aagtagaggc atccccatgg cccctgggca 
ggtcagtggg tctctctgaa gctttgtagg ctgcttcttg gccatgtagc caatctctct 
ggccttcagt cctccctccc tgcccccagc ctggccagct gctcttctct gagcaatcga 
tgttaaccga atgctctctt gctgtgggga tggcggcctc aggccaggcc agctgcactc 
ctggggctgc tggcgcctca ggccacttgg cacttgtgcc " acttgtgttc taaacacagg 
ctccttcctg cc-cggccct gaagacaaga aagctggcca gggaacagct gggctcccat 
ctcagcctcc actgctgtgc agagcggccg gcagcctcct atccatggct gtgagtagaa 
cagggctgtg gagccagagg cctaagttga atcctggctg ctccttttaa tgcttgcagg 
agcctcgttr tcctcacctg caaaatgggg cagtcattgg aagctcagcc agtcctccag 
cccacagata ctggtaccca cctgcccttc ccacatctcc atctgtctaa ctgaaaccat 
ccaaaccaag ctcttctctt cctctccggc ctcctccgtt cacaacttct ccatcttggg 
taaagatggt ccctttacat cagctgcctg ggaggaacac ctcagaatca ccgtggctca 
ctctggggaa attctgctgg ctagatttta gaatgtatcc agtatctatc cacttatcat 
actcgctatt gctaccattc acccagtagc ctcctggatg ccctcccccc ccccccgcaa 
gccctgcctc ctcaccccta cacctccttc aacaggaact agggtagtcc agggaaagtg 
agtcaggaag ggctgctcct tagtctgcat cctccagagt gcccatctaa ctagaagctg 
ccccaggtct ttctcccagc ccagaggccc tgcctctccc tggctacaca gaccttgctg 
ccgtccctcc cacatgccag ccctcagcct ccctcctgcc tttgtccatg ctgttccatc 
tacctggacc ggttcccagt gtgtctgcag ggctgcttcc cagagaagcc actcttgagc 
agcgatttgc aaggagcttc tttcctggga cattttactc tacagcacgc gacctcttgg 
acagcacgac ataagtcact tgtttccttt atgtccgctg ccactttgtt ctgatgagtg 
tgctccctgc acacagtagg tgctcattaa cagttctggg ggagggaatt accttctcaa 
gtctctggag aattgaatga tgacactcag gaagccctag gctcaagcct ggggcctgga 
tcatagtagg tgctcgataa atgttggttg taattagtcc tgggagactc agagccttca 
ggagaacaga cacctgaact tggctcacat caagactcct taggcccatc aaggaagtga 
ccgtttgttg gatgagcact ctgagaagga cccaatacca gtcctttcct gggcaagggg 
aaatagactg ggactgggga gtttcccaag gtatgggatt ttcagcacta aactggaata 
atgctaaaga aaaaaaaaag ttagtcactc taaaaggggc caggaccaaa acctttcaaa 
cagaaatgtc tgggtttatg aagagaggaa gccagatatg gtggggcaca tctttaatcc 
aggtgcttgg gaggcaaaga cagatggagc tctgtgagtt tgaggccagc ctattctaca 
aagtgagttc taggacagcc aaggctacat agagaaaccc acttgacttg ccacccaaat 
taaaaatctt agtgggggag cagtcaagga ggaaacacac acacacacac acacacacac 
acacacacac acgttagtat aatatcatac tatggctctg tgcctgcagt ccaggaatga 
gggctgaact cagagtgtta gtgtgtgcta gtggatattt gagctctgta tttatgtgca 
tgtctgtgta gatgtgtacc tgaggtgttt atgtgtacac aggtgttggc ctgttgcata 
tggatggaga catggttgtg tttgctaggc atttgcatgt gtctgagttc atgcacataa 
actcacatct acctctggag actgagagtg acaacccagg gcccttttat cctgcagcac 
cccaggccca gcaccccgac ccagcatccc aggcccagca ctccagaccc agcaccccag 
gcccagcatc ccacgcccag catcccaggc ccagcacccc agacccagca tcccaggccc 
agcaccccag acccagcacc ccaggcccag catcccaggc ccagcacccc gacccagcat 
cccaggccca gcaccccagg tccaagcact caatgcccag caccccgacc cagcatccca 
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ggcccagcac 
cagcatccca 
atcccaggga 
cagggacagc 
tggaggcagc 
cctcattgct 
tcccagtacc 
gtctggggtt 
cagcacatcc 
tggagcctac 
tgggagtcat 
gtatgggaat 
ctgcacggga 
aactcaggat 
ccagggctaa 
tctgtcatcc 
ctacttctcc 
aatcttctgc 
tcccttttac 
tttctncncn 
nnnnnnnnnn 
aaattgaact 
ccttgtgatc 
tagtggttct 
aggcattgct 
tcttaagatg 
agcatgtgct 
acaaaaccaa 
attgggacac 
gaggatgaaa 
aaaatatata 
ttgactgtat 
cccaaaattc 
caggaatgct 
acacttccag 
atgtccttgg 
aagtatttat 
ctaggcctag 
cagcctttag 
tgattcagtc 
gtcttttctc 
tctaatctct 
ttctctcttc 
tgtgccgctc 
ctgtgctgca 
ctctcttctc 
tttgtctgcc 
tttaccttca 
tgagagtaaa 
tctccataaa 
ctgccatgtg 
aataagcatt 
cgactacatt 
agtgaatttt 
ttatattaag 
tttcttcata 
cttatggaaa 
cctcccatct 
tgaactcctc 
gagataaggg 
catgacccct 
ctagtgatgt 
agcctatgaa 
cgaccccaga 



cccaggccca 
ggcccagcat 
cagcacccca 
accccaggcc 
acatactgaa 
ttcagcacct 
aggattcccc 
ggcagcccca 
tgcccattag 
cctgtatagc 
ttcacctagg 
aggtaaggag 
gattgtgtca 
ggtccagaat 
gacccttcct 
ccatctgtgc 
cagcatccca 
taggtttgag 
caccaccact 
nnnnnnnnnn 
nnnnnnnnnn 
tacagaggaa 
agaaagctat 
ggggactgaa 
tatgctaagc 
cctataacca 
caaaaacaca 
acaacaacaa 
atttatacta 
agcccctgca 
gtgtatttat 
atctttgaaa 
ctgagttgga 
ctgatctaat 
gaggcagaaa 
agaggctacc 
ttgactgatg 
ccctggaagc 
gacttgctgc 
cccctgttcc 
ttggcctctg 
tgtctccttc 
catccttctc 
tactctccct 
ctctcttctc 
ctgagggttg 
actcaattag 
ttgtttcaaa 
gatgtgtgct 
taacagaatc 
gtgaagagaa 
ttcttgggag 
ttgccatgcc 
aaggcagcca 
aattgggaac 
tacaacttag 
ctgaagaata 
agagattgtt 
accctagagt 
cctcctaaaa 
tagttacgta 
aaacttgtac 
aaatttggct 
gctctggtct 



gcatccaagg 
cccagggaca 
ggcccagtat 
cagtatccca 
gatagggaat 
gctctcctca 
tcagctgagg 
tgctaattgg 
gcctaacctt 
tctgtattga 
caactccaga 
agcatagtga 
ctgtggttcc 
ccagagctct 
tggacggttt 
ctaacagctc 
tgcccgcacc 
ggccccaggt 
aatcctcttc 
nnnnnnnnnn 
nnnnnnnnnn 
acacaaatga 
agattaaacc 
cccaaggaag 
tgacaaaggc 
tgggtggagc 
cataaagatt 
aaagtactta 
ttctcgttct 
gaacaacgca 
gtaaaacata 
agaaccagca 
ggttcaatgc 
caatggattg 
caaggtgagg 
tcatccttga 
taactgtcat 
ttctagcttc 
tgagatcacc 
gggctcaaac 
aattgctctg 
acactctctt 
tgtaaagctc 
ctcagctcta 
cggtaccacc 
ggcagatcct 
acatcacttt 
ttaaaggtga 
aataaggctg 
ttagggttca 
tgcagagagt 
ttggattaga 
ttgagactgt 
aggctgtggt 
aaaaaagcag 
tttttatagt 
aaaattttta 
cccggaacac 
tcgaaccctc 
caacctcaaa 
gattcccttg 
tttccctgcc 
ggtcgtcgat 
atgttccatg 



cccagcatcc 
gcaccccagg 
cccagggaca 
gggacagcac 
gtctctgagg 
cactcggaat 
aatgggtagc 
actgacagtt 
gcctgcaggg 
ggcactcccc 
tgggcacaaa 
gcctggctgg 
agactgccaa 
taagagagca 
cttggctttc 
tctgtggaaa 
ccagctcact 
tgaggctgtg 
ctcctcttcc 
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gctgtaatga 
ccagcccttt 
gacagctgct 
tttctgaatg 
aattctctaa 
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atagctcatc* 
atcacacgac 
cacaatgaaa 
cataaacaca 
gagtgctatg 
agagttcata 
accaactgat 
cctaactgga 
tccttccctt 
cttggaggct 
catacaatct 
gtttcctgtt 
tcctctcccc 
cttggtctca 
gcttgttctg 
tcccggtaaa 
ctgcactgct 
tgtgtctccc 
atcctgtcaa 
caagcaggag 
gtactaaggg 
agccaactct 
caatacgatc 
taaggatgtg 
gtcattcctg 
ggagggctga 
tggactgtta 
agtagaaagg 
tagcttttta 
ctgaactctt 
tcctgaactc 
ccaactaaaa 
atgaaccggg 
gcagaacccc 
cagctctccc 
tctcctctac 
tgctttcttg 



cagggacagc 
cccagcatcc 
gcaccccagg 
cccaggcaca 
cctcttatct 
caaacaccct 
taccattttg 
tctcctgaga 
gtgtgctgta 
caagctatga 
aatctctcca 
gcacctgaga 
gacatcttgg 
gatgctgaga 
tactctgtcc 
acatgaggcg 
gtgtgccctc 
ggtttccctc 
tcctcttcct 
nnnnnnnnnn 
nnnnnnnnna 
cagaaaagga 
ttgcctttgt 
tggaatgttg 
catatggtag 
gagtctgtct 
gcagtatttg 
agtaggaaaa 
tattaagaag 
ttgaatagag 
taaaaagaat 
gtttataaat 
ctcttaggaa 
agattaataa 
ggaagtttgt 
tctccgcttc 
tactggctcc 
aatccaagcc 
ctttctgaac 
gatgatttta 
aactaactct 
tctttactgt 
cctgcctcct 
ctccacagct 
ttacgtagct 
acctttctct 
tgcctcctct 
tgtgtctctt 
agctagaaat 
aaatatcctg 
ctcttgaggt 
tgacactgtg 
gcgatggatt 
ctagctacgt 
acagttttgg 
tgttgctatg 
cttcacccca 
ttcaccccag 
actgttccaa 
tacattgcca 
ttgtcccttg 
cccttgagtt 
accactaggt 
ctgttgttct 



accccaggcc 
caggtccagc 
cccagtatcc 
gtatcccaca 
tggtccttac 
gtgcaggttc 
gcttttgtct 
gcaatttggg 
ggggcaggga 
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ataagggtag 
cctgagcagc 
ctttcacccc 
ggcacttaac 
tctgtcccag 
tatgagctct 
atgttactca 
catctgtccc 
cctcctcctc 
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caaaggacgc 
atactatttc 
ttttcatatt 
gagacctctc 
tatctattca 
gaacttctta 
caacagtgaa 
agaacaacaa 
attaaggtgt 
aataaagaaa 
agaaaactct 
gaaaagttcc 
gtaactaaat 
tctgaaggca 
caccagaggc 
ctggctgtaa 
atcagctaac 
tagaatgttc 
tctagctggc 
ttcacaatct 
agcaatcttt 
gtctagtttg 
cctccccctc 
ctcctgtatc 
tccctttcct 
gattcttcac 
acaaaccaac 
tttcagccag 
agtttctttt 
agacagctgg 
ttcagatggg 
acaaaggact 
aatacactgg 
ttagctggat 
cagaaggagg 
accaaaatac 
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aatgctttcc 
gaacattttt 
aataatagga 
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gcatgagttt 
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gccttctaca 
tgagtgcttg 
acagcgcgac 
ggtctgatgt 
gctcagtgag 
tgtccaccgt 
cctggtggac 
tcccacctcg 
agaccgtggg 
ctcgtgcaga 
tgtggtcgac 
ctcttgttgt 
tgactctgga 
gaaagggtcc 
cacctgaggg 
aggaaggggg 
tcccacctcc 
cccagtctga 
ttgacgacct 
ctgcagctgc 
ctcggagtcg 
tgccactcag 
agtactggcc 
cagagaaccc 
cttgggatga 
tcctcataga 
acctcgtgga 
gtaggggacg 
gacggcccac 
cagtcttcct 
cagaggggca 
agaaaaagct 
aagcagaaaa 
aagaaataga 
ttttggccgc 
acagggcagt 
attgcaaaga 
tcctaaccct 
taactttgtc 
cagtactcac 
ctggtagtaa 
tgacccactc 
taaccaaact 
aggcccctgt 
aacccaaaaa 
aagcaggaat 
ccacccccat 
ctcatatcca 
caccacttct 
gggaggttaa 
taagctctct 
tttgcctgcg 
agggcggaca 
ccaccctgtt 
agcttaccct 
gtcaccaggg 
ctaaaaaggc 
gcaaaagatg 
ccccacggca 
gctttgcaac 
ggaaagaaga 
cgctagcatt 
tagccagggg 
caaaaaaatt 



ttttgagtac 
agtgagggtc 
cacccagagg 
ttgtgttctg 
accgcgctcc 
ccgttcaccc 
ccctttggag 
tgcccagttg 
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gggtctcaat 
gcagagtcgc 
gtctactgtt 
gcattggaag 
gtggcagacc 
tgcttttgac 
tcaccctgat 
gtgggtcaag 
tgcagccgga 
cctctggata 
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agcacatgtg 
tttttcctct 
ctctggactt 
ttgtcagcag 
ggcgagaaaa 
cgaggctttc 
cctccttgtc 
caatttggct 
tgagcgtcta 
gagagccgct 
gcaaaggctg 
agtctatcat 
ggagagggaa 
agttgtgaat 
gaaaccgcaa 
gaaaggacac 
agaagatgat 
cgtggagggg 
taacccccta 
attttacccc 
ctttctggtg 
aaaggctcaa 
tgcctgcctt 
tgcagtctct 
ggggttggct 
ttcggtaaaa 
gaggttgctg 
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cccacctgag 
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gactggtcaa 
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gctcacagag 
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cgccgtttct 
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ccttggaccc 
aagtctggcc 
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ggaccaatag 
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agagggccag 
tctgatttat 
actgggctcc 
cttttgcagg 
aatgttctgg 
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aagagggaaa 
aatagacggg 
gatagacagt 
ggtggcaaaa 
tgggctagag 
tagggaagtc 
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ggcaagctag 
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atacctgagt 
gtccaattta 
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tcaggttggc 
aaacaagtgc 
caatacccca 
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aaaccaggga 
ctggacattc 
agaacatggt 
aagagtcagc 
ctaacttgga 
ctccatcggg 
gtagatgatc 
ctcctcacag 
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tttctgggga 
cctctatatc 
gcttttgagg 
actaagcctt 
caagcactgg 
gctagtggat 



tcttcttggg 
ggtctttcat 
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tggtgcgatc 
gcggggtgga 
cccaggaaaa 
catttggggt 
ggttcgagtc 
tggtcacgag 
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ggtttctttt 
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tcagagccca 
ccgagtggcc 
tcgctgccgt 
acattgtgac 
caaactcttc 
catcagcacc 
ctccccctta 
cgagaggggc 
aggaaggggg 
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ttgagtcact 
ttctttttac 
gagaggacgg 
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aggtcttgca 
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agataccact 
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acttcctgat 
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aacgagctct 
gccctgctcc 
cttcagaagg 
cagaaaaaga 
taactgcgtt 
ctccggttgt 
tgagcaagga 
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ccaatgacta 
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atacagtcct 
tcctgtttgc 
ctaggctacc 
atcttgcgcc 
tcttggtcgc 
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taaccttcct 
agactgttat 
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ggcccacatg 
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tttggtgcat 
aaggttcttt 
gcagtttcgg 
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ttgtcttagt 
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aacgtttaga 
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ctggcagaat 
gaaactgacg 
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tcagggaccg 
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taatgatctc 
aactaaccac 
tatgttctcc 
aacagaagaa 
cacacccact 
ggactacaac 
aggtctcaga 
ggggcagact 
cacccctttt 
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gctccaagat 
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ggagagaaat 
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gtctctaatg 
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aaaaatatag 
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tagccctgct 
cccctcatgc 
cgaatgacac 
atcctcaacc 
atggacatcc 
agagctccca 
ggagctgcgg 
tcggcacaaa 
atcgttaata 
atctacaggc 
attctggccc 
ggccacccaa 
aaaacaagtt 
tgaggatggg 
tttttgaaag 
accaattcac 
aagtccccaa 
tcactaatgc 
gagtgtattg 
tgttagtatt 
ctgcccagat 
aggtaatcgg 
ctcagttggg 
tagagaagat 
gaaaagactg 
actcctttrg 
gtttccgact 
gtgattagga 
gtaccccacg 
cttgagccac 
gttgacggaa 
gaagaaaacc 
ctgcgccgca 
gtcctgggag 
cctctggatt 
caattgggac 
cccaaatggg 
agctgcacat 
cgggggggca 
ctggaagccc 
gtcaaacgaa 
cagagcaccc 
cgagaacgga 
atacattcca 
ggcagtgaca 
tggctccccc 
taccctccct 
taggaacaga 
ctaaccctaa 
aaggaatagc 
gaaaaaacag 
tacctcagga 
gtcagtatct 
gtgtgtctat 
ccagactctt 
gaaaaagaga 
gagtaggtac 
gcagttatgg 
ttaacttccc 
aaaaaaagag 
ggagtaatca 
agaaaaagcc 
ctcctctcca 
tgcatcctta 



gatcaaagat 
cttagaaagt 
actatcagag 
cagctaccct 
tcgctgaaga 
gttggtacac 
tggtagacgg 
aggctgaact 
tctacactga 
agcgagggct 
tgttagaagc 
aaaggagaaa 
gctcaagggg 
aggtataaca 
aaaaatagaa 
ccacctggga 
cctgaagtca 
aactaaagcc 
ggaggtagac 
tgtagacacc 
tgtggccaag 
gtccgacaat 
catcgattgg 
aaataggacc 
ggtggctctc 
aagttctgta 
ctgaccctgt 
cccagatttg 
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aagattctat 
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gctgacaaac 
atcgtgcgac 
cctgctgcta 
tctccctcta 
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ccatgatctt 
taaaagagct 
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tacagagaac 
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ttttcaggat 
aagatccttg 
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cttcctcttg 
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tgacaatggg 
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gtactaaggc 
tatctcctaa 
cagaacccga 
agaatgcatt 
tagaacattt 
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ttttactata 
gtgcatgagt 
ctattaaatc 
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agcaatagcc 
atgagcacca 
ttcagccttc 
aaggctaaac 
ttgtagttat 
ccagtcacct 
ataatggtat 
agccatcgca 
attataaatc 
ccatcctaga 
ggacattgag 
tgttggacta 
gggtgtgtgg 
atttcgaggc 
tgtttcaaca 
taatacatgt 
ttctgctagc 
ggtcctcctg 
ggttttggtc 
catctggcaa 
tgtgagattt 
ttttcattcc 
tgtcaatcca 
ctagggtggt 
agaaaatggc 
gaccttgctg 
ccttaggagt 
accaaatgtg 
aaaatagatt 
aactggacaa 
cataatgagt 
ctcatgtagt 
tgcacgctgg 
acagaatagc 
gaggaagagt 
gttaattgac 



aacaatatcg 
aagaagtggg 
cccctcccat 
cctgaactcc 
ttgagataag 
gacatgaccc 
ccctagtgat 
taagcctgta 
ttcgacccca 
ttgccttcta 
cttgagtgct 
atgtattctg 
gaacagctca 
acaggaaaca 
cgtcaggccc 
ccaagtatca 
gtgtgttcca 
agagctgcac 
tcaaggcaag 
ttcaaaacaa 
atgacctcaa 
gtgagatggc 
tggggacaca 
tatgcctccc 
tggtcatgca 
agatcgcata 
caaaaatcaa 
tgtagtaagg 
ttccgagtgc 
gggtacactg 
ctaactgctc 
tactgctcag 
gcagaattca 
cttaaaggtt 
tccaccctaa 
ccagcccatg 
tgttggtgag 
ctgaagacca 
cctcatggac 
cccagacaat 
cacacacctc 
ctggtctata 
aaaccaaaac 
gggtaccagg 
ctagtgaagc 
agctttcctt 
ttaagaggtt 
atgtcttgac 
tgctcctcac 
tgagaactgg 
tcacaccaga 
cactttgctg 
taacgactgc 
cagtctgaag 
attgcttgct 
aagaaatatg 
tattacagag 
atgttggtac 
tagtttgctt 
ctatggtctc 
caatgtaagc 
tctgtgatga 
ctcagtgact 
atggaaagat 



ggtccttcag 
gaatgaagaa 
ctagagattg 
tcaccctaga 
ggcctcctaa 
cttagttacg 
gtaaacttgt 
aaaaatttgg 
gagctctggt 
cattttgagt 
tgagtgaggg 
cctcatgggt 
acctgctggc 
gagagctgcc 
tacatcttaa 
aaccaccagt 
ggatgtggac 
acctttaatt 
cctgggctat 
acaaaaccaa 
atagccaaac 
tcaacatatg 
cataatggga 
acaaataaga 
tgcctataat 
gcaagcagta 
ttaagcaaca 
acaataccaa 
tcagggatgt 
ctctcagagc 
aggcattcag 
catggctgtg 
agagttatag 
ggatgcacca 
tccccaaaga 
gccttgaact 
gtattccttt 
gcagctctcc 
tcagccgcta 
actatgtaag 
tactcccagc 
gagtgaattc 
cgtaaattca 
ggttgaactc 
tgaggcaggt 
tgagagccta 
ctggacttgg 
aaaggagatc 
cactgcagga 
ggaaaactga 
cctaaccaac 
ggctcatcct 
taagaagcag 
tctttaaaat 
gtcacctatc 
gcatcaatac 
tgcaagggaa 
agttcatgaa 
ttttttttct 
aaactcacca 
atgtgccacc 
ttaacttcaa 
gggtgggcta 
caagtccatt 



gaggttgaaa 
taaaaatttt 
ttcccggaac 
gttcgaaccc 
aacaaccgca 
tagattccct 
actttccctg 
ctggtcgtcg 
ctatgttcca 
acggtctcag 
tctcccttcg 
tcagggggtt 
aggagggtgt 
tacagtctta 
aggatctaca 
ttgtagggga 
aaggagaaca 
ccatcactcg 
acaattctag 
accataaaaa 
agctctcctc 
aaagtccttc 
gaggaccaat 
aaatatatat 
ccagcactcc 
ggctagacag 
ataacagtaa 
aaatgttcct 
ataaatactt 
tgcccagaaa 
aggtcacttc 
gtacttggca 
catcatgaaa 
agtgcctttg 
cttctggcca 
cttgatggtc 
gctgataata 
aggagtcctc 
ctagattcgc 
ccaatcccat 
acgcaagagg 
caggccagcc 
ttctatcagc 
aaagtcttca 
acagccggtt 
caaagaactc 
atctgtagct 
acttgtgttt 
ctgcagatct 
ctttgtattt 
tgccaagagt 
catccttggc 
tctttccttc 
gtgtttgtta 
atattctatc 
ccactgcaaa 
aagaaaaaaa 
gaggttctac 
ctttccttcc 
ctgtaaatca 
aggcctggct 
tcatcaactt 
agggcatgct 
gtgagcagca 



actcgctcta 
tactgaactc 
actcctgaac 
tcccaactaa 
aaatgaaccg 
tggcagaacc 
cccagctctc 
attctcctct 
tgtgctttqt 
tgtcttcttg 
ggggtctttc 
tccctcagca 
gggaaaggac 
caggcctacc 
gtttattaaa 
taaaaataca 
cagttgttta 
taagagggaa 
attagccaga 
agcatttctg 
agtccaaacc 
ccaaaaatat 
ttctacaagt 
aataaaaaga 
agaggctgag 
ggctacatag 
• caaccacaac 
ttaaggacag 
agaaaacttc 
gttgtttatc 
tgtagtagcc 
ttatcatttt 
gtttccacca 
gggtgataaa 
tctcataatg 
ctgtctcagc 
caagcaaatt 
aatgcagact 
aacctattca 
tttaatacac 
cagaggcagg 
agggctacac 
tctatttcct 
tgtttacgta 
ctttactgct 
tttttttctt 
gtcatatcac 
gctgcagtgt 
attctgcctt 
gggctttgaa 
tctgctgact 
ctgcagttta 
cagaaatttt 
tggtagaata 
aggaagcaga 
aagtgtaaat 
tcagccagtt 
aaaatggctg 
ctttcctttc 
ccttgaactt 
cacacatttg 
gacataacca 
cataagggat 
acacgccctg 



agattagagc 
ttcttcaccc 
tcttcacccc 
aaactgttcc 
ggtacattgc 
ccttgtcccc 
cccccttgag 
acaccactag 
tgctgttgtt 
ggtccgcggc 
aaaactactt 
aattcaggga 
aagtgttcat 
accactgacc 
agaacactac 
aggaacacat 
aaagcttaac 
gcaggttcat 
gctacatcgt 
aggctttggg 
aaactgcaaa 
tgacaactgt 
taccctctga 
aagaagtcta 
gcaggaggat 
tgtaaacctg 
aaaaacccaa 
ttctggaatc 
ccctggagaa 
ctggattcat 
aatgtctaaa 
gtgactggtt 
agttcctgat 
ttatattcaa 
taaaatgctg 
ctgtgtttgg 
cttcaagctt 
ggcccagctg 
gacaagccac 
atattcatct 
cagatctctg 
agagaaaacc 
tagagaattc 
gcaagtttcc 
ccttgcaaat 
taggtctcca 
agacattcaa 
cccttctggc 
tttagttgac 
tttgtccatt 
tttgttttct 
tccccaggaa 
agtctatcta 
ttttgagttg 
cgtcccattt 
aaataataaa 
tcagaattgt 
ggggtgggaa 
cttacaaggt 
ctgatccttc 
gtttttcaat 
agaatcgtct 
tatcctgatt 
aacagaagtc 
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ttctgaagta 
cgtgtttatt 
aacatgaact 
tatgctaaga 
agctttggaa 
atctgatttt 
agactcagaa 
tgtcattgtt 
ccaggctaga 
tttgagacag 
ctggcctcga 
gtgccaccac 
tgttcctgcc 
tagtaaacca 
cccaaccatt 
tttttcagct 
ggttctgatt 
tgcggtaagc 
ccaggtttgt 
cattggggca 
gatttggcaa 
caacacaaca 
aagagcaccg 
tcacaaccat 
tgtactcaca 
atgtacctca 
cattttctta 
cttgacacag 
ctctggtggc 
agaagcaaag 
tacatgcatg 
tcattaaaaa 
gctttgaaag 
aaccttttga 
taatggcagc 
tagtgttttt 
acctttgtgt 
ggcttacaca 
ccagctcact 
gggttcatgg 
ccatccttag 
ttagtttgat 
aacgagtttt 
agttatgatc 
tattgcctga 
ttgactggtt 
cctccagact 
tgtgatcatg 
gggtggaaag 
agaaatgtca 
tttatatagg 
aactctaagc 
acccttcaag 
tacgcacaag 
acacaactta 
acatgggaag 
nnnnnnnnnn 
nagagagaga 
gctgagtcag 
cgctgtccca 
gctgcctcat 
cagctgggcc 
gtgctttttt 
aatccttgca 



taagaggaga 
ctctgtctgt 
gcaccctgga 
tattttatcg 
ctgaacctaa 
gatggtaaaa 
aagaagttag 
ctgaatgtaa 
atctgaagga 
ggtttctctg 
actaagaaat 
accaggctag 
aatgtaccaa 
gaataactta 
tttggaattc 
tttttcaagt 
agcatttcaa 
agccacacac 
ttctctatcc 
cgtgcaactc 
tgtcaccctc 
cctctcatct 
actgctcttc 
ctgtaatggg 
tatattaaat 
gagtccaaat 
ttgaggttac 
cagacaagca 
acttcctgcc 
gacctactct 
cacatggtac 
gaaaaatcag 
ccaagaagtg 
caaaactaaa 
agtgggatta 
gtgtttattc 
gggtcctaga 
gtaggtggtc 
ggccttcttt 
ccagtgctct 
ctgtggtcct 
catctctctt 
ctacttctgt 
ctcacagtcc 
ccctgctcgg 
ttgttccaat 
gcagggatta 
tggtgctcag 
attccaagaa 
gggacatttc 
ttaacctgaa 
aactaaggaa 
ccccacccaa 
taacatcata 
aaaagagagg 
gttggaggca 
nnnnnnnnnn 
gcctattatg 
aactagccta 
gatttgcctt 
gctaatcagt 
agagccacta 
ttttttttta 
ataggctgca 



aagcttgatg 
tcttgaccgt 
attgcaaggc 
cagcaataga 
ggctgttata 
tcatgtgttt 
gacttgaatt 
gtgcctcagc 
actcctctat 
tgtagccctg 
ctgcctgcct 
gaactcgtct 
atagtcaatt 
tctagccaaa 
tgtgagggga 
tgcaggtcafc 
aatgaggcaa 
tcttgcttgg 
taggcctctc 
aaacacggag 
ctcccttctc 
caagaattgc 
tgaaggttct 
atctgattac 
aaataaatct 
gcttcttcct 
tgcttctctg 
gttctttaaa 
atgactgatg 
ccaaagccgt 
acacacatac 
ggctcagcaa 
gatattaatt 
aaatataaca 
gtttcttttt 
atattccaca 
gattgaactc 
aggcttgfcat 
gtttaaaaaa 
ttafccttaaa 
aggagaaaaa 
tgaagatata 
tcaacaaact 
ggtgggtcag 
tgcatccaca 
ttttagtata 
atgttcttga 
cactcttgca 
ccagaggaag 
tagacctctg 
aggggaaaaa 
tgttgggggg 
gctggttatc 
tggatgggca 
ccatgaattt 
gnnnnnnnnn 
nnnnnnnnnn 
tcgttggttg 
gctgagacac 
tgggggccct 
tgaggttgtg 
aggggagtct 
aatgttccct 
agacaggcaa 



agagcaagca 
agatgtgatg 
aaacaaacct 
aatgaaactt 
attcactagg 
gtttctggat 
ccgttttaga 
tgaaaataca 
tttttgtttg 
actgtcctgg 
ctgcttccca 
attacacatt 
gattcctgtt 
gtctgcctat 
atccacagat 
gattcagtgg 
gcagagggca 
aggcttagtc 
tcttaaattc 
tgtgactcct 
cctagggtca 
cattagggct 
gagttcaaat 
ctcttctggt 
aaaaaaaaaa 
cccctgacta 
cttctaccct 
gcagggctca 
atctaaggtt 
cctctgacca 
acagaagtaa 
actttccgtg 
atagttattc 
gagtgtattt 
tagattattg 
gcatgtgtgt 
aagtcatcaa 
ctttggaagg 
agaaaaaaaa 
atcaactgca 
catacagttg 
atcagctcac 
acccaagctg 
aagcctaagc 
ttcacatcct 
tgtgctgctg 
ctgccacaga 
cccaaggctc 
aagaaagtca 
aagtctcaag 
attcttacag 
ggggtagtct 
caaaacaaac 
gattgcattt 
aagagagagc 
nnnnnnnnnn 
nnnnnnnnnn 
cttctaatca 
tgtccaccca 
tgaaatgaaa 
aaaatagccc 
ggtcctttgg 
gctttaggct 
gaatatcatc 



ggcaagcaag 
gctgtcttgg 
tttcctcttc 
agaacaggcc 
atagggacca 
atgatagatt 
gacattccag 
aagagtacag 
cttgtttgtt 
aactcacttt 
agtgctggga 
aacacccctc 
tatttaccac 
tagccatatt 
gctgtagaca 
gtcatgaaat 
tattgtcaca 
agtttctggc 
caaacatagt 
ttccccatct 
gttttacctc 
ggtgagatgg 
cccagcaacc 
gtgtctgaag 
aaaaaaagaa 
cactcacgct 
ggctccttct 
\ggaccagtga 
aagcctagaa 
ccatgtgtaa 
aaagagattt 
tagaaaacta 
attatagcag 
tttttgtaat 
ttattatttt 
ggaattggat 
gcttgcacag 
caagcatttt 
agtccttttt 
aacttttatc 
gatggcttta 
atcacactca 
agcagctcca 
gggcgtggct 
ttcctggtga 
aaacaatctt 
gcactaatat 
ggggaacatg 
gaggtgagac 
aacaaggcct 
gggtccaacg 
tccccaggga 
tggtcagtcc 
aggaatacac 
aaagcaaagt 
nnnnnnnnnn 
nnnnnnnnnn 
ttagaaaacc 
ctgtcccaga 
gtcaccagca 
tgcagtggtt 
agcagagtta 
cagtgctgtg 
cctgttttgc 



ccaggatcca 
cttcctggga 
caagttgctt 
cataactgcc 
ctggaagtga 
tatcaatttg 
agaaaactga 
ggaagaaagc 
tggttggttt 
gtagaccagg 
ttaaaggcgt 
tttaattaac 
atgtttctgt 
ttcatcagtt 
gctttagaca 
taatttagtg 
gcacagcaca 
tctaaacgcc 
tagacattac 
gcggttccca 
tcacactcca 
ctcagaggtt 
acatggtggc 
acagctacag 
ttgccattaa 
ggcctgagtc 
gctgcctatc 
gactgatcgg 
cccacgaggt 
actgcacatg 
aaattgaaaa 
gagtacttag 
agatttctaa 
gtaagtttac 
tattaattat 
ttctgcttcc 
taggtggtca 
acttcctgtg 
tgtttaatta 
tggtaaaaag 
tcctgcaggc 
agcctctgcc 
aacaacagcc 
acctcgctgc 
gtgtggttct 
tttgcctctg 
ttactgaaca 
gaggaagagg 
tgcatctcct 
gaaagtctta 
ttagacaaag 
acactcctct 
tgaagccata 
acatacacac 
gggaaggggt 
nnnnnnnnnn 
nnnnnnnnnn 
actctcttag 
gcaaggccat 
ggctccggaa 
cctgggcctg 
acagtcatca 
cgctacttct 
cctcaggcaa 
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attctgaagt 
attctcactc 
ccacctccca 
gagaatgaag 
tgaacataaa 
gagaaatgtc 
ccccaaacag 
actcatgttc 
gcaccaggag 
tcccacagtg 
cctgggcttg 
gggacgtaga 
aatgccagca 
acagactgag 
agatagtggg 
acaaaaatgg 
tttgtcacct 
gttcccagca 
ctacaaagta 
caatcaatcc 
cttccagagg 
ctccaatccc 
agcatacaat 
gtacagaaag 
tttcctgtct 
tggtgtagtc 
aacaggatca 
ccccggctgt 
caactgagga 
tctggaattt 
gcagaactgc 
gggtctgtgc 
gaaaaaaata 
gaggcaggac 
cctctgatca 
tccattagag 
agtatggtag 
agaagttcag 
gttttgtctt 
tttgctgcca 
atgacgcttg 
actaagtgga 
tcgttgggaa 
gaaagagcca 
ccaaagctgc 
cacaggaggc 
atgaaacgtc 
gggctggcag 
tgcttttcca 
gctggacttt 
ctggctctgc 
gagaggtttg 
ctcaaaggca 
ccaagcccta 
agggggctgg 
attccgttcc 
gaatctgttg 
cgcaggcaaa 
tcatgccagc 
cagcctgagc 
aaaacaaaaa 
caatggatcc 
ttctccatag 
agagttgagg 



ctggcaaatg 
tacttctgcc 
tgctcagggg 
tagaagccag 
gttcagttac 
aagtgctaca 
atacaaacct 
cagcccagtt 
gaaatataat 
actgtggata 
agatctagca 
ggcagaagga 
tttgagaggc 
ttccaagaca 
agagaattca 
agcatgaatg 
acataccaat 
tctgggaggt 
agttctagga 
atcagtcaat 
acctgggttt 
aagggacatg 
atatatatgg 
agatcaacta 
gacatctgac 
cctgtgaccc 
aatcccacag 
ctgtgcagca 
gtgcctcagt 
gtcatttgat 
agaggaaaag 
tttctcttac 
gatgcatcct 
tcatcatgca 
aatcagtttc 
gcctgcatgc 
tatctgttta 
agttatcctt 
aacaaaacga 
agtttgatga 
aactccacgc 
taaatgttaa 
gggaggggaa 
gtgtgccacc 
ccagtgtgcc 
cagcaggagc 
tttccctttc 
ggctcatcca 
aggtcttagt 
actctgctgt 
caaccaccat 
gtctggtcct 
agtgctggtc 
gaggacagac 
agaaatagct 
taacattttc 
ccctctctgg 
acactcatac 
gatgctctca 
tatagaatga 
caaatctgtg 
ctagagctct 
accacatttc 
tttacatggt 



aaagatgtgg 
tgtgccacct 
ctaggaagtg 
cttcaaatct 
atttggttta 
attcagtgga 
tcatcaaagt 
ggctccttct 
agatgtcctt 
cagcccagtt 
ccctaaggca 
tcagttcaag 
agaaacagat 
gccaaggcta 
aggttatctt 
cttgcaactg 
tatacaaatc 
agagatgggc 
cagccaaccc 
catgggctgg 
gattcccagc 
acaacttcta 
gtaaaatgct 
taatgaaaca 
ttcttgcgtg 
aggattaggc 
tgctccactg 
ccgcccagat 
gctcctcagg 
atttccagac 
gggcactact 
aaatcattgc 
cacagtacag 
gcctctttcc 
aggcctggaa 
acgaagctct 
gcatggtgtg 
gcacttacag 
aacaagaggg 
cctgagtttg 
acatgccaca 
aaataaataa 
aaggcaggcg 
aaagctgccc 
accaaagctg 
agccaggttc 
ttcagaccac 
ctctagtgtg 
gtctaattaa 
aaggactttg 
gtcagtgggt 
tgtctaatgc 
cagcatgtgt 
atgctttaaa 
cagcagttaa 
atggtggctc 
ttttctcagg 
atatgaaaca 
gcactcagat 
gatgctgtct 
ggcttaatca 
gctttgcccc 
tattttgggg 
catttttttt 



gatttgaaca 
tcctctcatg 
gggagaagat 
aaactaagca 
aaaaaaaaaa 
tgtggatgaa 
ctcttccaaa 
catgtgagct 
tttaaggggg 
agtagagttc 
tagtggtaca 
gtcagatgag 
gaatttgtga 
cacagagaaa 
ggactgcata 
tggacaatat 
agattcagct 
agagctctat 
tacagagaga 
agagatagct 
acccacatgg 
ctggtctctt 
atatatataa 
accataacac 
acccagtgcc 
aaccactgcc 
taccccgcca 
gcttggtgtc 
gcagtgtgca 
tccagctgac 
gtgactgtta 
ggcctctctt 
gatggcaggt 
ctcaactaca 
agaaacggcc 
ggatttgatc 
gaggtatacc 
taagttcaaa 
gccggggaga 
gtcccttgag 
gcccacgcat 
atcaaaggaa 
agggagatag 
agtgtgccac 
cccagacttg 
cacctcagag 
agcagtgaca 
cctgtggcca 
ggttagcagc 
ggcattgttc 
cttcagaggt 
aaaatgcctg 
ggaatcctaa 
aaaaagtcat 
gagcactagc 
aaagctgtct 
caccaagaac 
ttttttataa 
ggcagaggca 
caaaaagaaa 
ttcctagcca 
ctggtctgga 
accactctgg 
gttgttacag 



cagacttgtt 
cacggggagg 
ggatgtcctc 
atgttttatt 
tccctacaca 



acaatcaaaa 
ggctgggtct 
ccgacttcca 
gtggggtctg 
tcgcctagca 
tgcccatgac 
gggtggggag 
gttcaaggcc 
ccctgtcttg 
agactttgat 
tgggttcata 
gggcccactg 
gactttaagg 
tacactactt 
cagtgatcaa 
cagctcacag 
tggtcaacag 
aaatcagatt 
atactgtttt 
tccacaatga 
cctgggaagc 
gaggacgtga 
actgagtgac 
attgaaactt 
cttggataac 
tcccctgcat 
cacttccctc 



acgaggcggg 
ccgccgcccc 
actcaggctg 
ttcagcacgg 
tatctctgca 
gctagcttgg 
tgcttcgctg 
cttatggtgg 
gaatgtgtat 
tggccactca 
ataaccctga 
caaagctgcc 
attacagatt 
gtggagccac 
gctgtcctgc 
gaacaggcct 
caaattggag 
cattccgtga 
agaagaactc 
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gttcaatacc 
gctttaaaaa 
tgctctttca 
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acatgtggtg 
acctgctcgg 
gggggatttt 
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gaaggagctg 
gagcttgctc 
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aaagcagggt 
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gacaagcaga 
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agcctccctt 
catcggagcc 
gcatgtgatt 
agcacagagg 
gattatagaa 
ctttgaagtc 
ttccgggact 
tgcagagttc 
gggatgtggt 
gcataagccc 
cagggagacc 
gcaacatgaa 
tgctgagtca 
aaggagagaa 
acacacacac 
aaatctacca 
tatgaacacg 
cagtgtgcca 
tggccaggga 
aaacctggaa 
agagtctgga 
cagtcacagg 
agagaagggt 
tcaaatacca 
atcctttttt 
ttaatgtcaa 
catcagagcc 
aattctgttg 
gaggacccag 
tcctagaggg 
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tcacaaaagg 

aaccgaatcc 

ttatggcaac 

tgcaaagtct 

gtcataaata 

ctaatgagga 

ttctaatata 

taaaatgata 

atccaaagca 

tctcatccgc 

taggacagac 

tattaggctt 

aaaaagcaac 

ttgcataatg 

gaacccggca 

ttgcagggag 

gacctgactt 

ggattgctct 

agcatggatc 

tgcataccag 

aaccacccaa 

ggattcataa 

tacccaaaat 

tttggaaaac 

gcaccaagac 

gccgctcacc 

cacagtcgca 

tttgtgaaac 

gtaattcaaa 

tgcctactga 

taacttccag 

gacccccatc 

agggtcttgt 

gcattattta 

catttcctct 

tatccttgat 

tttctgggtt 

accaacatga 

acaggcatgc 

cagagtaatt 

aatgagttct 

gggtgatgtg 

tggggagctg 

ggtacactcc 

catggcgggt 

aagtttgaca 

ccacaccacc 

caacaagaga 

ttgtgaggaa 

ttagtcggtg 

gaacaccagc 

ccgaattcca 

tggcgctcag 

tatcccctgt 

ctcccctcct 

gaccccctcc 

tttatcaagt 

gtttctattt 

actgagggtc 

accacgcacc 

ctcgcccaaa 

aaaagctacc 

acagggcggg 

tcaatattta 



aaaataggaa 

taaaaggaat 

tgacatttaa 

tgcaggaact 

tagcttattc 

aatatttcca 

atttatcaat 

aatagcaata 

gtgtagctat 

agatacagtc 

gaaggggggc 

tgttataaga 

tacttggggt 

aacaattttc 

caggctcata 

agaaagcagt 

taaacagttt 

caaagactct 

tccggccctt 

gaggactgaa 

actcatttcc 

aataaggcag 

ggaactgcat 

tggcacagct 

ccagggctga 

aatcccggct 

ctatgctctc 

tgttttcgaa 

actaattacc 

ttgtgcgctg 

gctaaatcag 

cctgacttcc 

agttctggct 

atttaggcta 

tcttcaattc 

cgtcccaccc 

tgcaaggtgg 

gcggcgcaca 

acacaccggc 

aggcctaggc 

cttcctttgg 

gctgaaacca 

cagtcggaag 

cagaaataga 

ttttagacat 

gagtgatatt 

gccacccaag 

gggagccggt 

ttgagtgtac 

aagcaggggt 

tcgaggaaca 

cagggcttaa 

caatctcatt 

atctgactca 

cctgcctatg 

actggactct 

gttctgttgg 

cctttacatt 

ctggtagttt 

aagcaaaaat 

gccagattta 

acttggcaga 

atggggcggg 

acctaactgg 



aatatgcctc 

gttcaagtaa 

aagtaacagg 

tcatgctaaa 

ccaaatcctt 

caaaaaccca 

ttaaacacat 

gtggtgaggg 

atttciaatac 

tagactcaga 

aataagaaat 

ttacatcaaa 

ttgcgcctgg 

attcaatcag 

gtttttatct 

cgggaggcat 

ctctctggag 

cggactcctg 

ctcactgctt 

gtttggctcc 

gacttggttg 

aggcagagaa 

tttcatgcac 

gacattggct 

ggcctaatcc 

ggagccgcag 

ccgtggggag 

aacctgggat 

ctgagcacat 

cagctcactc 

gcagccacag 
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ttaggaatat 

accgggttta 

ttatcacatg 

ctatctttgt 

aacactccaa 

aactgcgctc 

ccaaaaaggc 

gtattaagcc 

tattatttct 

cattctcctg 

catgctfctgc 

ctcaccccac 

tgagccttct 

tcttgtagcg 

tagagttgga 

tgcacccacc 

ttgctcctgc 

agatcaagtc 

tagggtggag 

taatttgctg 
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atttgggcaa 

attatgcttt 

agaatttcta 

cttaccatta 
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agccataatg 
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gagcagtctg 
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ccaaagaaat 

ctacaaagat 

ctcagtaagc 

gtcttcacaa 

atacttggaa 

cttgcattaa 

tctcacatgg 

gttagatgac 

cttccctcag 

tgtaaagtgt 

agaagtgcag 

cgaataattg 

tgacaattac 



ccgaggcaaa 

atatggctca 

gtttctttct 

aattttaatt 

ccctgcaagg 

agaaccccca 

gccactctgt 

aaaaccagga 

tgtagaggaa 

cttgactgta 

ttcagaagaa 

tgttttaatc 

tgtgtactga 

gagatagctc 

cctgctggag 

gctgtgtacc 

gctctgtaag 

gtaaggtcct 

gggatttaga 

aggacaaggt 

caagtttccc 

tgtgtgtgtg 

ttaaagactg 

acggccaggg 

gagggtttag 

ttcacttggc 

gagggggagc 

atgtttaaaa 

tatgccatca 

ataaactgct 

cagccctggg 

tctccagaga 

cgtgttcata 

gagtcttcct 

ggtccaaata 

tgggagccaa 

aaggtgtgtg 

acgtgccctg 

tccccagcaa 

ttcccccttt 

agtggctaga 

ggcagaagct 

gaaatgagca 

attagcgctt 

tggattccca 

ttaagagact 

caagagaatt 

tgagcagccc 
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caaggtctag 
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catcctggtt 

cgacagctgg 
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tgcgaagttg 

ccaccctctc 

agccaaatag 
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actatgcaag 

ttaataacac 

cagcatatgg 

ttcaattcgt 

catttgactg 

gcttcccaca 

aactttaata 

gagcgtgaga 

tgagacaaaa 

atgggagttt 

ttagtggttc 

tatggagatg 

gtagaagtgg 

tgcaccccaa 

aaattttagg 

tcgagttgag 

ttttttttta 

tggctccctg 

aaccaccgaa 

tacaagctct 

aattaccgac 

ttaaaagttt 

attagggact 

aattatacat 

aggttatgaa 

cagctccaag 

ggatcactta 

ccatcttccc 

gggtctctcc 

tttttttttt 

aactcacttt 

ctcctgagtg 

tgaagttccc 

tccggctcag 

aacccccaca 

gcaaggctac 

tgtcggaggg 

tgtcctttgc 

gccttggtgg 

atgcctccct 

atttacctgc 

agagaccctc 

gcccatcaga 

gtgattcagc 

gccactttta 

gggtatataa 

cctcacgcct 

actaagtgtg 

ggtgcctggt 

gttatgtcac 

tgctctcgta 

gaactgtaag 

aaagtaactt 

ttgtgtctgt 

ccactgacat 

gtcattcaga 

cctagggtac 

ttgtggatta 

acatcaacag 

cttgaacaaa 

tccggccttt 

ttatttcatt 

ctaaggttac 

tgagatgaag 



aatgttgctc 

acagaatgag 

ctgtgcacat 

cctcagatag 

acaggcatct 

gcccggcttt 

aggattttct 

caggaatggg 

gaatattaat 

tgaggcatgg 

atttatggag 

tagcttataa 

tgggggtggg 

gcctatgttt 

ccaatagaaa 

cagtggtgtc 

aagtccactt 

cagcgagagg 

agggaactgc 

gagcaagcta 

tgggtaatcc 

acacaatcac 

ttttgggtgg 

tgagatttgt 

atggccaggt 

tcataagctc 

ggctacccac 

aacatggctt 

aatgtggact 

tntttggttt 

gtagaccagg 

ctggaattaa 

agtgacctgg 

aattttgggg 

ttgggagagg 

acagcaagaa 

tgtgaggcct 

tccattactt 

ctgcaatctg 

ccccaccctc 

cagcctttgg 

acagaagcac 

agaacccagg 

cccataaatt 

ttgaaataga 

gagataatcc 

gcctgcaagg 

gaggtatgga 

tggtggcacc 

tgggfcatggg 

aggttcctgc 

ccacttactc 

gtgtggcagc 

ctgtctgtct 

gtctcagtcc 

ctggttttga 

tcgaccagac 

tttgtgtata 

tatttagcat 

tgaaaatcaa 

taatacagtt 

gcttaaccag 

ccaaccacgc 

aacaatcaac 



tctaattaag 

ccttggctcc 

aggccgtgag 

agcccagaga 

tcccggaaag 

caaccaggaa 

cacagaaaga 

gacacaagcc 

atcttggcta 

ctaagctggg 

ggcttgctgc 

aactactcag 

gtggggcaag 

agggttgatc 

tgacctctgc 

aacggagaga 

gcttgggact 

ctccagtttt 

acagcacaca 

tctgaagaag 

actagggagc 

ttttgagttg 

tttactcgag 

caactgctgg 

aaggttgggt 

aaacaagtct 

ggacaagcac 

ctatgttgct 

caaatcatga 

ttcgagacag 

ctggcctcgg 

aggtgtatgc 

atgtcaactg 

ggacaaggta 

cagaggcaga 

gctgtctcaa 

tgggggccag 

aatcagaatc 

catttaagag 

cctgcatccc 

tgggacacag 

ctgaatgtac 

cttatgctaa 

tacacatccg 

aaagatgcct 

catgttgtct 

tagccaccaa 

taaaaaaaaa 

gtttggggag 

ctttgagagt 

caccatgttt 

tcaggtctct 

cagctaagca 

gtcattccat 

tcttggtatc 

gcttgcaatc 

tgggagtagc 

aagatctgat 

caacttaata 

acagcagtac 

cctcgtgttg 

agttaactgg 

taggaaggag 

ataacaggga 



agggctctgc 

gggagctaaa 

tgatgcagcc 

gcgcggctca 

cctgcgcgtg 

ggcttggcgt 

aaagtccatg 

aacaccccaa 

tgaaggagga 

caaaccattt 

ccggagagcc 

attttaaaca 

agaacaattg 

agcttcccga 

accacggctg 

gcagaggaaa 

cacctgaagg 

cccaggcacc 

agttaaatat 

ctgtcatttt 

aggtagtttt 

actataagta 

gctgcaacca 

ggagtagtgg 

gcttccaagt 

tcaaggggcc 

ttctcataca ■ 

tcaacaacca 

ctacagcntg 

ggtttctcca 

actcagaaac 

taccacgccc 

aagttggatt 

ccttgattta 

ggcagaggca 

aaccaaagac 

aacaaggtgg 

gccatcacag 

ctaagtggga 
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gcttagtggc 

tttcagcgct 
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attctggagt 

ttgatgtggt 
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aactcatttg 
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aagggttaat 

aggaaagggc 

cagagcagtc 



atttcctagt 

ggttccatta 

cagttaagcc 

ggccctcacg 

cctacactgc 

gggtctgatc 

ggaacaaatc 

ttgctaggct 

tggtgccat;c 

tctttttttt 

catcagaaga 

aacagtgcag 

catctgcaga 

ggcaagccca 

actgaagcta 

gtccaatcag 

cagggcattg 

agcccatcgg 

aggctgggtt 
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ggaagaacag 

tttcacacga 

acaatgagtg 

agggtcctgg 

ctcaaatata 
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gggcagggtg 
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ttactgtgat 
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agtagaagca 
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gctaagtggt 

gctagggaga 
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ggagcacggg 
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ccacgagccc 

aaatggacct 

cttcaagagt 
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aattaggggt 

tttttttttt 

gctgtcctgg 
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gagagagagg 
aacacaagtc 
attaattatg 
tatttatact 
caatttgacc 
aatggaaggc 
tgccttttac 
tagagcccct 
attgagctct 
gtccaagaga 
aatccctcac 
tttttgagta 
ccacgggttt 
ggattggttc 
tgaccgaatg 
ccatctctaa 
aacgctttgt 
gaatcaagtg 
tactaaacca 
aaagatgcca 
ggagacagaa 
aagctaccca 
cttgtatttc 
acatagggaa 
agggtgcatc 
gtgagggccg 
tgggtggccc 
gtttccaagt 
ccagtatcaa 
acacacagag 
ggctgccaca 
cagcagcagc 
agcaactgaa 
tgcacacatc 
gcagagatgt 
ttaggaacac 
gttttcttcc 
taaacatcaa 
ccttttgtgt 
atctccaggc 
tatttattat 
aagtaggatc 
tcttaaattc 
ggtaactgat 
tgtctgcagg 
caagtaactg 
ttccccaagg 
taatatcaga 
cagatattaa 
caaccatgca 
cacctgggtt 
cactggaaaa 
acttgtgaca 
aggcaggagg 
acacacagcc 
tgggttgcca 
aagccattct 
tcacatgttc 
cttcacgtcc 
tttgtgagtg 
tcaatgagtc 
fccctggagtt 
tcagacctag 
agcagacagg 



ctgagtttct 
aatataaacc 
tagtcaatag 
tttcgaagac 
ctgtagttca 
ataaaacact 
aaagagactg 
ccctcccctc 
aagtgataga 
attcaaagaa 
aggcgttaac 
gggaaatctg 
ctacatgcag 
agcgacctca 
ctgtgagcag 
cctgcaattt 
aagcaaggtc 
taactcaatc 
gtgagcctca 
acagctaagg 
acaggaggat 
cttgaagact 
tagtgtatga 
gatggtgcct 
gatctaaatc 
ttccagagat 
aggggtccag 
attaggacca 
gcttcaggcc 
cactagcatt 
gtgagagtcc 
catttctcca 
acatggggtg 
acaggccagg 
gcagcagatg 
actgagcagg 
tcgatcacgg 
tccaaccccg 
tttcattcat 
ctcagtctaa 
gaaggctaat 
cacaaagata 
tatttaacat 
gcaaactgaa 
ctttgtcgga 
accccactat 
ggacagcaag 
gtcatctgac 
cgataggcct 
gtaactctgc 
ctcccctact 
gcacactcca 
taatttctgt 

aggggactgc 
ctctgagctc 
gcctccccta 
atttctgttc 
ttcctgatgt 
atcacagtgg 
gcagagcctg 
cagcaggaag 
gggccagatc 
catcctcccg 
taactcaccc 



acttctataa 
tttagacatg 
ccatgggttt 
aagccactca 
aaaaaagtca 
ggaaatatgg 
gagggaattc 
agctgcttaa 
tgagaatcag 
ctgtgggcca 
agtggaacca 
ttaaacatcg 
ctgtgctaga 
gaggacattc 
gagatatcaa 
gtcaatctca 
agatagaggc 
cctatctcct 
agcaaagcct 
gccagggatg 
caggacttca 
gtccccaaca 
gacgaaggaa 
caaaaacagg 
cacatacctg 
ttaataaaga 
actgaataaa 
cagcctggcc 
agcggcaggg 
cacctcctgg 
cctaccttcc 
gtaacttgct 
ctgtgacccc 
cctgaagatg 
ccaagaggtg 
catgctcttg 
tctccacgtt 
aggggccaga 
ttaaagatgc 
tctctgagtc 
taattgcttt 
tactgtttat 
agctttaata 
gccatgctct 
tttactaaga 
gtggataatg 
ccagtgctta 
tcagtctatt 
ttgattaata 
aaaacagacc 
cttctccccc 
ggttacataa 
tfcgaagataa 
tgccacacta 
actgtctttg 
gaggatcttc 
ttctctcttc 
tcgcgatcag 
ccttaattcg 
gcgtgcagcg 
aaggaaagtt 
ctcttccaca 
gagaagcctc 
taaagcatca 



ataaaccctt 
gggctgccaa 
cattagcgta 
gggaaaaaat 
gaacagcaca 
acagaaatca 
agaaactatt 
cactggggtt 
aacaggaaca 
gaatctactt 
atttccaagg 
tcccacgagg 
tctgctgaag 
ttgttcacta 
aggccggcta 
gacagcaatg 
tccataaaaa 
gagattaggg 
gtctgttctc 
tagtgcaggg 
aggtagtttg 
aaacaaacaa 
gaaagctcaa 
acagccgagg 
gatttaaagt 
cccaccctga 
gggaaaatgg 
cctggcatgt 
cactggacag 
tctcttcttg 
tcaccgttaa 
ttccacagat 
ttggcaccac 
ctgggggact 
ggctgcagca 
ccgaatgaaa 
tcagagttgg 
tcatcggtgt 
attccagggt 
tgtaatgagt 
ccagttacaa 
tcaaacaaag 
aaggtacaca 
gtagcagcct 
ttctgttatc 
aagtaaatta 
tcagccgtcc 
aaacctatca 
attctacctt 
ctttgattcc 
caggaggaac 
tttgcctcat 
caaaatttca 
ccggtggctg 
cttatcagtg 
attgtggagc 
cgttctggcc 
ccgtctgcca 
agattcatcc 
gaagagagaa 
tacaagtctc 
gaccctttcc 
tgtctttcta 
ctttcatcta 



ggcaggcgga 
acttcacttt 
ttaaatacca 
ggtgggggga 
ctagagatta 
gatccctgcc 
taaaataaag 
gtggtggacg 
gtgtttttga 
aggcag.tcct 
cagccctgct 
gagcccagct 
tggccggtga 
gccctcgtgc 
ctggactgaa 
aagactgtga 
ttgttcaggg 
aagggaagga 
agcaaggtga 
tgctgtgata 
ggctataaaa 
gctgggtatg 
gcccgagacc 
agcagacaga 
aatatctggg 
ctgagtatgg 
• gaggaagttc 
gctggccagc 
ttcccacaca 
acaacagata 
ggatggtaca 
actgttatag 
aaagccatgg 
gcaatgctgc 
accagagata 
agcctcgcag 
cttggtgtta 
tcctgggctc 
tgcaaacatt 
taacatcttt 
gaatccttta 
caaaggaaac 
ggtccgcctg 
ggatgtccca 
ttcaaacagg 
tgcaatttgg 
tcagaggaga 
aaccctgaag 
gttgccattc 
aggcagacgc 
tcaagacaaa 
tatccagagt 
tgaaatccga 
agaactggag 
agtcccaaga 
tgtcccatgg 
accagtggta 
tagtctctga 
tttagaaaag 
ctttctttgc 
agagagaaag 
ccatcctagg 
tgggtgcagt 
gaggagctct 



tcactaaagg 
tcgacagtat 
cgatcaatat 
ggaggaggaa 
gcaagggttt 
ttcatttttc 
gcaaaatgat 
caaaataagc 
ggcaaaatat 
ctgggacccg 
ggtgatctga 
ctttcactcc 
ggaggtgtgg 
actggggcga 
aactagatca 
ttttctagtc 
ttcaggcaga 
aggctgtgtc 
gccacccacc 
tcaacagctg 
tataagcttg 
gtggtcgatg 
tgcttgggtt 
cagggcagac 
agactggtct 
gcaacaccca 
agctggtagt 
tagtctagct 
cgacacacac 
aaatgtaact 
aactgtgagc 
cactaagaaa 
caagctgaag 
ctggattctg 
attaatatga 
tgtaatgact 
ggctgccgcg 
aatcgccttt 
agtgagaafcc 
ccctagtgaa 
cagtcaaaga 
aaagcttctt 
gcaaccgaac 
gtgccacctc 
gattgtgtct 
gggtttgctt 
caattctgat 
gaaggatatt 
taagcattaa 
accctctgaa 
aaggtgccac 
ggggttaatg 
caaagccgga 
cggaaggfctc 
ggggcccaga 
ggcgggaagg 
cttgctccca 
agtccacggg 
agagaagctg 
ttcagtggct 
tgctgtgact 
tgccctgtgc 
gggggcccag 
gtggtagtag 
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ggactgaggc 

tccatcccag 

ccagcctacc 

ggtgataacg 

cacttgtctc 

cacaagtcta 

ccgctgttta 

ctccatggct 

aagcttgcaa 

ctagatagat 

ctggacatca 

tttgtttaat 

annnnnnnnn 

nnnnnnnnnn 

ctcctgctgg 

cacagtgtca 

gaactcgagt 

gattccctgg 

tgtaccgatg 

gagaacagca 

gagggggggt 

aagttccaca 

agaaacagat 

ttttctcctt 

tatatctaag 

cattatggat 

agaacagtca 

tgagttaggt 

caatgatact 

ttgctggtct 

gttatgtacc 

gtttctctgg 

ctcagaaatc 

cccagtttaa 

ctggatgttc 

cctctgcctc 

ttttttaatg 

cccagagcgg 

ttttaaaggc 

cacacccgta 

tgctggtctg 

cagtttgagc 

gcagagaagc 

aacaggaaat 

cttctctact 

aaattaaact 

gcctggttct 

gcctcactag 

tggcctgttc 

cagaaccaaa 

ctcttcagca 

acgtgcggtt 

tggagctcag 

tgcggagggg 

ctgcacccta 

cactgcgagg 

aatggcccag 

cccttcctca 

tgatgcccgg 

gcgttggctg 

gggtgaagca 

gactcgagtg 

cgtgcccttt 

ggctgcccag 



ttctgctcca 

gaagacagaa 

tatgttcatt 

cagaatgcac 

cttgaactaa 

gctgggggcc 

caagtatggt 

ttaggttcag 

caacagctcc 

ggggcaggac 

ggattagttt 

attctgagtt 

nnnnnnnnnn 

nnnnnnnnnn 

ggttacaggc 

agctggatat 

tggtgaaata 

tcacatgctt 

catgcttgca 

agtgaacaaa 

gtaaagctct 

tgtaaggtca 

gcagtctgtc 

cctttctttt 

tacactgtag 

ggttgtgagc 

gtgctcttaa 

tttgtgtagc 

ctgcctgatc 

ggaactaaca 

accacacctg 

gtagtcctgg 

cgcctgcctc 

gggttttttt 

tggaactcac 

ccaagtgctg 

tgtgtatttg 

aagtgttcfct 

accaaactct 

cacacccgta 

gcctgtgtat 

acaggctctc 

atctgcaagg 

ctcaggagag 

tcccagaaca 

ccttggtttg 

gctgctggca 

atccgcgtct 

aggatgatct 

cgagcctcgg 

ggggtgggga 

gtttccgtgt 

cgctggtgct 

taagaaaacc 

ccccccgctc 

agggccgtgg 

gggagcagaa 

ggacctgaca 

cagctggccc 

gctagtctcc 

gaagtcagga 

gaggtgcatg 

gaggaggggt 

aattaccgga 



gctctgggca 

aatccctaca 

taatacaaca 

aaactctgcg 

acagtgtctt 

ttcatgcttc 

taggagcccg 

agtcattctc 

aacacagaaa 

ggactgtcca 

aaacacttgt 

ttagaaccta 

nnnnnnnnnn 

nnnnnnnnnn 

ttgagccacc 

gatggcacat 

gtccaccagg 

taaggaaaga 

gccacccaca 

caaacaagcc 

tggcaggtgg 

tgtgatcgag 

atatctaaac 

ttctctccct 

ctgtcttcaa 

catcatgtgg 

ccgctgagcc 

cctgggtggc 

tgtcttaatc 

gagatctgcc 

gagtttaagg 

ctgtcctgga 

tgcctcccaa 

ttgtttgttt 

tctgtagagc 

ggatfcaaagg 

tatgggtgtg 

aaccgctgaa 

taggtaggag 

cacaccacat 

gagctggaac 

tggtttctgt 

ctgtgaccac 

tatttccttt 

ttgtgtgcca 

cagagtagca 

gagctgacac 

cagagtctgc 

tcctctgcat 

acgaagcccc 

caagagcaag 

ctgggagatc 

tcgcgctccc 

cccaggcttt 

ctcgttcatc 

ccaggctcag 

ggcggaaagt 

tggaggagct 

tggcgccggg 

gggagcgccg 

ctttactgca 

ggctgcggaa 

cctgtagccc 

acctgtggcg 



aggttacttc 

ctctcccttg 

actaaaatat 

gctgcagggg 

tgaggcagaa 

catgtgctta 

actgcctggg 

tgcatgcctc 

gtgctgtgag 

gtaagcaggg 

caggtggggc 

aactgtggga 

nnnnnnnnnn 

nnnnnnnnnn 

atgacagctt 

gcttgcaata 

taaaagagct 

acttgccgaa 

cacccacaca 

gggggggggg 

ctggctcctg 

tacatctggg 

cattgttgtc 

ttctctttta 

acacaccaga 

ttgctgggat 

atctctccat 

gttaccttaa 

attttgagat 

tgtttctgcc 

gttttttgtt 

actcgctctg 

gtgctgggat 

tttcctgaga 

aggttagcct 

cgtgtgccat 

tgggtgcttg 

ccatctctct 

actatacaca 

gaccatgcct 

caaaaccttt 

tctctgtcct 

gctggctggt 

taagaacgcc 

agtggcaagt 

tgggagcatt 

ttggctaaag 

aggagaaatc 

ttaagggcgg 

ctaaaggcag 

aggcgggatc 

acatgacccg 

cgccctgctg 

ctttcctttg 

ccagtcttcc 

ccttgcgctg 

ggttcttaca 

gctccggagc 

gggcactcat 

actgggaccc 

acgttcggta 

acggagactg 

tgaaactctc 

ccatgcatac 



tctgctcttc 

atctacccga 

ctattcacag 

agacggcaga 

cagggtgaca 

gtaattaatt 

ttggcctctc 

tgcctgtctc 

ggtcgacagt 

ttcatcatgg 

acttttacca 

aacaagagtc 

nnnnnnnnnn 

ngccttgaac 

tagcaatagc 

ctaacctcca 

tgctgcccaa 

gatgtcccct 

agtgcactgt 

ggattgtgac 

gtaacactcc 

cctccaacag 

gtatctctgg 

aaaaattatt 

agagggtgtc 

ttgaactcag 

cccccaaccc 

ctacactggc 

agggactcac 

ttgcaaatgc 

tgtttgtttt 

tagaccaggc 

taaaggcgtg 

cagggtttct 

tgaactcaga 

cactgcccag 

tggaagccag 

tccctcttcc 

cacacacaca 

gagcacacaa 

gtcgggagat 

gtgtcgcatc 

gctctgccat 

agacttttgt 

tattaaccaa 

gagagggtgt 

ggctggcatt 

agagagggga 

ctggtttgcc 

taggagagac 

tcgcccggcc 

catcagctga 

cgccccggag 

tcgctggttc 

cggcctggca 

cccccaggcg 

gcagggtccg 

gtggagagag 

gtagtggccc 

tgtccccggg 

cagaggctgc 

tcctacccgc 

actcggttca 

cacacttatg 



accattcctg 

ctttctgaca 

gcactaagct 
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gcctctgcca 

tccgttggta 
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cacacataac 

nnnnnnnnnn 

ttgcttctgc 

tttgtaaatc 
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tgggaataaa 
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gtggttttat 

ccgcagtctg 
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ctacattttc 

gcctgggcca 

gtgctttgga 

atgcctaaag 

tctgagatga 

gaaggtccag 

cacgtagccc 

tgagccttgg 

ctttagagac 

cccgtcacgg 
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gcgcagtcac 

ccccggaagc 

gccaggacca 
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cagccacagc 

gagggtgtat 

cagggatggg 

ggctttgctg 

gcatgatgta 

fccttctcttc 

ggctctgtct 

cctgagcgct 

tgggacaggc 

gaaagaactt 

cctcatgcag 

gctgccctgg 

caagttccga 

ggcatttgtt 

tgacgtgctc 

gctctgtgga 

gatgcagacc 

ttgctctatc 

ggaattfcgtg 

tcactggatt 

gaatgtgtgt 

gctatttgat 

tccactcatc 

ttcaggacaa 

tggctggact 

tctggattcc 

agcatcacct 

caaagctgcc 

cttcagtccc 

cctagggagc 

gagcctcttc 

caccagggtc 

gggtctctgc 

actcctacag 

actgtttgag 

cagccccttc 

tcttttatat 

tctgtggcag 

cctgcctttc 

cctatttgaa 

gcctctcatc 

cgactgtttt 

gctgccccat 

cctggtcggg 

tgcctttggg 

cccagactac 

cttccaagct 

cctgggccag 

tggggatgat 

ggacttgaag 

ggaggaggac 

gctgtccatg 

gccgctgaca 

ctgagggggc 

accagatcca 

ctgtgtgtga 

aaagccatac 

tactcgaggg 

aagtggtggt 

aagagtgttc 

gagaggcagg 

ggtgcaggcc 

gccagcctgg 

tcgaaaaaaa 



actgccccct 

ggatgcacct 

ccctgcccct 

gagtcgcccg 

gttaccttca 

cgtgttctga 

ttgcaccaca 

tacgagatgc 

attaagtctg 

cggggccttg 

ctgaatcggt 

gtggtggact 

ctcaacaagg 

gcaggtggtg 

tctgacatca 

catgtccgag 

tggacaccgg 

caccctgaca 

gctgcccatc 

gatcttacct 

ctgcacctgg 

cagccacacc 

ccccggctgt 

cttataaatg 

agagataggc 

atctccctcc 

ggcctgttgt 

gggttggacc 

atacaggcct 

cagttggagg 

catcgagaca 

cggatactgc 

atacgccact 

ctgagcggac 

tataggccgg 

agctccgtgg 

caggcccggc 

cagctgggtg 

gtgctgtcgc 

cccgttgcca 

ggtgcctatg 

gtggcccagt 

gtcctccagg 

accactgagg 

gaagagattc 

aggtcgggcg 

ggactctacg 

ctgagtgata 

ggcggtgccc 

cagagcgaag 

caggaggatg 

gagacggtgg 

gagcagacag 

atgggtcagg 

ctgtagcacg 

gtggaaagtg 

cttgggctga 

ccctgggtgt 

tgagatgtga 

cagacaggga 

gaaaggctaa 

tttaatccca 

tctacaaagt 

aaaaaaaaaa 



cagctctacc 

tcttgccagt 

ctcggggcag 

agatgctcta 

gccctgccaa 

gggccatgga 

ttgctgtaga 

cttccgagga 

aaaaagaggg 

tgctagactg 

tggcaggtcg 

ttaccacacc 

gagataagca 

caggaagtgg 

cgtactatgt 

cgcagtggga 

atgagtgcat 

tgcccgacct 

gagccctcct 

ttggctacaa 

tggacgctca 

cccaacgcct 

tggtccagcc 

gtgcgggcag 

ctgggacagg 

ccgggaaagc 

ctttttctgc 

ctggggaggg 

tggaagagct 

agcctgaaaa 

tgcaggtcct 

agcctgatgc 

ccaaggacat 

ccaaaagtcc 

tttcccaggg 

tccccttccc 

gtgtggagga 

cggtgttaaa 

tcatgtctga 

aggccctggg 

agagcccctg 

tggtggtgcg 

tactggctgg 

atgaggaaag 

agatggatgg 

tcagcttcca 

tggctgaatc 

agagcagtac 

ctgcggacaa 

gctctgagga 

aagtcacggg 

tggctcctgg 

aaggcaaaga 

tgcttttcct 

cacagccaca 

gggcttagat 

ggctgggagt 

cttatctcta 

atggagccag 

acagtgtgtg 

aatggttttc 

gcactcagga 

gagttccagg 

aaagagtgta 



tgccctagac 

gggtgaatcc 

ccctgcctgc 

tgtggtacac 

gctgaccaac 

tgcctgtcac 

cgagaagcta 

tgaaaaccag 

ggaagggaga 

ggtccatggc 

acggcagggg 

ttatgggcgc 

attggacttc 

ggagccaccc 

atacaaggcc 

accccacgag 

acccgagttc 

ggatgtgccg 

ggagagctgg 

actccagggc 

cacccatctg 

ggctggatct 

tattcgggag 

gcttgtcgta 

agaagatgat 

aggtgaccag 

accctcgggg 

tgaagagggc 

ggagaaagtg 

gcctcacgcc 

gggtgtcctg 

acctttgtgg 

ccccgtgtct 

catggtgtcg 

attaccccca 

tccatacttc 

tgaggtccag 

tgacatcact 

ggagcacacg 

ccccaaaaat 

ccgcctgcat 

gctgggcttg 

ggtggaggct 

tgagctcccg 

gcagccggct 

tgaccaggcc 

tccacagccc 

cagcgaagcc 

gaacagcgtc 

agaggaggag 

aacatccgag 

tgatgggaga 

acaaaagatc 

tcaggctctc 

gtcagacaca 

ttagctttca 

ggagttgagt 

gccccagaag 

aactggccgg 

tgtacctctg 

ttaagagagt 

ggcagaggca 

acagccaggg 

gaagggtgga 



tctatacgac 
atcccatgtc 
cccagccttt 
ccttatgtgc 
agccaagcca 
cgccaggggc 
tgcagtgagc 
gagggctctg 
actgagtgtc 
cgaatcagca 
gatcccaact 
ttccgagacc 
acctatgaga 
catgttcctc 
cgtcgcacac 
tatcctgcca 
tacacggacc 
gcctggtgca 
gaggtgtccc 
aaagaagctg 
accagctatg 
cctgccctgg 
gccacaggcc 
gaggccactc 
ttagaacagg 
ccaggctctt 
tctcgaccag 
aagattgtcc 
ggtaacttcc 
cagccacctg 
ttggctgaga 
gtacgctttg 
ctgcagcctg 
aagaagggca 
cccagcccag 
ccagcactgc 
ggtcgggagc 
cccgagggct 
gctgtgtaca 
gccaacaagt 
ggccgcttct 
caggccttcc 
tcccaggagg 
gtgtccgggc 
gcttcctcag 
gacctgccgg 
caggaggctg 
tcccagggcg 
aagtcagggg 
gaggaaggct 
ctcactctgt 
gacagagaag 
ctccttggtg 
atatgctggg 
gtgcatggaa 
ggagacagaa 
ggtagagcac 
aagtattaag 
aacagtcggg 
aggcfcctcat 
ccagaagggc 
ggcggatttt 
ctatacagag 
agccagggac 



aagctctcca 
tatcaaatgt 
tgcgagctga 
aattctccct 
aggtgctctt 
tggcctgtgg 
tccggctgga 
aagagaaaaa 
ccacctgcca 
acttccacta 
atcacccagt 
ttcgtaaatc 
tgacccggca 
accacatctc 
cgcgctcggt 
ccatggagcg 
cctctatctt 
gttctaacca 
aagacctgca 
tgaaggagaa 
gcgtggtaca 
cccctgaacc 
aggaggacat 
catgtgagac 
ctacagaagc 
cctccagtca 
gccgtaggag 
ttccagaggg 
tggccaaagg 
tgcacctgca 
tggtgtttgc 
aggctgttcg 
tgctagacac 
agctagaccc 
cccagctcct 
acaagttcat 
tggcgtttgc 
tagagatcct 
cagcctggta 
acctcctgaa 
acctgtacac 
tcacccacct 
agggcaaagg 
ctggctcctg 
gactggggct 
acacggagga 
aggccgtgag 
aggagagggg 
acagcagcca 
gtgtggtgtt 
ctgacacgat 
aggaagagga 
agcccgtggg 
tgtgggtcca 
tgtggaagtg 
agctccttta 
ttgtctggta 
aaataaaagc 
tggaagtggg 
ggttccatca 
tgggcttggt 
tgagttcgag 
aaaccctgtc 
aagtctgtac 
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aagaaggaac 
ccagtttctg 
ttgctagcag 
ttatctgatg 
tggctgtctg 
cgcctgctga 
ccagccaggc 
cactccctga 
aagagcccag 
ctgccctcca 
gttcaccgtc 
gccagtccta 
cctgtatgga 
tccctggttc 
gtcttggggg 
tctaggtagc 
ggctgctggc 
tcatggacat 
cctccttcgt 
ccaggaaggg 
ttgggggtag 
tctgagacta 
tctgacttaa 
cactttccca 
tgggggctgg 
ggttagartc 
ccaccctcct 
aaaaaacacc 
ctctgactaa 
agccttacat 
aagggctggg 
gtcctatgcg 
atggtccagc 
catgagcttc 
ggccaggcca 
ctggatccta 
cgacgaccag 
gcgtacacaa 
cacagagttg 
tcagcaatgg 
ctttaactcc 
ttgctcaggc 
tgtgagtgtg 
catctctcta 
ttttgtagac 
gctggccttg 
gtgtaccact 
cctcccctgt 
cgtaggaatt 
ttatctatct 
acttggggag 
gcaggaactc 
tggcttgctc 
tatgcctggc 
cccctgcctc 
gggtggcact 
cctccatagg 
ctgtagctga 
accaaacaaa 
cagcaggaaa 
gatggtctcc 
ttatgggatg 
gaatattttg 
teattgtgtg 



ttgggagcat 
gggaccttga 
agccacagcc 
agctcaggac 
ccaagcttgg 
catcttgtta 
ttctgcccat 
gcagggcctg 
tattgaatga 
ctggcttctc 
agcagtgatg 
ggtgacatcg 
gaacccgttc 
gtcaaacccc 
ttccaggagg 
cccagggagc 
agcggtgaca 
tctgccccgc 
cacagggtag 
ctaggaagct 
tggcacaggt 
gcctggtcta 
gaaaagagtc 
gcatgcagga 
aaagaatagc 
ccagctgcca 
tcggtctctg 
catacacata 
gctacactgc 
gcagctcctg 
aactcttgca 
tgaaaaccat 
agcacctgag 
ggcagcaggt 
gggcagtgga 
agggctgtac 
tggaccccac 
tctacgtacc 
gtgactacat 
aatcttttta 
agctccttcc 
ttcttncatg 
gaggtgacag 
gatccagttt 
agggtctcfcc 
agctcacaga 
cctgcctggc 
gctgggatta 
ttaggatcag 
tagttaggat 
gaaaaggttt 
gaggcagaag 
agtctggtgt 
tgtcctagaa 
tgcatctcaa 
acccactgta 
ccaatctggt 
tgccaaattg 
ccaaacaact 
ggctcaggag 
agaagctacc 
ggcgctgtac 
ctcctacaga 
tctgatgcta 



tgccgaaagg 
ccatggctag 
gagttggctg 
cttttcctgc 
ccccacagta 
tgttggtaag 
ccttagccct 
ggtctcaccc 
atagaagcca 
aacgctgctg 
acacccctcc 
tgtcggggcc 
tcacctacca 
ggcttggggg 
tctgtggggt 
aactcaaacc 
ctgacgcaga 
attagccacg 
gcccctgctg 
cagggagaag 
ctttcattcc 
cagagagagc 
gtggttcatt 
ggagctatgc 
tcagggttta 
catggtagct 
tgggcactgc 
aaattagacc 
ttccctgtgc 
ttatttgaag 
ctttgattct 
cagtctcatc 
tgagccagtg 
aggcaggcag 
cccactgaat 
tgagggccag 
cctgctggaa 
tttctcctgc 
ctcttccctg 
tttttatttt 
agcttccacc 
ttcttatttt 
aataacagtt 
ttggttttgt 
tgtgtagcct 
gattcacctg 
tttgtttttg 
caggtgtgtg 
agtgaccaga 
ctctgttgaa 
attttgtctg 
ctgaagcaaa 
accccctccc 
ctcactctgt 
gtgctaagat 
aattggtccc 

ataaaacaaa 
aaccaagaca 
ctggccattc 
tgcaaaggaa 
tgggcttctc 
attgtaagtt 
gaatggtgcc 



atgacctctc 
gtgaatggac 
ggtggggtgg 
cctgcagata 
gcctctcgcc 
gtctgtggtt 
ctctaggcga 
attaagctgg 
ccccacagtc 
cccttccttg 
actgaatgcc 
tgtgctcagc 
gtacctgccc 
tgggggcaag 
gacctgtccc 
ccagccgact 
aaatcatcgt 
aggtcttgct 
cttgggagag 
cagataccgg 
agcacccaga 
tccaggctat 
ctgggttgtg 
ttgagttcca 
agagcactgg 
cataaccatc 
aaacatgtgc 
aaaaaagttc 
ctcagtt tec 
ttcctggtaa 
aggttcccca 
gccctcatct 
gccaccttct 
cttctgggct 
ctgtggtctt 
ctgecagagg 
gagctgeaga 
ctgttgggta 
gggtgggccc 
gacatggggt 
gtctcctgtt 
taaatgacct 
ggggggtcag 
gggcttttat 
ggccatcctg 
cctctgcctc 
ttaaccacca 
ctatcacacc 
tttggtccta 
aatcatggtg 
acaactctca 
agccatggaa 
cacccacctc 
agactaggct 
taaaggcggg 
ttcccatatc 
tctcagttga 
tctcaaacca 
gtgacttata 
aagtctgggg 
tgaacagctt 
tctgagtagg 
cccagaggca 
tggcatacac 



tgeaggtect 
ccaggatggg 
ggtggggtgg 
cagcctgcaa 
atgtggcccg 
agtgctggag 
ctccttccct 
gttttcttgg 
teagaaggqe 
gtagggecca 
ggcaacatct 
tgcctcctcc 
tacatcagct 
gatccaagga 
tccctcatct 
gaacagccgc 
atacctctct 
gcctgtgctt 
ccacctggct 
cctgagtcat 
ggagggcaag 
ctaaggctcc 
ggtgtggctt 
gcccttcaga 
ttgctcttcc 
eggcagttet 
acagacatac 
atgttctctc 
tcccctggtc 
attggtcaag 
gtggggccca 
gettgegcat 
tccaagtctt 
gggtgggcca 
cctacccgca 
cgaccttctc 
aggtgttcac 
ttgcccatca 
egatgettte 
ctcatttagc 
ggcattgtag 
gtgtgtgtgt 
cagatgeett 
gtgtgtgttt 
gaactcattc 
ccagtgctgg 
tcctcctgcc 
cagctaacag 
gggeccaatt 
gaacatcatt 
ggtcaccaag 
gaactctggc 
cccacaatgg 
ggcctcaaac 
tgccatcacc 
agttgttaaa 
gggtttctct 
ccaccaccaa 
aagagaatct 
aacagaatgt 
gctgggtttt 
atgggccacc 
ggacacatct 
gtgtgtgtct 



geccgaggag 

atggtcaggc 

gaagggtgag 

gatggtcege 

gaacctgctg 

accaggttcc 

aacttcccag 

gtaagtgggg 

ggcttccctc 

ctcgacagca 

accagaagag 

acattgecta 

acctggtcag 

ecagccccag 

attctgtggt 

aaggaggccg 

gacacgaccc 

ggcttcctca 

gagggggece 

ggttctgatg 

tttctgtgag 

atagtaagac 

ggtgatggga 

aaaacaaaaa 

agaggatcca 

atggaacctg 

atgcaggcag 

ctacctgtag 

tggactgatc 

tccttcaggg 

ggcccggact 

egggcaggag 

ctctcatctg 

ggccaggcca 

ggatctgeca 

tgatgggcag 

cctggaaatg 

cgttcctttg 

acctccagag 

ccaggctgac 

tcatgtggca 

gtgatatctg 

gectgatgag 

gtttgtctgt 

agtagaccaa 

gattaaaggc 

tcagcatctg 

tggatttaaa 

tccacagtga 

accaaatgea 

ggaagtcagg 

ttgttcctca 

tttctctget 

tcaagagatc 

cctgccccag 

taagaaaact 

tctcaaatga 

caacaataaa 

gaacattttc 

aggggaatat 

gtggcttccc 

ctgtagttgg 

gtcttattct 

ctatagagac 
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agcactcatg 
gacgagaact 
ctaagctctg 
ccctgtaggt 
agggctctat 
catggctagt 
ccactcaggg 
gccccagagt 
caacaggaat 
cgggaactgg 
ccacttccac 
ggccgccctg 
ctggccgctg 
ccagcaccgc 
ctgtgatggg 
tgttcgacgg 
acgtacaatg 
ggacttagaa 
tcagttgctg 
agtatggctt 
tggaggaggg 
cagactgttc 
tgagattgtg 
tttgaagtag 
aagacccttc 
atgcctgccc 
gactgcagga 
tctaggtacc 
cagccctgtg 
ggagggtcag 
gcagtgatgg 
tgggtggagg 
gtgttgtggc 
ttctacgagg 
tgaggtccta 
ggtgtccctt 
agctcaggaa 
cctgatgctc 
ggatcagggg 
ttactgcctg 
gcaggaacag 
cagagctcag 
gcagtgggaa 
acagggaagg 
gtctctaaat 
tagtcccagc 
atgccaggtg 
cagggggttc 
acacacacga 
catgttagta 
tttatttttt 
atcgcaacac 
acagagcaag 
acaaaatatt 
ggctgtcctg 
tgcttctgcc 
ttatagattt 
aggggtcaga 
ccatgctggg 
ctagagtttt 
tttcgagaca 
gctggcctcg 
gtgcgccacc 
ttgttttgtt 



tctacgtatc 
taataaccca 
ggtagatagc 
gacatcatcc 
ctggaaagca 
gccggccctg 
acctttggga 
cctgggccac 
gaagacaacg 
ctggcgtact 
cagatccgcc 
agcagtgaag 
tacaactatg 
aaaagcgtct 
gcagtgcacg 
ctgctttact 
gcgtattttg 
tctgtgggac 
caggcgtctg 
tggcttcttg 
tcagggcaga 
tggggctgga 
ttttctgagg 
ccagggctct 
gcacagtgga 
cacacaccag 
agccaggctt 
agtcaggaaa 
tgggtggggc 
gcctactggt 
atggcgacgg 
gctgaaccct 
tggcttctcc 
ctggccagcc 
tcctttcatt 
ataactactc 
ccagctgcac 
cacagcgatg 
tcacgccctg 
gagaccactg 
gcgcaggcct 
caggttctgg 
ctctcctccc 
caggctgccc 
cccaacattc 
tctcggagtc 
ttctttgaga 
aacacccttt 
acacaaataa 
cgattggatg 
ggtcttaaaa 
tcaggagcag 
tttcagggca 
taaaagattt 
gaactcactg 
tcctaagtgc 
ggtgttttgc 
ggcgagtatc 
actgggtcat 
tttcccccct 
gggtttctct 
aactcagaaa 
acgcctggct 
tttttctttt 



gataaaggaa 
gagtagccca 
taccttgcca 
ggaaaatcat 
tgagcccgag 
aatgggaccc 
gtgtcctggt 
tgggctccct 
ccctgaagcg 
ggcagtacga 
tgcagagctt 
acttctttct 
gggacgggac 
tctacgtggg 
tctgggaccc 
gtgccttagc 
acggggaaga 
ccaagtttga 
ccttctacct 
tacgacgtca 
cggaggtcgc 
cccttgactt 
gtcactgggg 
ttgatagcca 
tccttcagac 
catcaccatg 
gcaggtcagg 
gacactcagg 
gctggcacgg 
ttcgtgggcc 
gcacttgaac 
gggcttgttc 
tcgggcttca 
catgaagggg 
tctacttagg 
ggggacattc 
aggaggcagg 
ttctaatgag 
tccgtgaccc 
agcagagtcc 
gcgcatcgtg 
gaggggagaa 
ccgccgtcca 
ccagctctat 
taagaaagtg 
ttctctgagc 
ggacctgagt 
atggcttctg 
aaataaatgt 
tggcagtgcc 
ttttaaaaga 
aggcaggtgg 
gccaaggtta 
acttacttct 
tatagaccag 
tgagattaaa 
ctgcttgtgt 
agctcttcct 
ttgcaagagt 
ggctgtcctg 
acataggcct 
tccgcctgcc 
cagaatctgt 
ctttttttgt 



gctgttttgg 
gtcagtacct 
tcttccctga 
ccccaaccat 
ctctcgaaac 
tcagagtggg 
tggaaatcgc 
ctctggagtg 
ggagcl'gcct 
gatcggtgtg 
cccagggcac 
gagtggcagc 
caatgagacg 
ccagcttgag 
cttcacaggt 
caggcctctg 
ttcagtgagg 
atccactccc 
gtaagaacca 
cctgtcgtcg 
agctagttag 
cactgtggaa 
aatgggatgc 
ctaagtcccc 
agccgggtgc 
gccagctccg 
aggggtgcag 
ggactccacc 
atggggcttt 
tgaatggggt 
acaatctcct 
gctcgttggc 
tggtgctcct 
acattctaca 
gcctggtctg 
agtggggtgg 
ctaggggcag 
taacccttgt 
agttcaggtt 
atgccccctg 
ttcctgcctc 
atagggctgc 
gatggaggtg 
ccaggacccc 
tcaggatggc 
accagttctt 
ttggttctca 
tggacacata 
taaaagaaga 
tcacagaatc 
ttggtttagg 
atctctatgg 
cacagagaaa 
tatttgagct 
gctggcacct 
gtagtgttat 
atatatgcac 
ggaactagag 
tacaagtact 
aagtagaatc 
ggctgtcctg 
tctggctctc 
ttttaaatga 
ttttgttttt 



ggggaggaaa 
tgccttggct 
tcttagaact 
gagttggtcg 
ccagccagca 
agctgtctcc 
atccagatcc 
ggtagtagcg 
cggagtgcqc 
agccag.cagg 
acgggggccg 
aaggaccgga 
gcttcccgcc 
gccccgcagt 
gagcgggccc 
ggaacgggac 
caggaagaga 
ccaacttacc 
aaaatttaga 
ttgtaaagag 
agcatgctat 
gcagcaagat 
aggtgtgggg 
agatgtgtcc 
ccctgacggc 
actccactct 
ttcctgggct 
aggaacgctg 
tctcttccgg 
gagctgcagt 
cctatagcat 
cgtcagcccc 
agatacccgc 
gatcaaggtg 
ggagaggaca 
gaaaatggcc 
gaatcagggc 
ccatatttgt 
aaataaagcc 
ctgggctgtc 
ctatattcaa 
ttgtgggagg 
aaagacaggc 
aggggaccct 
tctggggtca 
ttcctatggg 
gcactgtcta 
ttcttatgtg 
cagcggcacc 
tgtgggactt 
agtggtggta 
gtttgaggcc 
ctctttctca 
aggatctccc 
taaactcacg 
accatgcccc 
taccttcatg 
ttatggaagg 
tctgagccat 
tgttcttgtt 
gaactcactc 
agaatactgg 
gagtaatagt 
tggctcattt 



caggcttaca 
tctgttgttt 
ttccccactc 
gggagctggc 
tggaacccac 
aggacgatgg 
ctgactctca 
gaggcctcag 
atgggctgag 
atgcccactt 
tcaaatgcgt 
ctgtgcgcct 
tcatctatgc 
atgtggtgag 
aggtgaggcc 
ctagtgcgaa 
agaagagtca 
agcaatcggc 
agattccacg 
aagtatcgag 
gtgaagagag 
gagaaagccc 
tgagttggag 
tttttcagga 
tgtggctgtc 
gcgctttgtg 
actgggggtc 
cagtgacagg 
ggatggagtg 
agggtgggtg 
gagttccgac 
agtggccgga 
acgggcctgg 
actgactgcc 
ggtttatgct 
ctcgtaggcc 
tagaactgac 
cttgcttgga 
aggaggctgt 
ctgatggggg 
tcatagacct 
atttctccct 
actgttgctt 
gggtctcagt 
tcctgggtgt 
aagtaaggac 
gctctggctc 
gcctgcacgc 
ttgtacctca 
tatttattta 
cacaccatta 
agcctggtct 
aaaaataaaa 
tattagccct 
aagatcctcc 
actattttct 
cagtgctaat 
ttgggagtca 
ctccagcccc 
ttgttttgtt 
tgtagaccag 
aattaaaggt 
tacaggtttt 
gttttatttg 
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ttttgagaca 
ctgactttaa 
tctgatggtt 
atatcaccca 
cacagagatc 
ggtgagtacc 
gggcagcgtg 
acagaagccc 
cggcagcgag 
gccaccctct 
tctggctttg 
cctcctggca 
ggcccgctcc 
tacccacatg 
gcttcatgaa 
tgtgatcccc 
aggaaatgaa 
ctctgtttct 
ccaggctcag 
attcattcag 
gggaaggggt 
cctaaccaca 
ggacaggagg 
tctgccatct 
actagttgca 
tgaggctgtg 
ccaaggagct 
tggcttggct 
ccttcctgct 
ttttttaaag 
caccatactt 
cgaagttaat 
gcaagagcag 
ttgatacaag 
cctacctacc 
catgcccacc 
aaacacagca 
tgggttgggt 
gctgaagagg 
tggaacctag 
gctcctgcag 
tggtggtggt 



ggtctcactc 
acttgttttc 
gttaataaca 
gacagtggct 
tgcctcccga 
caagcctcca 
ctcatcagct 
acgcaccact 
gtggtcaccg 
caggccacca 
ctgcccacga 
tagggccagc 
tcattcttga 
gcctgctgtc 
actcgggaga 
attgctcccc 
aggagttggg 
cccttgggta 
gagggctggc 
ccaggtcttc 
tatatgggcc 
gcgtttgtgc 
cagcgggaat 
cttgagtgtt 
gtctatgtaa 
ttgtgttttg 
ggggaactgg 
ttttttcaga 
tgaaacagag 
gcattgattt 
gtgtctggtc 
gaatggttat 
ccagtgttct 
gtctggtagc 
ctcttctctc 
agtttgcctt 
agagggtaga 
ggccttgggt 
tgggaccctg 
gctggaggga 
cctccccttc 
ggtggtggtg 



tgaaccccta 
tgagtgctgg 
atattattta 
ggtctggaaa 
gttcagaaat 
ttgatgctat 
cctcttccga 
acaagtcagc 
gcactgtagc 
caaagctcag 
aacgccacct 
caggagttgg 
tctcgaagcc 
tgggattgca 
tactgttcct 
agttctgcct 
cctttgctca 
ggaaactgct 
caggacctac 
agtatgggac 
cagctggtcc 
cttcagccgg 
catctctgta 
cgtttggaga 
ataaggtcaa 
ctttgcctag 
tatatgtcac 
tggtctcaag 
tagtgggctt 
ttatacgtgt 
cctgcggagg 
gagccacatc 
tacccactga 
tcaaacttga 
taccttccaa 
gatggcacca 
ggcagccatt 
cagttttccg 
ggagggccgg 
gccaaccagg 
ccttgggtag 
gtgggggggg 



ggtggcctgg 
atttatgggc 
gcttttcttt 
tcactgtgta 
taaaagcact 
ctgtccctcc 
ccattccttg 
gtccgaccca 
caacaagatt 
ttccgagaac 
cctgctgggc 
ctgagggcag 
gattcttcta 
cagctcctga 
aaccagcaag 
tctggattca 
gccagcttcc 
cctgtctagg 
gactttcagg 
cagaaaaaag 
aggaatgaat 
ggaggcagcc 
tctcgggttc 
agcctgagat 
taaattcttt 
gctgggctca 
tgtatatgtc 
tgcctcaggt 
ataggcatga 
atggttgttt 
tcagaagagg 
tcgatgctga 
gccatctcta 
tatgcagccg 
gtgctgatat 
gagtcaggaa 
gtcctctggc 
aatggttcac 
cctccagctg 
atcctggact 
ctgtggtggt 
gggngnngnt 



agcttgctat 
ttgtgctatt 
tggagatagg 
ggccaggctg 
ctgggatggt 
cgctctctgc 
actgtttgga 
atccacacct 
ggtgtctgtt 
ttccgtggca 
tcggacaatg 
ggcgagatga 
ggcaagcccc 
atctccaaag 
aattggggca 
catggggaca 
tctagccacg 
gttctgatgg 
gcttggtctg 
gggatgtgag 
ccatgccttg 
cttgggacca 
tgcccagggg 
gtggcccctg 
ggaagagcca 
ggcagctctg 
actgtgcctg 
tggccttgat 
cccaccaggt 
tgcccacttg 
gcatcgggat 
agattgaacc 
agccccacac 
aggaggttga 
tatacatagg 
agtccaaacc 
tgccttggat 
ccttggggaa 
ggtctctcca 
tgctacagtt 
ggtggtggtg 



gtagaacaca 
ttgcccagcc 
ctctcactgt 
accttgaatt 
tttggagttt 
aggctgtaga 
aggagctgga 
ttgacctgta 
ccctgcttga 
cgctcactag 
gcatcatccg 
catctctcag 
aggctctggc 
ccttgaagtg 
aggaaagcac 
gggcagctcc 
ctctccttag 
tactgggact 
gggttttagc 
aacagggcta 
ccttggtacc 
gcatccctag 
atgggcagac 
ctgccttctc 
cggagctgag 
cctcagcctc 
gcttatggct 
cttgggatga 
ccaattttta 
tacatatgca 
cacctggaac 
tggatccttt 
ccagcttctt 
cctggtattc 
catggatagt 
tggtagttgc 
acagagcttc 
agggaacact 
gccctcgcct 
aggtgaacag 
gtggtggtgg 



213540 
213600 
213660 
213720 
213780 
213840 
213900 
213960 
214020 
214080 
214140 
214200 
214260 
214320 
214380 
214440 
214500 
214560 
214620 
214680 
214740 
214800 
214860 
214920 
214980 
215040 
215100 
215160 
215220 
215280 
215340 
215400 
215460 
215520 
215580 
215640 
215700 
215760 
215820 
215880 
215940 
215980 



<210> 17 
<211> 473 
<212> PRT 
<213> Mus sp. 

<400> 17 

Met Lys Arg Ala Ser Ser Gly Gly Ser Arg Leu Leu Ala Trp Val Leu 
15 10 15 

Trp Leu Gin Ala Trp Arg Val Ala Thr Pro Cys Pro Gly Ala Cys Val 
20 25 30 

Cys Tyr Asn Glu Pro Lys Val Thr Thr Ser Cys Pro Gin Gin Gly Leu 
35 40 45 

Gin Ala Val Pro Thr Gly lie Pro Ala Ser Ser Gin Arg lie Phe Leu 
50 55 60 

His Gly Asn Arg lie Ser His Val Pro Ala Ala Ser Phe Gin Ser Cys 
65 70 75 80 
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Arg Asn Leu Thr lie Leu Trp Leu His Ser Asn Ala Leu Ala Arg lie 
85 90 95 

Asp Ala Ala Ala Phe Thr Gly Leu Thr Leu Leu Glu Gin Leu Asp Leu 
100 105 110 

Ser Asp Asn Ala Gin Leu His Val Val Asp Pro Thr Thr Phe His Gly 
115 120 125 

Leu Gly His Leu His Thr Leu His Leu Asp Arg Cys Gly Leu Arg Glu 
130 135 140 

Leu Gly Pro Gly Leu Phe Arg Gly Leu Ala Ala Leu Gin Tyr Leu Tyr 
145 150 ~ 155 160 

Leu Gin Asp Asn Asn Leu Gin Ala Leu Pro Asp Asn Thr Phe Arg Asp 
165 170 175 

Leu Gly Asn Leu Thr His Leu Phe Leu His Gly Asn Arg lie Pro Ser 
180 185 190 

Val Pro Glu His Ala Phe Arg Gly Leu His Ser Leu Asp Arg Leu Leu 
195 200 205 

Leu His Gin Asn His Val Ala Arg Val His Pro His Ala Phe Arg Asp 
210 215 220 

Leu Gly Arg Leu Met Thr Leu Tyr Leu Phe Ala Asn Asn Leu Ser Met 
225 230 235' 240 

Leu Pro Ala Glu Val Leu Met Pro Leu Arg Ser Leu Gin Tyr Leu Arg 
245 250 255 

Leu Asn Asp Asn Pro Trp Val Cys Asp Cys Arg Ala Arg Pro Leu Trp 
260 265 270 

Ala Trp Leu Gin Lys Phe Arg Gly Ser Ser Ser Glu Val Pro Cys Asn 
275 280 285 

Leu Pro Gin Arg Leu Ala Asp Arg Asp Leu Lys Arg Leu Ala Ala Ser 
290 295 300 

Asp Leu Glu Gly Cys Ala Val Ala Ser Gly Pro Phe Arg Pro lie Gin 
305 310 315 " 320 

Thr Ser Gin Leu Thr Asp Glu Glu Leu Leu Ser Leu Pro Lys Cys Cys 
325 330 " 335 

Gin Pro Asp Ala Ala Asp Lys Ala Ser Val Leu Glu Pro Gly Arg Pro 
340 345 350 

Ala Ser Ala Gly Asn Ala Leu Lys Gly Arg Val Pro Pro Gly Asp Thr 
355 360 365 

Pro Pro Gly Asn Gly Ser Gly Pro Arg His lie Asn Asp Ser Pro Phe 
370 375 380 

Gly Thr Leu Pro Ser Ser Ala Glu Pro Pro Leu Thr Ala Leu Arg Pro 
385 390 395 400 

Gly Gly Ser Glu Pro Pro Gly Leu Pro Thr Thr Gly Pro Arg Arg Arg 
405 410 415 

Pro Gly Cys Ser Arg Lys Asn Arg Thr Arg Ser His Cys Arg Leu Gly 
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420 

Gin Ala Gly Ser Gly Ala Ser Gly 
435 440 

Ala Leu Pro Ala Leu Ala Cys Ser 
450 455 

Val Leu Trp Thr Val Leu Gly Pro 
465 470 



425 430 

Thr Gly Asp Ala Glu Gly Ser Gly 
445 

Leu Ala Pro Leu Gly Leu Ala Leu 
4 60 

Cys 
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